Privacy Assessment Methodology for Machine Learning Models and Data Sources

Publications

Privacy Assessment Methodology for Machine Learning Models and Data Sources

Privacy Assessment Methodology for Machine Learning Models and Data Sources
Rudolf Erdei, Emil Pasca, Daniela Delinschi, Anca Avram, Ionela Chereja, Oliviu Matei

Abstract. The widespread use of machine learning amplifies privacy risks both at the level of training data and at the level of the resulting models. This paper proposes a methodology for the joint privacy assessment of machine learning models and the data sources used to build them. The approach combines a structured inventory of data sources, threat scenarios and privacy-relevant model properties (memorisation, leakage potential, re-identification risk) with quantitative indicators that can be computed during the model lifecycle. The methodology supports compliance with privacy regulations and enables informed trade-offs between utility and privacy, with case studies drawn from agricultural and IoT data domains.

Keywords: privacy assessment; machine learning; data sources; privacy risk; data protection

📋 Cite this publication



Rudolf Erdei, Emil Pasca, Daniela Delinschi, Anca Avram, Ionela Chereja, Oliviu Matei, "Privacy Assessment Methodology for Machine Learning Models and Data Sources", Proc. 19th SOCO Int. Conf. on Soft Computing Models in Industrial and Environmental Applications, Springer, 2024, 2023.


Reference: Proc. 19th SOCO Int. Conf. on Soft Computing Models in Industrial and Environmental Applications, Springer, 2024.

Evaluation of Feature Selection Methods in Estimation  of Precipitation Based on Deep Learning Artificial  Neural Networks

Evaluation of Feature Selection Methods in Estimation of Precipitation Based on Deep Learning Artificial Neural Networks

Precipitation is the most important element of the water cycle and an indispensable element of water resources management. This paper aims to model the monthly precipitation in 8 precipitation observation stations. The effects and role of different feature weights pre-processing methods (Weight by deviation, Weight by PCA, Weight by correlation, and Weight by Support Vector Machine) on artificial intelligence modeling were investigated.

read more
A Comparison of different crossover operators in genetic algorithms for clusters shortest-path tree problem

A Comparison of different crossover operators in genetic algorithms for clusters shortest-path tree problem

The clustered shortest-path tree (CluSPT) problem is an extension of the classical shortest path problem, given a graph with the nodes partitioned into several mutually exclusive and collectively exhaustive clusters looks for a shortest-path spanning tree from a predefined source node to all the other nodes of the graph, with the property that every cluster should generate a connected subgraph.

read more
A comprehensive survey on the generalized traveling salesman problem

A comprehensive survey on the generalized traveling salesman problem

The generalized traveling salesman problem (GTSP) is an extension of the classical traveling salesman
problem (TSP), and it is among the most researched combinatorial optimization problems due to its theoretical properties, complexity aspects, and real-life applications in various areas: location-routing problems, material flow design problem, distribution of medical supplies, urban waste collection management, airport selection and routing the courier airplanes, image retrieval and ranking, digital garment manufacturing, etc.

read more

Other publications

0 Comments