Probabilistic Grid-Based Approaches for Privacy-Preserving Data Mining on Moving Object Trajectories

Privacy-preserving data mining (PPDM) has become an interesting and emerging topic in recent years because it helps hide confidential information, while allowing useful knowledge to be discovered at the same time. Data sanitization is a common way to perturb a database, and thus sensitive or confidential information can be hidden. PPDM is not a trivial task and can be concerned an Non-deterministic Polynomial-time (NP)-hard problem. Many algorithms have been studied to derive optimal solutions using the evolutionary process, although most are based on straightforward or single-objective methods used to discover the candidate transactions/items for sanitization. In this paper, we present a multi-objective algorithm using a grid-based method (called GMPSO) to find optimal solutions as candidates for sanitization. The designed GMPSO uses two strategies for updating gbest and pbest during the evolutionary process. Moreover, the pre-large concept is adapted herein to speed up the evolutionary process, and thus multiple database scans during each evolutionary process can be reduced. From the designed GMPSO, multiple Pareto solutions rather than single-objective algorithms can be derived based on Pareto dominance. In addition, the side effects of the sanitization process can be significantly reduced. Experiments have shown that the designed GMPSO achieves better side effects than the previous single-objective algorithm and the NSGA-II-based approach, and the pre-large concept can also help with speeding up the computational cost compared to the NSGA-II-based algorithm.

Download Full-text

A framework for ensemble classification and sensitivity analysis in privacy preserving data mining

International Journal of Computational Systems Engineering ◽

10.1504/ijcsyse.2019.103637 ◽

2019 ◽

Vol 5 (5/6) ◽

pp. 260-276

Author(s):

P. Chandrakanth ◽

M.S. Anbarasi

Keyword(s):

Data Mining ◽

Sensitivity Analysis ◽

Privacy Preserving ◽

Ensemble Classification ◽

Privacy Preserving Data Mining

Download Full-text

Classification and Evaluation of Privacy Preserving Data Mining Methods

2020 11th International Conference on Information and Knowledge Technology (IKT) ◽

10.1109/ikt51791.2020.9345620 ◽

2020 ◽

Author(s):

Negar Nasiri ◽

MohammadReza Keyvanpour

Keyword(s):

Data Mining ◽

Privacy Preserving ◽

Privacy Preserving Data Mining ◽

Mining Methods

Download Full-text

Analyzing and Performing Privacy Preserving Data Mining on Medical Databases

Indian Journal of Science and Technology ◽

10.17485/ijst/2016/v9i17/93024 ◽

2016 ◽

Vol 9 (17) ◽

Author(s):

D. Aruna Kumari ◽

Y. Vineela ◽

T. Mohan Krishna ◽

B. Sai Kumar

Keyword(s):

Data Mining ◽

Privacy Preserving ◽

Privacy Preserving Data Mining ◽

Medical Databases

Download Full-text

A Perturbation Method Based on Singular Value Decomposition and Feature Selection for Privacy Preserving Data Mining

International Journal of Data Warehousing and Mining ◽

10.4018/ijdwm.2014010104 ◽

2014 ◽

Vol 10 (1) ◽

pp. 55-76 ◽

Cited By ~ 1

Author(s):

Mohammad Reza Keyvanpour ◽

Somayyeh Seifi Moradi

Keyword(s):

Data Mining ◽

Feature Selection ◽

Singular Value Decomposition ◽

Perturbation Method ◽

Privacy Preserving ◽

Singular Value ◽

Privacy Preserving Data Mining ◽

Selection For ◽

Value Decomposition ◽

Different Levels

In this study, a new model is provided for customized privacy in privacy preserving data mining in which the data owners define different levels for privacy for different features. Additionally, in order to improve perturbation methods, a method combined of singular value decomposition (SVD) and feature selection methods is defined so as to benefit from the advantages of both domains. Also, to assess the amount of distortion created by the proposed perturbation method, new distortion criteria are defined in which the amount of created distortion in the process of feature selection is considered based on the value of privacy in each feature. Different tests and results analysis show that offered method based on this model compared to previous approaches, caused the improved privacy, accuracy of mining results and efficiency of privacy preserving data mining systems.

Download Full-text