Evaluating applicability of perturbation techniques for privacy preserving data mining by descriptive statistics

Data mining is a computational process that identifies patterns, trends and behaviour in large datasets. Because of these advantages, data mining has been applied in many fields, such as finance, healthcare and retail. However, information disclosure has become an issue during the data mining process. Privacy protection is therefore needed during data mining, an approach known as Privacy Preserving Data Mining (PPDM). Several techniques are available in PPDM, and each has its own benefits and drawbacks. In this research, perturbation is selected as the privacy preserving technique. Perturbation alters the original data values before data mining is applied. In PPDM applications, perturbation can protect data privacy, but the accuracy of the data should not be ignored. Three perturbation techniques are selected: additive noise, data swapping and resampling. For data mining, two classification methods are selected: Naïve Bayes and Support Vector Machines (SVM). The experimental results are evaluated in terms of hiding failure, accuracy and precision. Overall, resampling is identified as the best perturbation technique for Naïve Bayes and SVM classification on both the glass and ionosphere datasets.
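The three perturbation techniques named in the abstract above (additive noise, data swapping and resample) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the abstract gives no parameters, so the Gaussian noise model, the `scale` and `fraction` arguments, the per-column swapping scheme and sampling with replacement are all assumptions made for illustration, and the function names are hypothetical.

```python
import numpy as np


def additive_noise(X, scale=0.1, rng=None):
    """Add zero-mean Gaussian noise to every value.

    Assumption: the noise standard deviation is `scale` times each
    column's standard deviation.
    """
    rng = np.random.default_rng(rng)
    sigma = scale * X.std(axis=0)
    return X + rng.normal(0.0, 1.0, size=X.shape) * sigma


def data_swapping(X, fraction=0.2, rng=None):
    """Shuffle values among a random subset of rows, column by column.

    Assumption: `fraction` of the rows take part in the swap for each
    column. Each column keeps exactly the same set of values, so
    per-column statistics are unchanged.
    """
    rng = np.random.default_rng(rng)
    Xp = X.copy()
    n = X.shape[0]
    k = min(n, max(2, int(round(fraction * n))))
    for j in range(X.shape[1]):
        rows = rng.choice(n, size=k, replace=False)
        Xp[rows, j] = Xp[rng.permutation(rows), j]
    return Xp


def resample(X, y, rng=None):
    """Draw a same-sized random sample of records with replacement.

    Assumption: uniform sampling with replacement; features and labels
    are drawn together so each record stays intact.
    """
    rng = np.random.default_rng(rng)
    idx = rng.integers(0, len(X), size=len(X))
    return X[idx], y[idx]
```

A perturbed copy produced by any of these functions could then be passed to a classifier in place of the original data, and the resulting accuracy and precision compared against the unperturbed baseline.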

A framework for ensemble classification and sensitivity analysis in privacy preserving data mining

International Journal of Computational Systems Engineering ◽ 10.1504/ijcsyse.2019.103637 ◽ 2019 ◽ Vol 5 (5/6) ◽ pp. 260-276

Author(s): P. Chandrakanth ◽ M.S. Anbarasi

Keyword(s): Data Mining ◽ Sensitivity Analysis ◽ Privacy Preserving ◽ Ensemble Classification ◽ Privacy Preserving Data Mining

Classification and Evaluation of Privacy Preserving Data Mining Methods

2020 11th International Conference on Information and Knowledge Technology (IKT) ◽ 10.1109/ikt51791.2020.9345620 ◽ 2020

Author(s): Negar Nasiri ◽ MohammadReza Keyvanpour

Keyword(s): Data Mining ◽ Privacy Preserving ◽ Privacy Preserving Data Mining ◽ Mining Methods

Analyzing and Performing Privacy Preserving Data Mining on Medical Databases

Indian Journal of Science and Technology ◽ 10.17485/ijst/2016/v9i17/93024 ◽ 2016 ◽ Vol 9 (17)

Author(s): D. Aruna Kumari ◽ Y. Vineela ◽ T. Mohan Krishna ◽ B. Sai Kumar

Keyword(s): Data Mining ◽ Privacy Preserving ◽ Privacy Preserving Data Mining ◽ Medical Databases

A Perturbation Method Based on Singular Value Decomposition and Feature Selection for Privacy Preserving Data Mining

International Journal of Data Warehousing and Mining ◽ 10.4018/ijdwm.2014010104 ◽ 2014 ◽ Vol 10 (1) ◽ pp. 55-76 ◽ Cited By ~ 1

Author(s): Mohammad Reza Keyvanpour ◽ Somayyeh Seifi Moradi

Keyword(s): Data Mining ◽ Feature Selection ◽ Singular Value Decomposition ◽ Perturbation Method ◽ Privacy Preserving ◽ Singular Value ◽ Privacy Preserving Data Mining ◽ Selection For ◽ Value Decomposition ◽ Different Levels

In this study, a new model for customized privacy in privacy preserving data mining is provided, in which data owners define different privacy levels for different features. Additionally, to improve perturbation methods, a method combining singular value decomposition (SVD) and feature selection is defined so as to benefit from the advantages of both domains. To assess the amount of distortion created by the proposed perturbation method, new distortion criteria are also defined, in which the distortion created during feature selection is considered according to the privacy value of each feature. Tests and analysis of the results show that, compared with previous approaches, the method based on this model improves privacy, the accuracy of mining results and the efficiency of privacy preserving data mining systems.
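The general idea of SVD-based perturbation can be sketched as a low-rank reconstruction of the data matrix. The sketch below is not the method proposed in this paper: the abstract does not describe its feature selection step, its per-feature privacy levels or its distortion criteria, so the plain truncated SVD, the `rank` parameter and the relative Frobenius-norm distortion measure are assumptions chosen for illustration, and the function names are hypothetical.

```python
import numpy as np


def svd_perturb(X, rank):
    """Return a rank-`rank` approximation of X as the perturbed data.

    Assumption: columns are centered before the decomposition and the
    column means are added back afterwards.
    """
    mean = X.mean(axis=0)
    U, s, Vt = np.linalg.svd(X - mean, full_matrices=False)
    # Keep only the `rank` largest singular values and their vectors.
    return (U[:, :rank] * s[:rank]) @ Vt[:rank] + mean


def frobenius_distortion(X, Xp):
    """Relative distortion ||X - Xp||_F / ||X||_F (an assumed measure)."""
    return np.linalg.norm(X - Xp) / np.linalg.norm(X)
```

Lowering `rank` discards more of the fine detail in the records, which increases the distortion and, in this simplified view, trades mining accuracy for privacy.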

Granular computing in privacy-preserving data mining

2008 IEEE International Conference on Granular Computing ◽ 10.1109/grc.2008.4664790 ◽ 2008 ◽ Cited By ~ 2

Author(s): Justin Zhan ◽ Tsau Young Lin

Keyword(s): Data Mining ◽ Granular Computing ◽ Privacy Preserving ◽ Privacy Preserving Data Mining

Privacy-preserving data mining

Proceedings of the 2000 ACM SIGMOD international conference on Management of data - SIGMOD '00 ◽ 10.1145/342009.335438 ◽ 2000 ◽ Cited By ~ 1044

Author(s): Rakesh Agrawal ◽ Ramakrishnan Srikant

Keyword(s): Data Mining ◽ Privacy Preserving ◽ Privacy Preserving Data Mining

Probabilistic Grid-Based Approaches for Privacy-Preserving Data Mining on Moving Object Trajectories

Chapman & Hall/CRC Data Mining and Knowledge Discovery Series - Privacy-Aware Knowledge Discovery ◽ 10.1201/b10373-12 ◽ 2010 ◽ pp. 183-210 ◽ Cited By ~ 1

Author(s): Gyozo Gidofalvi ◽ Xuegang Huang ◽ Torben Bach Pedersen

Keyword(s): Data Mining ◽ Privacy Preserving ◽ Moving Object ◽ Privacy Preserving Data Mining ◽ Object Trajectories ◽ Grid Based ◽ Moving Object Trajectories