Privacy Preserving Big Data Mining: Association Rule Hiding

Journal Title: Journal of Information Systems and Telecommunication - Year 2016, Vol 4, Issue 2

Abstract

Data repositories contain sensitive information which must be protected from unauthorized access. Existing data mining techniques can be considered a privacy threat to sensitive data. Association rule mining is one of the most widely used data mining techniques; it tries to uncover relationships between seemingly unrelated data in a database. Association rule hiding is a research area in privacy preserving data mining (PPDM) which addresses the problem of hiding sensitive rules within the data. Much research has been done in this area, but most of it focuses on reducing the undesired side effects of deleting sensitive association rules in static databases. However, in the age of big data, we are confronted with dynamic databases into which new data may arrive at any time. Most existing techniques are therefore not practical and must be updated in order to be appropriate for such high-volume databases. In this paper, a data anonymization technique is used for association rule hiding, while parallelization and scalability features are also embedded in the proposed model in order to speed up the big data mining process. In this way, instead of removing some instances of an existing important association rule, generalization is used to anonymize items at an appropriate level. So, if necessary, important association rules can be updated based on newly arriving data. We have conducted experiments using three datasets in order to evaluate the performance of the proposed model in comparison with Max-Min2 and HSCRIL. Experimental results show that the information loss of the proposed model is lower than that of existing approaches in this area, and that the model can be executed in parallel to reduce execution time.
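The idea of hiding a rule by generalization rather than deletion can be illustrated with a minimal sketch. This is not the authors' algorithm: the taxonomy, the item names, the transactions, and the helper functions `generalize`, `support`, and `confidence` are all hypothetical and chosen only for illustration. The sketch replaces every item under one taxonomy parent with that parent, then compares the confidence of the sensitive rule before and after.

```python
# Hypothetical item taxonomy (child -> parent); not taken from the paper.
TAXONOMY = {
    "insulin": "medication",
    "aspirin": "medication",
}

def generalize(transaction, parent, taxonomy):
    """Replace every item whose taxonomy parent is `parent` with that parent."""
    return frozenset(
        parent if taxonomy.get(item) == parent else item
        for item in transaction
    )

def support(transactions, itemset):
    """Fraction of transactions that contain every item in `itemset`."""
    itemset = frozenset(itemset)
    return sum(itemset <= t for t in transactions) / len(transactions)

def confidence(transactions, antecedent, consequent):
    """Confidence of antecedent -> consequent; 0.0 if the antecedent never occurs."""
    s_ante = support(transactions, antecedent)
    if s_ante == 0:
        return 0.0
    return support(transactions, set(antecedent) | set(consequent)) / s_ante

# Toy data (assumed). The sensitive rule is insulin -> glucose_meter.
original = [
    frozenset({"insulin", "glucose_meter"}),
    frozenset({"insulin", "glucose_meter", "bread"}),
    frozenset({"insulin", "bread"}),
    frozenset({"aspirin", "bread"}),
]

# Each transaction is generalized independently of the others.
anonymized = [generalize(t, "medication", TAXONOMY) for t in original]

before = confidence(original, {"insulin"}, {"glucose_meter"})
after_specific = confidence(anonymized, {"insulin"}, {"glucose_meter"})
after_general = confidence(anonymized, {"medication"}, {"glucose_meter"})

print(f"insulin -> glucose_meter, original data:      {before:.3f}")
print(f"insulin -> glucose_meter, generalized data:   {after_specific:.3f}")
print(f"medication -> glucose_meter, generalized data: {after_general:.3f}")
```

In this toy setting no transaction is deleted: the item-level rule can no longer be mined, while a coarser rule at the category level remains available. Because each transaction is transformed on its own, the generalization step could in principle be distributed across workers, which is in the spirit of the parallel execution the abstract mentions.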

Authors and Affiliations

Golnar Assadat Afzali, Shahriar Mohammadi

  • EP ID EP184430
  • DOI 10.7508/jist.2016.02.001

How To Cite

Golnar Assadat Afzali, Shahriar Mohammadi (2016). Privacy Preserving Big Data Mining: Association Rule Hiding. Journal of Information Systems and Telecommunication, 4(2), 70-77. https://europub.co.uk/articles/-A-184430