Privacy Preserving Big Data Mining: Association Rule Hiding

Apply

Privacy Preserving Big Data Mining: Association Rule Hiding

Journal Title: Journal of Information Systems and Telecommunication - Year 2016, Vol 4, Issue 2

Abstract

Data repositories contain sensitive information which must be protected from unauthorized access. Existing data mining techniques can be considered as a privacy threat to sensitive data. Association rule mining is one of the utmost data mining techniques which tries to cover relationships between seemingly unrelated data in a data base.. Association rule hiding is a research area in privacy preserving data mining (PPDM) which addresses a solution for hiding sensitive rules within the data problem. Many researches have be done in this area, but most of them focus on reducing undesired side effect of deleting sensitive association rules in static databases. However, in the age of big data, we confront with dynamic data bases with new data entrance at any time. So, most of existing techniques would not be practical and must be updated in order to be appropriate for these huge volume data bases. In this paper, data anonymization technique is used for association rule hiding, while parallelization and scalability features are also embedded in the proposed model, in order to speed up big data mining process. In this way, instead of removing some instances of an existing important association rule, generalization is used to anonymize items in appropriate level. So, if necessary, we can update important association rules based on the new data entrances. We have conducted some experiments using three datasets in order to evaluate performance of the proposed model in comparison with Max-Min2 and HSCRIL. Experimental results show that the information loss of the proposed model is less than existing researches in this area and this model can be executed in a parallel manner for less execution time

Authors and Affiliations

Golnar Assadat Afzali, Shahriar Mohammadi

Keywords

Big Data; Association Rule; Privacy Preserving; Anonymization; Data Mining.

Blog feed search in Persian Blogosphere

Blogs are one of the main user generated content on the web. So, it is necessary to present retrieval algorithms to the meet information need of weblog users. The goal of blog feed search is to rank blogs regarding their...

Network RAM Based Process Migration for HPC Clusters

Process migration is critical to dynamic balancing of workloads on cluster nodes in any high performance computing cluster to achieve high overall throughput and performance. Most existing process migration mechanisms ar...

A Stochastic Lyapunov Theorem with Application to Stability Analysis of Networked Control Systems

The source of randomness in stochastic systems is an input with stochastic behavior as treated in the existing literature. Special types of stochastic processes such as the Wiener process or the Brownian motion have serv...

A New Recursive Algorithm for Universal Coding of Integers

In this paper, we aim to encode the set of all positive integers so that the codewords not only be uniquely decodable but also be an instantaneous set of binary sequences. Elias introduces three recursive algorithms for...

BER Performance Analysis of MIMO-OFDM Communication Systems Using Iterative Technique Over Indoor Power Line Channels in an Impulsive Noise Environment

This paper addresses the performance of MIMO-OFDM communication system in environments where the interfering noise exhibits non-Gaussian behavior due to impulsive phenomena. It presents the design and simulation of an it...

EP ID EP184430
DOI 10.7508/jist.2016.02.001
Views 128
Downloads 0