Preserving Data Clustering with Expectation Maximization Algorithm

Journal Title: Journal of Information Systems and Telecommunication - Year 2016, Vol 4, Issue 3

Abstract

Data mining and knowledge discovery are important technologies for business and research. Despite their benefits in various areas such as marketing, business and medical analysis, the use of data mining techniques can also result in new threats to privacy and information security. Therefore, a new class of data mining methods called privacy preserving data mining (PPDM) has been developed. The aim of researches in this field is to develop techniques those could be applied to databases without violating the privacy of individuals. In this work we introduce a new approach to preserve sensitive information in databases with both numerical and categorical attributes using fuzzy logic. We map a database into a new one that conceals private information while preserving mining benefits. In our proposed method, we use fuzzy membership functions (MFs) such as Gaussian, P-shaped, Sigmoid, S-shaped and Z-shaped for private data. Then we cluster modified datasets by Expectation Maximization (EM) algorithm. Our experimental results show that using fuzzy logic for preserving data privacy guarantees valid data clustering results while protecting sensitive information. The accuracy of the clustering algorithm using fuzzy data is approximately equivalent to original data and is better than the state of the art methods in this field.

Authors and Affiliations

Leila Jafar Tafreshi, Farzin Yaghmaee

Keywords

Related Articles

Enhancing Efficiency of Software Fault Tolerance Techniques in Satellite Motion System

This research shows the influence of using multi-core architecture to reduce the execution time and thus increase performance of some software fault tolerance techniques. According to superiority of N-version Programming...

A New Robust Digital Image Watermarking Algorithm Based on LWT-SVD and Fractal Images

This paper presents a robust copyright protection scheme based on Lifting Wavelet Transform (LWT) and Singular Value Decomposition (SVD). We have used fractal decoding to make a very compact representation of watermark i...

Latent Feature Based Recommender System for Learning Materials Using Genetic Algorithm

With the explosion of learning materials available on personal learning environments (PLEs) in the recent years, it is difficult for learners to discover the most appropriate materials according to keyword searching meth...

Instance Based Sparse Classifier Fusion for Speaker Verification

This paper focuses on the problem of ensemble classification for text-independent speaker verification. Ensemble classification is an efficient method to improve the performance of the classification system. This method...

Hybrid Task Scheduling Method for Cloud Computing by Genetic and PSO Algorithms

Cloud computing makes it possible for users to use different applications through the internet without having to install them. Cloud computing is considered to be a novel technology which is aimed at handling and providi...

Download PDF file
  • EP ID EP184050
  • DOI 10.7508/jist.2016.03.004
  • Views 131
  • Downloads 0

How To Cite

Leila Jafar Tafreshi, Farzin Yaghmaee (2016). Preserving Data Clustering with Expectation Maximization Algorithm. Journal of Information Systems and Telecommunication, 4(3), 167-173. https://europub.co.uk/articles/-A-184050