Reconstruction of Perturbed Data using K-Means
Journal Title: International Journal of Computational Engineering and Management IJCEM - Year 2012, Vol 15, Issue 6
Abstract
A key element in preserving privacy and confidentiality of sensitive data is the ability to evaluate the extent of all potential disclosure for such data. In other words, we need to be able to answer to what extent confidential information in a perturbed database can be compromised by attackers or snoopers. Several randomized techniques have been proposed for privacy preserving data mining of continuous data. These approaches generally attempt to hide the sensitive data by randomly modifying the data values using some additive noise and aim to reconstruct the original distribution closely at an aggregate level. The main contribution of this paper lies in the algorithm to accurately reconstruct the community joint density given the perturbed multidimensional stream data information. Any statistical question about the community can be answered using the reconstructed joint density. There have been many efforts on the community distribution reconstruction. Our research objective is to determine whether the distributions of the original and recovered data are close enough to each other despite the nature of the noise applied. We are considering an ensemble clustering method to reconstruct the initial data distribution. As the tool for the algorithm implementations we chose the “language of choice in industrial world” – MATLAB.
Authors and Affiliations
Prasannta Tiwari, Hitesh Gupta
A Study on Impact of Advertising and Sales Promotion on Women Skin Care Consumers in the City of Jabalpur
In the era of globalisation and digitization, improved employment access, high female literacy rate and exposure to electronic media has laid a lot of importance to marketing communications in shaping the consumer buying...
Impact of Intellectual Capital Disclosure on Market Cap
The study aims to empirically investigate, the impact of Intellectual capital (IC) on financial aspects of the organizational performance and on market capitalization. The study also aims to develop a descriptive framewo...
Energy Saving using Cloud Computing
This paper describes cloud computing, a platform for next generation internet computing and various layers comprising a cloud. The paper defines green computing, how energy can be saved using cloud. It proposes novel ide...
High Speed and Reduced Power – Radix-2 Booth Multiplier
A multiplier is one of the key hardware blocks in most digital and high performance systems such as FIR filters, digital signal processors and microprocessors etc. A system’s performance is generally determined by the pe...
The Intellectual Capital Engine for Organizational Governance and Sustainability: A Theoretical Inquiry and Path Analysis
Purpose : The purpose of this paper is to review the international literature in the historical and current context of intellectual capital (IC) to leverage it from a third-dimension. This is approached through a big...