K Means Clustering Algorithm for Partitioning Data Sets Evaluated From Horizontal Aggregations
Journal Title: IOSR Journals (IOSR Journal of Computer Engineering) - Year 2013, Vol 12, Issue 5
Abstract
Data mining refers to the process of analyzing the data from different perspectives and summarizing it into useful information that is mostly used by the different users for analyzing the data as well as for preparing data sets. A data set is collection of data that is present in the tabular form. Preparing data set involves complex SQL queries, joining tables and aggregate functions. Traditional RDBMS manages the tables with vertical format and returns one number per row. It means that it returns a single value output which is not suitable for preparing a data set. This paper mainly focused on k means clustering algorithm which is used to partition data sets after horizontal aggregations and a small description about the horizontal aggregation methods which returns set of numbers instead of one number per row. This paper consists of three methods that is SPJ, CASE and PIVOT methods in order to evaluate horizontal aggregations. Horizontal aggregations results in large volumes of data sets which are then partitioned into homogeneous clusters is important in the system. This can be performed by k means clustering algorithm
Authors and Affiliations
R. Kumar
Performance Optimization in Gang Scheduling In Cloud Computing
Cloud computing is a latest new computing paradigm where applications, data and IT services are provided over the Internet. The Job Scheduling is the key role in cloud computing systems. One technique is to use g...
Passive Image Forensic Method to Detect Resampling Forgery inDigital Images
Abstract: The digital images are becoming important part in the field of information forensics and security,because of the popularity of image editing tools, digital images can be tampered in a very efficient mannerwitho...
Agile Web Service Composition and Messaging approach for e-Government services
E-Government services are increasingly being deployed using service-based architectures. Individual web services, developed from legacy and modern firmware, are composed to achieve e-service delivery. Current web service...
Various Methods for Object Tracking- A Review
Abstract: The object tracking is the technique which is used to track object from the image or from the video. The video consists of multiple frames and in each frame location of that object had been predicted. To predic...
Study of Hiding Sensitive Data in Data Mining Using Association Rules
Abstract: This paper describes Apriori algorithm for association rules for hiding sensitive data in data mining if Large data contain sensitive information that data must be protected from the unauthorized users. Here, w...