K Means Clustering Algorithm for Partitioning Data Sets Evaluated From Horizontal Aggregations
Journal Title: IOSR Journals (IOSR Journal of Computer Engineering) - Year 2013, Vol 12, Issue 5
Abstract
Data mining refers to the process of analyzing the data from different perspectives and summarizing it into useful information that is mostly used by the different users for analyzing the data as well as for preparing data sets. A data set is collection of data that is present in the tabular form. Preparing data set involves complex SQL queries, joining tables and aggregate functions. Traditional RDBMS manages the tables with vertical format and returns one number per row. It means that it returns a single value output which is not suitable for preparing a data set. This paper mainly focused on k means clustering algorithm which is used to partition data sets after horizontal aggregations and a small description about the horizontal aggregation methods which returns set of numbers instead of one number per row. This paper consists of three methods that is SPJ, CASE and PIVOT methods in order to evaluate horizontal aggregations. Horizontal aggregations results in large volumes of data sets which are then partitioned into homogeneous clusters is important in the system. This can be performed by k means clustering algorithm
Authors and Affiliations
R. Kumar
Smart Blood Bank as a Service on Cloud
Abstract: We all know the working of blood bank management system. A blood bank is a cache or bank of blood or blood components, gathered as a result of blood donation or collection, stored and preserved for later use in...
A survey on comparative study of solar energy on Improving the performance of solar power plants through IOT and predictive data analytics
Abstract: To increase the utilization and development of solar energy which is Eco friendly. Weather Predication is done in order to improve the performance and maintain its consistency for long term to deliversecure and...
Scalable Image Classification Using Compression
The increasing rate of data growth has led to finding techniques for faster processing of data. Big Data analytics has recently emerged as a promising field for examining huge volume of datasets containing differen...
Web-Based System for Software Requirements Quality Analysis Using Case-Based Reasoning and Neural Network
The overall success of a software project depends on the quality of its software requirements specifications (SRS). Hence, it is very important to get the requirements correct from the onset of the project. This...
Lossless Image Compression Using Data Folding Followed ByArithmetic Coding
Abstract : The paper presents a lossless image compression technique using the hybridization of two differententropy coding techniques. Initially data folding technique has been applied to the image. A row folding...