A Novel Benchmark K-Means Clustering on Continuous Data

Journal Title: International Journal on Computer Science and Engineering - Year 2011, Vol 3, Issue 8

Abstract

Cluster analysis is one of the prominent techniques in the field of data mining and k-means is one of the most well known popular and partitioned based clustering algorithms. K-means clustering algorithm is widely used in clustering. The performance of k-means algorithm will affect when clustering the continuous data. In this paper, a novel approach for performing k-means clustering on continuous data is proposed. It organizes all the continuous data sets in a sorted structure such that one can find all the data sets which are closest to a given centroid efficiently. The key institution behind this approach is calculating the distance from origin to each data point in the data set. The data sets are portioned into k-equal number of cluster with initial centroids and these are updated all at a time with closest one according to newly calculated distances from the data set. The experimental results demonstrate that proposed approach can improves the computational speed of the direct k-means algorithm in the total number of distance calculations and the overall time of computations particularly in handling continuous data.

Authors and Affiliations

K. Prasanna , M. Sankara Prasanna Kumar , G. Surya Narayana

Keywords

Related Articles

Design and Implementation of a Three Dimensional CNC Machine

This paper discusses the design and implementation of low cost three dimensional computerized numerical control (CNC) machines for Industrial application. The primary function of this microcontroller based CNC machine is...

Automated Transformation of Distributed Software Architectural Models to Finite State Process

Software Performance Engineering (SPE) represents the collection of software engineering activities with the purpose of identification, prediction and also improvement of software performance parameters in the early stag...

A Study of Image Segmentation and Edge Detection Techniques

Image segmentation is the key behind image understanding. Image segmentation is one of the most important steps leading to the analysis of processed image data. It is the prime area of research in computer vision. A numb...

Nondestructive and Noncontact Dielectric Measurement Methods for Transformer Oil Using Free-space Microwave Measurement System in 19 – 25 GHz Frequency Range

Nondestructive, noncontact and real time evaluation of ielectric properties of low-loss liquids are important for applications such as service-aged transformer oil, biomedical, remote sensing and design of radar absorbi...

Design and Development of Result Tool for University and College Exam and it’s Performance Study

The result system tool is designed and developed for result sheet and mark sheet preparation with various report required at University level and College level. The system is useful for the exam department of the institu...

Download PDF file
  • EP ID EP134716
  • DOI -
  • Views 106
  • Downloads 0

How To Cite

K. Prasanna, M. Sankara Prasanna Kumar, G. Surya Narayana (2011). A Novel Benchmark K-Means Clustering on Continuous Data. International Journal on Computer Science and Engineering, 3(8), 2974-2977. https://europub.co.uk/articles/-A-134716