Comparative analysis of mid-point based and proposed mean based K-Means Clustering Algorithm for Data Mining

Abstract

In the original k-means algorithm the initial centroids are taken just randomly out of the input data set. But this random selection of initial centroids leads the computation of the algorithm into local optima. Each time the end clustering results will come out to be different. This is the limitation which needs to be dealt with in order to make the k-means algorithm more efficient. The mid –point is used as a metric for computing the initial centroids but this algorithm may be suitable for a wide variety of problems but it is not suitable for all kinds of problems. As it concentrates on calculating the mid-point of different subsets of the data set, so it is most suitable to problems where the input data is regularly or uniformly distributed across the space. But in the situations where the input data is irregular or non-uniformly distributed, this algorithm will not produce the appropriate results. This paper presents the mean as the metric for choosing initial centroid and the comparison of both the algorithms.

Authors and Affiliations

Kirti Aggarwal, Neha Aggarwal

Keywords

Related Articles

Image Processing in Quality Assessment of Pulses

Food is essential for nourishment and sustenance of life. The addition of impurities in food affects the composition and quality of food. Quality assessment of grains is a very big challenge since time immemorial. Human...

Analysis of Critical Strategic Factors for Successful Implementation of Poverty Alleviation Programmes in Nigeria

Poverty as a subject matter has been attracting increasing attention in the academic literature in recent years, and researchers from a variety of disciplines such as sociology, psychology, business, management, as well...

Examining the Influence of Strategic Land Rehabilitation on Sustainable Entrepreneurship & Economic Development (A Case of Kilome – Kenyaland Reclamation)

This study sought to To examine the influence of land rehabilitation on sustainable economic development in Kilome, specifically studying how entrepreneurial attributes, entrepreneurial value of Kilome area resources, po...

Role of Contextual Factors in using eLearning Systems for Higher Education in Developing Countries

The same basic computing facilities are available in most of the Asian countries like Pakistan; however it is never possible to attain the same outputs from digital systems working either in public or private sector. Thi...

“E Learning – The Next Religion of Education” An In-depth Analysis of its Effectiveness from Different Perspectives in Context of India

Learning in this new era of internet has changed completely moving from the traditional black board to ICT smart class and then to web based learning and the journey is still on. Due to its convenience in terms time and...

Download PDF file
  • EP ID EP97983
  • DOI -
  • Views 111
  • Downloads 0

How To Cite

Kirti Aggarwal, Neha Aggarwal (2012). Comparative analysis of mid-point based and proposed mean based K-Means Clustering Algorithm for Data Mining. International Journal of Computational Engineering and Management IJCEM, 15(4), 71-74. https://europub.co.uk/articles/-A-97983