A Variant of Genetic Algorithm Based Categorical Data Clustering for Compact Clusters and an Experimental Study on Soybean Data for Local and Global Optimal Solutions

Abstract

Almost all partitioning clustering algorithms tend to get stuck at locally optimal solutions. Genetic algorithms (GA) can help find globally optimal results. This work proposes and investigates a new variant of the GA-based k-Modes clustering algorithm for categorical data. A statistical analysis on a popular categorical dataset (the Soybean data) shows that, with user-specified cluster centres, the k-Modes algorithm remains stuck at a locally optimal solution even at higher iteration counts, and that the proposed algorithm overcomes this problem of local optima. To the best of our knowledge, such a comparison is reported here for the first time for categorical data. The results show that the proposed algorithm outperforms the conventional k-Modes algorithm in terms of optimal solutions and the within-cluster variation measure.
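
The local-optimum behaviour of conventional k-Modes described above can be illustrated with a minimal sketch. This is not the proposed GA variant, whose details the abstract does not give; it is plain k-Modes with simple matching (Hamming) dissimilarity, and the function names, the toy records, and the tie-breaking rule (lowest index, first-seen category) are all assumptions made for illustration.

```python
from collections import Counter

def hamming(a, b):
    """Simple matching dissimilarity: number of attributes that differ."""
    return sum(x != y for x, y in zip(a, b))

def k_modes(data, init_modes, max_iter=100):
    """Plain k-Modes started from user-specified modes (illustrative only).

    Returns (labels, modes, cost); cost is the total within-cluster
    dissimilarity between each record and the mode of its cluster.
    """
    modes = [tuple(m) for m in init_modes]
    labels = None
    for _ in range(max_iter):
        # Assignment step: nearest mode, ties go to the lowest cluster index.
        new_labels = [
            min(range(len(modes)), key=lambda j: hamming(row, modes[j]))
            for row in data
        ]
        if new_labels == labels:
            break  # assignments are stable, so the modes are stable too
        labels = new_labels
        # Update step: most frequent category per attribute in each cluster.
        for j in range(len(modes)):
            members = [row for row, lab in zip(data, labels) if lab == j]
            if members:
                modes[j] = tuple(
                    Counter(col).most_common(1)[0][0] for col in zip(*members)
                )
    cost = sum(hamming(row, modes[lab]) for row, lab in zip(data, labels))
    return labels, modes, cost

# Hypothetical toy records (not the Soybean data): the first two attributes
# define the natural grouping, the third is uninformative.
toy = [("a", "x", "p"), ("a", "x", "q"), ("b", "y", "p"), ("b", "y", "q")]

# One start uses a record from each natural group; the other uses two
# records from the same group.
_, _, good_cost = k_modes(toy, [toy[0], toy[2]])
_, _, bad_cost = k_modes(toy, [toy[0], toy[1]])
print(good_cost, bad_cost)
```

On this toy input the second start converges to a partition with a higher within-cluster cost than the first and never leaves it, however many iterations are allowed. A GA-based search is one way to explore several starting configurations at once instead of committing to a single one.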

Authors and Affiliations

Abha Sharma, R. S. Thakur

  • EP ID EP164402
  • DOI 10.14569/IJACSA.2016.070256

How To Cite

Abha Sharma, R. S. Thakur (2016). A Variant of Genetic Algorithm Based Categorical Data Clustering for Compact Clusters and an Experimental Study on Soybean Data for Local and Global Optimal Solutions. International Journal of Advanced Computer Science & Applications, 7(2), 410-419. https://europub.co.uk/articles/-A-164402