A Variant of Genetic Algorithm Based Categorical Data Clustering for Compact Clusters and an Experimental Study on Soybean Data for Local and Global Optimal Solutions

Abstract

Almost all partitioning clustering algorithms getting stuck to the local optimal solutions. Using Genetic algorithms (GA) the results can be find globally optimal. This piece of work offers and investigates a new variant of the Genetic algorithm (GA) based k-Modes clustering algorithm for categorical data. A statistical analysis have been done on the popular categorical dataset which shows the user specified cluster centres stuck at local optimal solution in K-modes algorithm even in all the higher iterations and the proposed algorithm overcome this problem of local optima. To the best of our knowledge, such comparison has been reported here for the first time for the case of categorical data. The obtained results, shows that the proposed algorithm is better over the conventional k-Modes algorithm in terms of optimal solutions and within cluster variation measure.

Authors and Affiliations

Abha Sharma, R. S. Thakur

Keywords

Related Articles

Genetic Algorithm for Data Exchange Optimization

Dynamic architectures have emerged to be a promising implementation platform to provide flexibility, high performance, and low power consumption for computing devices. They can bring unique capabilities to computational...

Dependency Test: Portraying Pearson's Correlation Coefficient Targeting Activities in Project Scheduling

In this paper, we discuss project scheduling with conflicting activity-resources. Several project activities require same resources but, may be scheduled with the certain lapse of time resulting in repeatedly using the s...

A Survey of Topic Modeling in Text Mining

Topic models provide a convenient way to analyze large of unclassified text. A topic contains a cluster of words that frequently occur together. A topic modeling can connect words with similar meanings and distinguish be...

Unsupervised Morphological Relatedness

Assessment of the similarities between texts has been studied for decades from different perspectives and for several purposes. One interesting perspective is the morphology. This article reports the results on a study o...

Communication and Computation Aware Task Scheduling Framework Toward Exascale Computing

The race for Exascale Computing has naturally led computer architecture to transit from the multicore era and into the heterogeneous era. Exascale Computing within the heterogenous environment necessarily use the best-fi...

Download PDF file
  • EP ID EP164402
  • DOI 10.14569/IJACSA.2016.070256
  • Views 77
  • Downloads 0

How To Cite

Abha Sharma, R. S. Thakur (2016). A Variant of Genetic Algorithm Based Categorical Data Clustering for Compact Clusters and an Experimental Study on Soybean Data for Local and Global Optimal Solutions. International Journal of Advanced Computer Science & Applications, 7(2), 410-419. https://europub.co.uk/articles/-A-164402