AN IMPROVED HYBRIDIZED KMEANS CLUSTERING ALGORITHM (IHKMCA) FOR HIGHDIMENSIONAL DATASET & IT’S PERFORMANCE ANALYSIS

Journal Title: International Journal on Computer Science and Engineering - Year 2011, Vol 3, Issue 3

Abstract

The volume and variety of data objects in practical applications are growing rapidly, and with them the number of features and attributes recorded in each data set. This in turn increases the dimensionality of the data sets. As the number of dimensions grows, the problem known as the ‘curse of dimensionality’ arises. An efficient dimension reduction technique is therefore essential for mining a high-dimensional data set. Numerous methods have been proposed, and many experimental analyses carried out, to find a technique that reduces the dimensionality of a high-dimensional data set without distorting the original data. In this paper we propose the use of canonical variate analysis to reduce the dimensions of a high-dimensional data set efficiently and effectively. A modified k-means clustering technique is then applied to the reduced, low-dimensional data set. To obtain a more accurate result, the initial centroids of the Improved Hybridized K-Means Clustering Algorithm (IHKMCA) are chosen using a genetic algorithm. The results obtained with the proposed approach show better accuracy, higher efficiency, and lower time complexity than the other approaches considered.
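
The clustering stage described in the abstract, k-means seeded by a genetic algorithm, can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' IHKMCA: the abstract does not specify the chromosome encoding, fitness function, or genetic operators, so the choices here (a chromosome of k point indices, sum of squared errors as fitness, one-point crossover, single-gene mutation) and all function names are hypothetical. The canonical variate analysis step is omitted; the input points are assumed to be already reduced in dimension.

```python
import random

def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def sse(points, centroids):
    """Sum of squared distances from each point to its nearest centroid."""
    return sum(min(sq_dist(p, c) for c in centroids) for p in points)

def ga_init_centroids(points, k, pop_size=20, generations=30,
                      mutation_rate=0.2, seed=0):
    """Assumed GA: a chromosome is k distinct point indices; lower SSE is fitter."""
    rng = random.Random(seed)
    n = len(points)

    def fitness(chrom):
        return sse(points, [points[i] for i in chrom])

    population = [rng.sample(range(n), k) for _ in range(pop_size)]
    for _ in range(generations):
        population.sort(key=fitness)
        survivors = population[:pop_size // 2]      # keep the fitter half
        children = []
        while len(survivors) + len(children) < pop_size:
            a, b = rng.sample(survivors, 2)         # pick two parents
            cut = rng.randrange(k + 1)              # one-point crossover
            child = a[:cut] + b[cut:]
            if rng.random() < mutation_rate:        # mutate a single gene
                child[rng.randrange(k)] = rng.randrange(n)
            if len(set(child)) < k:                 # repair duplicate indices
                child = rng.sample(range(n), k)
            children.append(child)
        population = survivors + children
    best = min(population, key=fitness)
    return [list(points[i]) for i in best]

def kmeans(points, centroids, max_iter=100):
    """Standard Lloyd iterations starting from the supplied centroids."""
    centroids = [list(c) for c in centroids]
    labels = []
    for _ in range(max_iter):
        # Assignment step: index of the nearest centroid for each point.
        labels = [min(range(len(centroids)),
                      key=lambda j: sq_dist(p, centroids[j])) for p in points]
        # Update step: mean of each cluster; an empty cluster keeps its centroid.
        new_centroids = []
        for j, old in enumerate(centroids):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                new_centroids.append(
                    [sum(col) / len(members) for col in zip(*members)])
            else:
                new_centroids.append(old)
        if new_centroids == centroids:              # assignments have stabilised
            break
        centroids = new_centroids
    return centroids, labels

# Usage on toy 2-D data (a stand-in for data already reduced in dimension).
demo_rng = random.Random(1)
points = [(cx + demo_rng.gauss(0, 0.5), cy + demo_rng.gauss(0, 0.5))
          for cx, cy in [(0, 0), (10, 10), (20, 0)] for _ in range(20)]
init = ga_init_centroids(points, k=3)
centroids, labels = kmeans(points, init)
```

Restricting candidate centroids to existing data points keeps the search space finite and the chromosome simple; other encodings, such as real-valued centroid coordinates, would be equally plausible readings of the abstract.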

Authors and Affiliations

H. S Behera, Rosly Boy Lingdoh, Diptendra Kodamasingh

  • EP ID EP119057

How To Cite

H. S Behera, Rosly Boy Lingdoh, Diptendra Kodamasingh (2011). AN IMPROVED HYBRIDIZED KMEANS CLUSTERING ALGORITHM (IHKMCA) FOR HIGHDIMENSIONAL DATASET & IT’S PERFORMANCE ANALYSIS. International Journal on Computer Science and Engineering, 3(3), 1183-1190. https://europub.co.uk/articles/-A-119057