AN IMPROVED HYBRIDIZED KMEANS CLUSTERING ALGORITHM (IHKMCA) FOR HIGHDIMENSIONAL DATASET & IT’S PERFORMANCE ANALYSIS
Journal Title: International Journal on Computer Science and Engineering - Year 2011, Vol 3, Issue 3
Abstract
In practical life we can see the rapid growth in the various data objects around us, which thereby demands the increase of features and attributes of the data set. This phenomenon, in turn leads to the increase of dimensions of the various data sets. When increase of dimension occurred, the ultimate problem referred to as the ‘the curse of dimensionality’ comes in to picture. For this reason, in order to mine a high dimensional data set an improved and an efficient dimension reduction technique is very crucial and apparently can be considered as the need of the hour. Numerous methods have been proposed and many experimental analyses have been done to find out an efficient reduction technique so as to reduce the dimension of a high dimensional data set without affecting the original data’s. In this paper we proposed the use of Canonical Variate analysis, which serves the purpose of reducing the dimensions of a high dimensional dataset in a more efficient and effective manner. Then to the reduced low dimensional data set, a clustering technique is applied using a modified k-means clustering. In our paper for the purpose of initializing the initial centroids of the Improved Hybridized K Means clustering algorithm (IHKMCA) we make use of genetic algorithm, so as to get a more accurate result. The results thus found from the proposed work have better accuracy, more efficient and less time complexity as compared to other approaches.
Authors and Affiliations
H. S Behera , Rosly Boy Lingdoh , Diptendra Kodamasingh
A New Technique to Backup and Restore DBMS using XML and .NET Technologies
In this paper, we proposed a new technique for backing up and restoring different Database Management Systems (DBMS). he technique is enabling to backup and restore a part of or the whole database using a unified interf...
Design and Development of Wireless Sensor Node
This paper presents design and development of intelligent sensor node for environmental monitoring. The node is equipped with multimode sensors for sensing different environmental parameters, the node can sense four diff...
An Artificial Immune System Model for Multi Agents Resource Sharing in Distributed Environments
Natural Immune system plays a vital role in the survival of the all living being. It provides a mechanism to defend itself from external predates making it consistent systems, capable of adapting itself for survival inca...
Performance Evaluation of Algorithms using a Distributed Data Mining Frame Work based on Association Rule Mining
Numerous current data mining tasks can be implemented effectively only in a distributed data mining. Thus distributed data mining has achieved significant importance in the last decade. The proposed distributed data mini...
Face and Gender Recognition Using Principal Component Analysis
Face recognition is a biometric analysis tool that has enabled surveillance systems to detect humans and recognize humans without their co-operation. In this paper we evaluate the basics of the Principal Component Analys...