Clustering Mixed Data Set Using Modified MARDL Technique
Journal Title: International Journal on Computer Science and Engineering - Year 2010, Vol 2, Issue 5
Abstract
Clustering is tend to be an important issue in data mining applications. Many clustering algorithms are available to cluster datasets that contain either numeric or categorical attributes. The real life database consists of numeric, ategorical and mixed type of attributes. It is an essential task to cluster these data sets to extract significant knowledge from the existing database or to obtain statistical information about the database. Clustering large database is a time consuming process. Sampling is a process of obtaining a small set of data from the large database. Applying sampling technique would not cluster all the data points. Labeling non- clustered data point is an issue in data mining process. This paper mainly focuses on clustering mixed data set using modified MARDL (MAximal Resemblance Data Labeling) technique and to allocate each unlabeled data point into the corresponding appropriate cluster based on the novel clustering epresentative namely, N-Nodeset Importance Representative (NNIR). Accuracy and Error rate are considered as the metrics for evaluating the performance of the existing and proposed algorithm for mixed data set. The experimental result shows that MARDL for mixed data set algorithm performs better than the existing enhanced k-means.
Authors and Affiliations
Mrs. J. Jayabharathy , Dr. S. Kanmani , S. Pazhaniammal
Secured, Authenticated Communication Model for Dynamic Multicast Groups
Secure Multicast networks forms the backbone for many web and multimedia applications such as Interactive TV, Teleconference etc. The main challenge for secure multicast is scalability, efficiency and authenticity. A co...
Adjoint LMS (ALMS) Algorithm Based Active Noise Control with Feedback Path Modeling
In active noise control (ANC) systems, there exists an inherent feedback from the loudspeaker to the primary microphone. Adjoint least mean square (ALMS) algorithm is known to be an alternative to the widely used filtere...
A new similarity measure for image segmentation
We try to extract the component image virtually hidden in the principle image. Information theoretic quantities are used to measure similarity of images in a statistical frame work. Statistical method describes the textu...
A Survey : Code Optimization using Refactoring
This position paper identifies emerging trends in refactoring research particulary Refactoring, and enumerates a list of open questions, from practical as well as a theoretical point of view. We suggest these directions...
IMPROVED SOFTWARE QUALITY ASSURANCE TECHNIQUES USING SAFE GROWTH MODEL
In our lives are governed by large, complex systems with ncreasingly complex software, and the safety, security, and reliability of these systems has become a major concern. As the software in today’s systems grows larg...