Clustering Mixed Data Set Using Modified MARDL Technique
Journal Title: International Journal on Computer Science and Engineering - Year 2010, Vol 2, Issue 5
Abstract
Clustering is tend to be an important issue in data mining applications. Many clustering algorithms are available to cluster datasets that contain either numeric or categorical attributes. The real life database consists of numeric, ategorical and mixed type of attributes. It is an essential task to cluster these data sets to extract significant knowledge from the existing database or to obtain statistical information about the database. Clustering large database is a time consuming process. Sampling is a process of obtaining a small set of data from the large database. Applying sampling technique would not cluster all the data points. Labeling non- clustered data point is an issue in data mining process. This paper mainly focuses on clustering mixed data set using modified MARDL (MAximal Resemblance Data Labeling) technique and to allocate each unlabeled data point into the corresponding appropriate cluster based on the novel clustering epresentative namely, N-Nodeset Importance Representative (NNIR). Accuracy and Error rate are considered as the metrics for evaluating the performance of the existing and proposed algorithm for mixed data set. The experimental result shows that MARDL for mixed data set algorithm performs better than the existing enhanced k-means.
Authors and Affiliations
Mrs. J. Jayabharathy , Dr. S. Kanmani , S. Pazhaniammal
Prediction of Profitability of Industries using Weighted SVR
In order to measure the profitability of an industry by predicting Pre-Tax Operating Margin by applying regression technique on Price/Sales Ratio and Net Margin of various industries. Prediction of Pre-Tax Operating Marg...
An Optimized Round Robin Scheduling Algorithm for CPU Scheduling
The main objective of this paper is to develop a new approach for round robin scheduling which help to improve the CPU efficiency in real time and time sharing operating system. There are many algorithms available for CP...
UNICODE and Colors Integration tool for Encryption and Decryption
Cryptography, to most people, is concerned with keeping communications private. Indeed, the protection of sensitive communications has been the emphasis of cryptography throughout much of its history. Encryption is the t...
Organizational improvement using Organizational paradigms with the support of people paradigms
An organization is a vital part of social environment. Different parts of organization have great impact to the environment. On the other hand the different organizational strategy helps to improve the efficiency of orga...
MEASURING THE QUALITY OF OBJECT ORIENTED SOFTWARE MODULARIZATION DEFINING METRICS AND ALGORITHM
We proposed a System to measure the quality of modularization of object-oriented software system. Our work is proposed in three Parts as follows: MODULE 1: DEFINING METRICS FOR OBJECT ORIENTED SOFTWARE AND ALGORITHM M...