A New Link Based Approach for Categorical Data Clustering

Journal Title: UNKNOWN - Year 2012, Vol 1, Issue 3

Abstract

The data generated by conventional categorical data clustering is incomplete because the information provided is also incomplete. This project presents a new link-based approach, which improves the categorical clustering by discovering unknown entries through similarity between clusters in an ensemble. A graph partitioning technique is applied to a weighted bipartite graph to obtain the final clustering result. So the link-based approach outperforms both conventional clustering algorithms for categorical data and well-known cluster ensemble technique. Data clustering is one of the fundamental tools we have for understanding the structure of a data set. It plays a crucial, foundation role in machine learning, data mining, information retrieval and pattern recognition. The experimental results on multiple real data sets suggest that the proposed link-based method almost always outperforms both conventional clustering algorithms for categorical data and well-known cluster ensemble technique. This paper proposes an Algorithm called Weighted Triple-Quality (WTQ), which also uses k-means algorithm for basic clustering. Once using does the basic clustering consensus functions we can get cluster ensembles of categorical data. This categorical data is converted to refined matrix.

Authors and Affiliations

Keywords

Related Articles

Identification of Image Spam by Using Histogram and Hough Transform

Today, the internet is the most powerful tools throughout the world. But the explosive growth of unsolicited emails has prompted the development of numerous spam filtering techniques. It needlessly obstruct the entire sy...

A Comprehensive Relationship between Job Satisfaction and Turnover Intension of Private Commercial Bank Employees’ in Bangladesh

The main objectives of this paper are to find out the important factors which determine employees’ job satisfaction of private commercial banks in Bangladesh and to show the relationship between job satisfaction and empl...

A Survey on Methods of Plant Disease Detection

Plant disease detection is emerging field in India as agriculture is important sector in Economy and Social life. Earlier unscientific methods were in existence. Gradually with technical and scientific advancement, more...

Potential Energy Curves and Dissociation Energy for (X1

The present work concerns by study of spectroscopic properties for Copper Hydride Cu63H1. Dissociation energy had been calculated theoretically for ground state X1

Effect of Salt Stress on Physiological and Biochemical Characteristics in Solanum nigrum L.

Solanum nigrum L. is one of the important medicinal plant and contains solasodine, a steroidal glycoalkaloid, which considered as potential alternative to diosgenin for commercial synthesis of various steroidal drugs. In...

Download PDF file
  • EP ID EP333852
  • DOI -
  • Views 167
  • Downloads 0

How To Cite

(2012). A New Link Based Approach for Categorical Data Clustering. UNKNOWN, 1(3), -. https://europub.co.uk/articles/-A-333852