A New Link Based Approach for Categorical Data Clustering

Journal Title: International Journal of Science and Research (IJSR) - Year 2012, Vol 1, Issue 3

Abstract

Data clustering is one of the fundamental tools for understanding the structure of a data set. It plays a crucial, foundational role in machine learning, data mining, information retrieval, and pattern recognition. Conventional categorical data clustering produces incomplete results because the information it is based on is itself incomplete. This paper presents a new link-based approach that improves categorical clustering by discovering unknown entries through the similarity between clusters in an ensemble. The paper proposes an algorithm called Weighted Triple-Quality (WTQ), which uses the k-means algorithm for the base clustering. After the base clustering, consensus functions are applied to obtain cluster ensembles of the categorical data, and this data is then converted into a refined matrix. A graph partitioning technique is applied to a weighted bipartite graph to obtain the final clustering result. Experimental results on multiple real data sets suggest that the proposed link-based method almost always outperforms both conventional clustering algorithms for categorical data and well-known cluster ensemble techniques.
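The abstract gives no formulas, so the following is only a minimal sketch of how a refined matrix could be built from a cluster ensemble. It assumes one commonly cited formulation of WTQ: clusters are linked by the Jaccard overlap of their member sets, two clusters are similar when they share neighbours with low total link weight, and the similarity is scaled by a decay factor. The function name `refined_matrix`, the parameter `dc`, and the toy labelings are hypothetical. The base clustering step (k-means in the abstract) and the final bipartite graph partitioning are not shown.

```python
from itertools import combinations

def refined_matrix(labelings, dc=0.9):
    """Build a refined point-by-cluster matrix from base clusterings.

    labelings: list of base clusterings, each a list with one label per point.
    dc: assumed decay factor in (0, 1] that caps similarity between
        different clusters.
    Returns (clusters, rm): clusters is a list of (clustering index, label)
    pairs and rm[i][j] is the association of point i with clusters[j].
    """
    n = len(labelings[0])

    # Collect the member set of every cluster in the ensemble.
    members = {}
    for m, labels in enumerate(labelings):
        for i, lab in enumerate(labels):
            members.setdefault((m, lab), set()).add(i)
    clusters = list(members)

    # Link weight between two clusters: Jaccard overlap of their members.
    w = {c: {} for c in clusters}
    for a, b in combinations(clusters, 2):
        inter = len(members[a] & members[b])
        if inter:
            w[a][b] = w[b][a] = inter / len(members[a] | members[b])

    # Total link weight at each cluster; a shared neighbour with a small
    # total contributes more to the similarity of the pair it connects.
    total = {c: sum(w[c].values()) for c in clusters}
    wtq = {}
    for a, b in combinations(clusters, 2):
        shared = w[a].keys() & w[b].keys()
        wtq[a, b] = wtq[b, a] = sum(1.0 / total[k] for k in shared)
    wtq_max = max(wtq.values(), default=0.0)

    def sim(a, b):
        if a == b:
            return 1.0
        return dc * wtq[a, b] / wtq_max if wtq_max else 0.0

    # Entry is 1 for the cluster a point belongs to; otherwise it is the
    # similarity between that cluster and the point's own cluster in the
    # same base clustering (these entries are 0 in a plain binary matrix).
    rm = []
    for i in range(n):
        rm.append([sim((m, lab), (m, labelings[m][i])) for m, lab in clusters])
    return clusters, rm

# Toy ensemble: two base clusterings of four data points.
clusters, rm = refined_matrix([[0, 0, 1, 1], [0, 1, 1, 1]])
for row in rm:
    print([round(v, 3) for v in row])
```

In this sketch, the entries that a binary cluster-association matrix would leave at zero are filled with values between 0 and `dc`, which is one way to read "discovering unknown entries through similarity between clusters". The resulting matrix could then serve as the edge weights of a bipartite graph between data points and clusters.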

How To Cite

(2012). A New Link Based Approach for Categorical Data Clustering. International Journal of Science and Research (IJSR), 1(3), -. https://europub.co.uk/articles/-A-333852