A New Link Based Approach for Categorical Data Clustering

Journal Title: International Journal of Science and Research (IJSR) - Year 2012, Vol 1, Issue 3

Abstract

The data generated by conventional categorical data clustering is incomplete because the information provided is also incomplete. This project presents a new link-based approach, which improves the categorical clustering by discovering unknown entries through similarity between clusters in an ensemble. A graph partitioning technique is applied to a weighted bipartite graph to obtain the final clustering result. So the link-based approach outperforms both conventional clustering algorithms for categorical data and well-known cluster ensemble technique. Data clustering is one of the fundamental tools we have for understanding the structure of a data set. It plays a crucial, foundation role in machine learning, data mining, information retrieval and pattern recognition. The experimental results on multiple real data sets suggest that the proposed link-based method almost always outperforms both conventional clustering algorithms for categorical data and well-known cluster ensemble technique. This paper proposes an Algorithm called Weighted Triple-Quality (WTQ), which also uses k-means algorithm for basic clustering. Once using does the basic clustering consensus functions we can get cluster ensembles of categorical data. This categorical data is converted to refined matrix.

Authors and Affiliations

Keywords

Related Articles

Biodegradation of Chromium Contaminated Soil by Some Bacterial Species

Chromium (Cr) is one of the most common heavy metals affecting the soil quality which is introduced into the environment from industries such as leather tanning, electroplating and inorganic pigment (green, orange and ye...

Study of Quirky and Null Cases in Khoibu

This article is an attempt to study two cases of Khoibu in terms of Minimalist Syntax of Generative Grammar. Since an enormous amount of research interest has been recently shown in the study of Quirky and Null Cases, th...

A New Model of Permutation the Pieces of Nucleotides in DNA Sequences Using the Action of Dihedral Group and Graph Theory

In this paper, we give a new model of genetic algorithm using the action of largest subgroup of dihedral Group D_n, n=3^m, m is natural number, m grater than equal 2, bipartite graph, and a markov basis for (n^2-3n)/3×...

Elliptic Curve Cryptography: An efficient approach for encryption & decryption of a data sequence

Elliptic Curve Cryptography: An efficient approach for encryption & decryption of a data sequence

Design of a Single-Phase PV Fed Nine Level Inverter to Drive an Induction Motor

Single Phase induction motors are widely accepted motor due to their energy efficient characteristics. To drive varying mechanical loads for long duty, the machine needs to be controlled to increase its efficiency and mi...

Download PDF file
  • EP ID EP333852
  • DOI -
  • Views 132
  • Downloads 0

How To Cite

(2012). A New Link Based Approach for Categorical Data Clustering. International Journal of Science and Research (IJSR), 1(3), -. https://europub.co.uk/articles/-A-333852