A Bootstrap Aggregating Technique on Link-Based Cluster Ensemble Approach for Categorical Data Clustering

Journal Title: INTERNATIONAL JOURNAL OF COMPUTERS & TECHNOLOGY - Year 2013, Vol 10, Issue 8

Abstract

Although attempts have been made to solve the problem of clustering categorical data via cluster ensembles, with results competitive with conventional algorithms, these techniques generate the final data partition from incomplete information. The underlying ensemble-information matrix presents only cluster-data point relations, with many entries left unknown. The paper presents an analysis suggesting that this problem degrades the quality of the clustering result. To address it, the paper combines bootstrap aggregation (BSA), a machine learning ensemble meta-algorithm designed to improve stability and accuracy, with a new link-based approach that improves the conventional matrix by discovering unknown entries through the similarity between clusters in an ensemble. In particular, an efficient BSA and link-based algorithm is proposed for the underlying similarity assessment. Afterward, to obtain the final clustering result, a graph partitioning technique is applied to a weighted bipartite graph formulated from the refined matrix. Experimental results on multiple real data sets suggest that the proposed link-based method almost always outperforms both conventional clustering algorithms for categorical data and well-known cluster ensemble techniques.
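The matrix refinement described in the abstract can be illustrated with a small sketch. It is not the authors' algorithm: the abstract does not state which similarity measure is used, so the code below assumes a connected-triple style score between clusters, with Jaccard overlap as the edge weight and a decay factor `dc`. The function names (`build_clusters`, `jaccard`, `refine`) and the toy ensemble are hypothetical, and the base clusterings are taken as given rather than produced from bootstrap samples.

```python
from itertools import combinations


def build_clusters(ensemble):
    """Map each (clustering index, label) pair to its set of member point indices."""
    clusters = {}
    for m, labels in enumerate(ensemble):
        for i, label in enumerate(labels):
            clusters.setdefault((m, label), set()).add(i)
    return clusters


def jaccard(a, b):
    """Overlap of two non-empty member sets (assumed edge weight between clusters)."""
    return len(a & b) / len(a | b)


def refine(ensemble, dc=0.9):
    """Return (cluster ids, refined matrix) for a list of label lists.

    Known entries (the point belongs to the cluster) stay 1.0. Unknown
    entries are filled with the similarity between that cluster and the
    cluster the point does belong to in the same base clustering.
    """
    clusters = build_clusters(ensemble)
    ids = sorted(clusters)

    # Edge weights between every pair of distinct clusters.
    w = {(a, b): jaccard(clusters[a], clusters[b])
         for a in ids for b in ids if a != b}

    # Connected-triple score: two clusters are similar when they are
    # both strongly linked to the same third cluster.
    wct = {}
    for a, b in combinations(ids, 2):
        score = sum(min(w[a, k], w[b, k]) for k in ids if k not in (a, b))
        wct[a, b] = wct[b, a] = score
    wct_max = max(wct.values(), default=0.0) or 1.0  # avoid division by zero

    n_points = len(ensemble[0])
    refined = []
    for i in range(n_points):
        row = []
        for c in ids:
            own = (c[0], ensemble[c[0]][i])  # cluster of point i in c's clustering
            row.append(1.0 if c == own else dc * wct[own, c] / wct_max)
        refined.append(row)
    return ids, refined


# Toy ensemble: three base clusterings of four data points (assumed data).
ensemble = [[0, 0, 1, 1], [0, 0, 0, 1], [0, 1, 1, 1]]
ids, refined = refine(ensemble)
for row in refined:
    print([round(v, 3) for v in row])
```

Each row of `refined` corresponds to a data point and each column to a cluster from some base clustering, so the matrix can serve as the edge weights of a bipartite graph between points and clusters. The graph partitioning step that produces the final clustering is not included in this sketch.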

Authors and Affiliations

S Pavan Kumar Reddy, U Sesadri

  • EP ID EP650241
  • DOI 10.24297/ijct.v10i8.1468

How To Cite

S Pavan Kumar Reddy, U Sesadri (2013). A Bootstrap Aggregating Technique on Link-Based Cluster Ensemble Approach for Categorical Data Clustering. INTERNATIONAL JOURNAL OF COMPUTERS & TECHNOLOGY, 10(8), 1913-1921. https://europub.co.uk/articles/-A-650241