A Bootstrap Aggregating Technique on Link-Based Cluster Ensemble Approach for Categorical Data Clustering

Journal Title: INTERNATIONAL JOURNAL OF COMPUTERS & TECHNOLOGY - Year 2013, Vol 10, Issue 8

Abstract

Although attempts have been made to solve the problem of clustering categorical data via cluster ensembles, with the results being competitive to conventional algorithms, it is observed that these techniques unfortunately generate a final data partition based on incomplete information. The underlying ensemble-information matrix presents only cluster-data point relations, with many entries being left unknown. The paper presents an analysis that suggests this problem degrades the quality of the clustering result, and it presents a BSA (Bootstrap Aggregation) is a machine learning ensemble meta-algorithm designed to improve the stability and accuracy along with a new link-based approach, which improves the conventional matrix by discovering unknown entries through similarity between clusters in an ensemble. In particular, an efficient BSA and link-based algorithm is proposed for the underlying similarity assessment. Afterward, to obtain the final clustering result, a graph partitioning technique is applied to a weighted bipartite graph that is formulated from the refined matrix. Experimental results on multiple real data sets suggest that the proposed link-based method almost always outperforms both conventional clustering algorithms for categorical data and well-known cluster ensemble techniques.

Authors and Affiliations

S Pavan Kumar Reddy, U Sesadri

Keywords

Related Articles

Low Power/ High Speed Design in VLSI with the application of Pipelining and Parallel processing

The main objectives of any VLSI design are Power, Delay andArea. Minimizing all the objectives is a challenge in presentsituation but all efforts to achieve one of these can lead to abetter design. This paper proposes an...

Performance Evolution of Intrusion Detection system on MANET Using Genetic Evolution

Mobile ad hoc networks (MANETs) are one of the best ever growing areas of research. By providing communications in the absence of fixed infrastructure MANETs are an attractive technology. However this edibility introduce...

Optimal Electric-Power Distribution and Load-Sharing on Smart-Grids: Analysis by Artificial Neural Network

This study refers to developing an electric-power distribution system  with optimal/suboptimal load-sharing in the complex and expanding metro power-grid infrastructure.  That is, the relevant exercise is to in...

A Survey of multimedia videoconferencing system and a proposal for a novel hybrid cloud and P2P architecture

Technological advances of the Internet and network technology have allowed the development and deployment of new services as multipoint multimedia applications: long-distance education, IPTV, distributed games and videoc...

An Improved Min-Min Task Scheduling Algorithm with Grid Utilization and Minimized Makespan

Grid computing is hardware and software infrastructure which offers a economical, distributable, coordinated and credible access to strong computational abilities [1]. For optimal use of the abilities of large distribute...

Download PDF file
  • EP ID EP650241
  • DOI 10.24297/ijct.v10i8.1468
  • Views 105
  • Downloads 0

How To Cite

S Pavan Kumar Reddy, U Sesadri (2013). A Bootstrap Aggregating Technique on Link-Based Cluster Ensemble Approach for Categorical Data Clustering. INTERNATIONAL JOURNAL OF COMPUTERS & TECHNOLOGY, 10(8), 1913-1921. https://europub.co.uk/articles/-A-650241