Collaborative Clustering: An Algorithm for Semi-Supervised Learning
Journal Title: International Journal on Computer Science and Engineering - Year 2010, Vol 2, Issue 1
Abstract
Supervised learning is the process of disposition of a set of consanguine data items which have known labels. The apportion of an unlabeled dataset into a conglomeration of analogous items(clusters) by the optimization of an objective function to attenuate the inter-class similarity and augment the intra-class similarity is called unsupervised learning. But when multi-modal data is used, there ensues a predicament with algorithms of either type. Hence a new breed of clustering known as Semi-Supervised clustering has been popularized. This algorithm partitions an unlabelled data set into a congregation of data items by taking only the limited available information from the user. When contemporary clustering algorithms are applied on a single dataset, different result sets are obtained. Hence an algorithm is needed to reveal the underlying structure of the dataset. In this paper an algorithm for semi-supervised learning is endowed, quartered on the principle of collaboration of clusters. This analytical study can be justified by carrying out various experiments.
Authors and Affiliations
P. Padmaja , V. R. V. Vamsi Krishna. N
A New Algorithm of Encryption and Decryption of Images Using Chaotic Mapping
The combination of chaotic theory and cryptography forms an important field of information security. In the past decade, chaos based image encryption is given much attention in the research of information security and a...
ZigBee: The Emerging Technology in Building Automation
With the development of industrial automation technology, the limitation of traditional cable control network has become increasingly prominent. Consequently, establishing a reliable data transmission network becomes a c...
Role Oriented Test Case Generation for Agent Based System
Agent Oriented Software Engineering (AOSE) is a rapidly developing area of research. Current research and development primarily focuses on the analysis, design and implementation of agent based software whereas testing i...
An Efficient Pruning Technique for Mining Frequent Itemsets in Spatial Databases
Frequent Itemset Mining is evaluating the rules and relationship within the data items are optimizing it, in the large spatial databases (for e.g. Images, Docs, AVI files etc).It is one of the major problems in DM (Data...
Query Based Duplicate Data Detection on WWW
The problem of finding relevant documents has become much more prominent due to the presence of duplicate data on the WWW. This redundancy in results increases the users’ seek time to find the desired information within...