A Novel Approach for Semi Supervised Document Clustering with Constraint Score based Feature Supervision

Journal Title: IOSR Journals (IOSR Journal of Computer Engineering) - Year 2014, Vol 16, Issue 2

Abstract

Abstract: Text document clustering provides an effective technique to manage a huge amount of retrieval outcome by grouping documents in a small number of meaningful classes. In unsupervised clustering method the unlabeled input data is used to estimate the parameter values. In a semi supervised document clustering both labeled and unlabeled input data is used for document clustering. A semi supervised clustering with feature supervision and constraint score is proposed in this paper. This proposed system which handles document clustering and feature Supervision simultaneously and this system finds the number of clusters automatically. Feature supervision uses pairwise constraints that performs supervision between the each documents. The semi-supervised constraint score that uses both pairwise constraints and the constraint score is to compute relevant features and irrelevant feature on document data set. A variational inference algorithm uses the Dirichlet Process Mixture model for the document clustering.

Authors and Affiliations

S. Princiya, , M. Prabakaran

Keywords

Related Articles

 Analyzing the Effect of Varying CBR on AODV, DSR, IERP Routing Protocols in MANET

 Mobile Ad Hoc Networks (MANET) are wireless networks which do not require any infrastructure support for transferring data packet between two nodes. Mobile ad-hoc network have the attributes such as wireless conn...

A Novel User Revocation And Secure Multi Keyword Ranked Search Scheme Over Encrypted Data

Abstract: Cloud Computing is an emerging technology now days in this connection in order to store and share huge volume of data over internet we required Cloud Services like IaaS (Infrastructure as a Service) where it ca...

 Auto Finding and Resolving Distributed Firewall Policy

 Abstract: In the network environment firewall is one of the protection layers. A firewall policy defines how an organization’s firewalls should handle inbound and outbound network traffic for specifi c IP addresses...

Abalone Age Prediction using Artificial Neural Network

Abstract: Artificial Neural Networks are the intelligent computation systems that can be used to solve various challenging problems such as compression, optimization, classification, pattern recognition and prediction. I...

 Using Concept of Steganography and Visual Cryptography for Secured Data hiding

 Abstract: The most advanced and updated Shamir Encryption algorithm is efficient enough to prevent and stop unauthorized and illegal access to the secured encoded data. It is best to solution to ensure reliability...

Download PDF file
  • EP ID EP152485
  • DOI 10.9790/0661-16292834
  • Views 100
  • Downloads 0

How To Cite

S. Princiya, , M. Prabakaran (2014).  A Novel Approach for Semi Supervised Document Clustering with Constraint Score based Feature Supervision. IOSR Journals (IOSR Journal of Computer Engineering), 16(2), 28-34. https://europub.co.uk/articles/-A-152485