A Novel Approach for Semi Supervised Document Clustering with Constraint Score based Feature Supervision
Journal Title: IOSR Journals (IOSR Journal of Computer Engineering) - Year 2014, Vol 16, Issue 2
Abstract
Abstract: Text document clustering provides an effective technique to manage a huge amount of retrieval outcome by grouping documents in a small number of meaningful classes. In unsupervised clustering method the unlabeled input data is used to estimate the parameter values. In a semi supervised document clustering both labeled and unlabeled input data is used for document clustering. A semi supervised clustering with feature supervision and constraint score is proposed in this paper. This proposed system which handles document clustering and feature Supervision simultaneously and this system finds the number of clusters automatically. Feature supervision uses pairwise constraints that performs supervision between the each documents. The semi-supervised constraint score that uses both pairwise constraints and the constraint score is to compute relevant features and irrelevant feature on document data set. A variational inference algorithm uses the Dirichlet Process Mixture model for the document clustering.
Authors and Affiliations
S. Princiya, , M. Prabakaran
Analyzing the Effect of Varying CBR on AODV, DSR, IERP Routing Protocols in MANET
Mobile Ad Hoc Networks (MANET) are wireless networks which do not require any infrastructure support for transferring data packet between two nodes. Mobile ad-hoc network have the attributes such as wireless conn...
A Novel User Revocation And Secure Multi Keyword Ranked Search Scheme Over Encrypted Data
Abstract: Cloud Computing is an emerging technology now days in this connection in order to store and share huge volume of data over internet we required Cloud Services like IaaS (Infrastructure as a Service) where it ca...
Auto Finding and Resolving Distributed Firewall Policy
Abstract: In the network environment firewall is one of the protection layers. A firewall policy defines how an organization’s firewalls should handle inbound and outbound network traffic for specifi c IP addresses...
Abalone Age Prediction using Artificial Neural Network
Abstract: Artificial Neural Networks are the intelligent computation systems that can be used to solve various challenging problems such as compression, optimization, classification, pattern recognition and prediction. I...
Using Concept of Steganography and Visual Cryptography for Secured Data hiding
Abstract: The most advanced and updated Shamir Encryption algorithm is efficient enough to prevent and stop unauthorized and illegal access to the secured encoded data. It is best to solution to ensure reliability...