A Novel Approach for Semi Supervised Document Clustering with Constraint Score based Feature Supervision

Journal Title: IOSR Journals (IOSR Journal of Computer Engineering) - Year 2014, Vol 16, Issue 2

Abstract

Abstract: Text document clustering provides an effective technique to manage a huge amount of retrieval outcome by grouping documents in a small number of meaningful classes. In unsupervised clustering method the unlabeled input data is used to estimate the parameter values. In a semi supervised document clustering both labeled and unlabeled input data is used for document clustering. A semi supervised clustering with feature supervision and constraint score is proposed in this paper. This proposed system which handles document clustering and feature Supervision simultaneously and this system finds the number of clusters automatically. Feature supervision uses pairwise constraints that performs supervision between the each documents. The semi-supervised constraint score that uses both pairwise constraints and the constraint score is to compute relevant features and irrelevant feature on document data set. A variational inference algorithm uses the Dirichlet Process Mixture model for the document clustering.

Authors and Affiliations

S. Princiya, , M. Prabakaran

Keywords

Related Articles

 Eclat Algorithm for FIM on CPU-GPU co-operative & parallel environment

 Abstract: Extracting the frequent itemsets from a transactional database is a fundamental task in data mining field because of its broad applications in mining association rules, time series, correlations etc. The...

 Penetration Testing for Android Smartphones

 One major challenge faced by Android users today is the security of the operating system especially during setup. The use of smartphones for communication, social networking, mobile banking and payment systems ha...

 IPv6: Threats Posed By Multicast Packets, Extension Headers  and Their Counter Measures

 Security issues concerning the spreading Internet Protocol version 6 (IPv6) is one of the major issues in the world of networking today. Since it is not the default network protocol deployed nowadays (but  s...

 Authentication of grayscale document images using shamir secret sharing scheme.

 Abstract: This paper proposed a new blind authentication method based on the secret sharing technique with a data repair capability for grayscale document images .Shamir proposed threshold secret sharing scheme in...

Competent Tracking of Moving Object Using Affine & Illumination Insensitive Template Matching

Abstract : Moving object detection & tracking in real world scene is becoming significant problem in today’s era. The extensive study in this area is motivated by potential number of applications of object tracking....

Download PDF file
  • EP ID EP152485
  • DOI 10.9790/0661-16292834
  • Views 114
  • Downloads 0

How To Cite

S. Princiya, , M. Prabakaran (2014).  A Novel Approach for Semi Supervised Document Clustering with Constraint Score based Feature Supervision. IOSR Journals (IOSR Journal of Computer Engineering), 16(2), 28-34. https://europub.co.uk/articles/-A-152485