A Survey on Improving the Clustering Performance in Text Mining for Efficient Information Retrieval
Journal Title: INTERNATIONAL JOURNAL OF ENGINEERING TRENDS AND TECHNOLOGY - Year 2014, Vol 8, Issue 5
Abstract
In recent years, the development of information systems in every field such as business, academics and medicine has led to increase in the amount of stored data year by year. A vast majority of data are stored in documents that are virtually unstructured. Text mining technology is very helpful for people to process huge information by imposing structure upon text. Clustering is a popular technique for automatically organizing a large collection of text. However, in real application domains, the experimenter possesses some background knowledge that helps in clustering the data. Traditional clustering techniques are rather unsuitable of multiple data types and cannot handle sparsity and high dimensional data. Co-clustering techniques are adopted to overcome the traditional clustering technique by simultaneously performing document and word clustering handling both deficiencies. Semantic understanding has become essential ingredient for information extraction, which is made by adopting constraints as a semi-supervised learning strategy. This survey reviews on the constrained co-clustering strategies adopted by researchers to boost the clustering performance. Experimental results using 20-Newsgroups dataset shows that the proposed method is effective for clustering textual documents. Furthermore, the proposed algorithm consistently outperformed all the existing constrained clustering and coclustering methods under different conditions.
Authors and Affiliations
S. Saranya , R. Munieswari
Performance Analysis of Cognitive Radio based on Cooperative Spectrum Sensing
Cognitive radio is an emerging technology that aims for efficient spectrum usage. Cognitive radios have been proposed as a solution to the spectrum underutilization problem and have been proven to increase spec...
Universal Controller Design Using Arm Controller
In this paper, different control strategies are discussed and design of universal (process) controller on ARM embedded platform is proposed. The same controller support feedback, cascade, ratio and feed forward...
Optimization of Geometry of Microfabricated Piezoelectric Actuator
This paper focuses on optimization of the geometry of piezoelectric actuator for maximization of output for applications such as micro-pump. The device structure consists of a piezoelectric disk/plate which is glue...
Simulation of Integer N Frequency Synthesizer
This document implements oscillator in the up conversion and down conversion of wireless transceivers using Integer N Frequency synthesizer with an external loop filter and VCO. The Easiest way to design and simula...
Comparison of Service Quality between Government and Private Banks in Indore
The main objective of this research paper is to measure and compare the service quality offered by government and private bank in Indore. In present time competition is increasing among the banks, this study is u...