Survey on Feature Selection in Document Clustering

Journal Title: International Journal on Computer Science and Engineering - Year 2011, Vol 3, Issue 3

Abstract

Text mining is to research technologies to discover useful knowledge from enormous collections of documents, and to develop a system to provide knowledge and to support in decision making. Basically cluster means a group of similar data, document clustering means segregating the data into different groups of similar data. Clustering is a fundamental data analysis technique used for various applications such as biology, psychology, control and signal processing, information theory and mining technologies. Text mining is not a stand-alone task that human analysts typically engage in. The goal is to transform text composed of everyday language into a structured, database format. In this way, heterogeneous documents are summarized and presented in a uniform manner. Among others, the challenging problems of text clustering are big volume, high dimensionality and complex semantics.

Authors and Affiliations

MS. K. Mugunthadevi , MRS. S. C. Punitha , Dr. . M. Punithavalli

Keywords

Related Articles

AN ARTIFICIAL FISH SWARM OPTIMIZED FUZZY MRI IMAGE SEGMENTATION APPROACH FOR IMPROVING IDENTIFICATION OF BRAIN TUMOUR

In image processing, it is difficult to detect the abnormalities in brain especially in MRI brain images. Also the tumor segmentation from MRI image data is an important; however it is time consuming while carried out by...

PERFORMANCE EVALUATION OF THREEPHASE INDUCTION MOTOR DRIVE FED FROM Z-SOURCE INVERTER

This paper presents a Z-source inverter which has been proposed as an alternative power conversion concept for adjustable speed AC drives. It is having both voltages buck and boost capabilities as they allow inverters to...

A New Approach for Designing Cryptographic Systems based on Feistel Structure

Many Classical and modern cryptographic algorithms have been developed by the Cryptographers to facilitate data security operations. Classical ciphers are not being widely used because of limited key space. Public key cr...

Implementation Of 3D DWT With 5/3 LeGall Filter For Image Processing

The discrete wavelet transform (DWT) is being increasingly used for image coding 3D-DWT provides interesting possibilities and has been studied to resolve the problem associated with compression of large size images. How...

Impulse Noise removal in Digital Images

In this paper, we introduce a new class of filter, the modified spatial median filter (MSMF) for the removal of impulse noise in digital images. The proposed filter is compared with four different filtering algorithms ba...

Download PDF file
  • EP ID EP113574
  • DOI -
  • Views 110
  • Downloads 0

How To Cite

MS. K. Mugunthadevi, MRS. S. C. Punitha, Dr. . M. Punithavalli (2011). Survey on Feature Selection in Document Clustering. International Journal on Computer Science and Engineering, 3(3), 1240-1244. https://europub.co.uk/articles/-A-113574