Survey on Feature Selection in Document Clustering

Journal Title: International Journal on Computer Science and Engineering - Year 2011, Vol 3, Issue 3

Abstract

Text mining is to research technologies to discover useful knowledge from enormous collections of documents, and to develop a system to provide knowledge and to support in decision making. Basically cluster means a group of similar data, document clustering means segregating the data into different groups of similar data. Clustering is a fundamental data analysis technique used for various applications such as biology, psychology, control and signal processing, information theory and mining technologies. Text mining is not a stand-alone task that human analysts typically engage in. The goal is to transform text composed of everyday language into a structured, database format. In this way, heterogeneous documents are summarized and presented in a uniform manner. Among others, the challenging problems of text clustering are big volume, high dimensionality and complex semantics.

Authors and Affiliations

MS. K. Mugunthadevi , MRS. S. C. Punitha , Dr. . M. Punithavalli

Keywords

Related Articles

UEP based on Proximity Pilot Subcarriers with QAM in OFDM

A novel UEP (Unequal Error Protection) method is proposed that utilizes the subcarrier positions relative to pilot subcarriers in an OFDM multicarrier frame along with QAM (Quadrature Amplitude Modulation) schemes. With...

Performance of SIFT based Video Retrieval

Video has become an important element of multimedia computing and communication environments, with applications as varied as broadcasting, education, publishing and military intelligence. In Video Retrieval system, each...

An Overview of Side Channel Attacks and Its Countermeasures using Elliptic Curve Cryptography

In order to provide security the electronic devices and their execution systems contain implementations of cryptographic algorithms. This paper explains basic level of side channel attacks and their countermeasure. These...

SEGMENTATION OF CT SCAN LUMBAR SPINE IMAGE USING MEDIAN FILTER AND CANNY EDGE DETECTION ALGORITHM

The lumbar vertebrae are the largest segments of the movable part of the vertebral column, they are elected L1 to L5, starting at the top. The spinal column, more commonly called the backbone, is made up primarily of ver...

An Algorithmic Approach for Efficient Image Compression using Neuro-Wavelet Model and Fuzzy Vector Quantization Technique

Applications, which need to store large database and/or transmit digital images requiring high bit-rates over channels with limited bandwidth, have demanded improved image compression techniques. This paper describes pra...

Download PDF file
  • EP ID EP113574
  • DOI -
  • Views 126
  • Downloads 0

How To Cite

MS. K. Mugunthadevi, MRS. S. C. Punitha, Dr. . M. Punithavalli (2011). Survey on Feature Selection in Document Clustering. International Journal on Computer Science and Engineering, 3(3), 1240-1244. https://europub.co.uk/articles/-A-113574