Effective Term Based Text Clustering Algorithms

Journal Title: International Journal on Computer Science and Engineering - Year 2010, Vol 2, Issue 5

Abstract

Text clustering methods can be used to group large sets of text documents. Most of the text clustering methods do not address the problems of text clustering such as very high dimensionality of the data and understandability of the clustering descriptions. In this paper, a frequent term based approach of clustering has been introduced; it provides a natural way of reducing a large dimensionality of the document vector space. This approach is based on clustering the low dimensionality frequent term sets and not on clustering high dimensionality vector space. Four algorithms for effective term based text clustering has been presented. An experimental evaluation on classical text ocuments as well as on web ocuments demonstrates that the proposed algorithms obtain clustering of comparable quality significantly more efficient than existing text clustering algorithms.

Authors and Affiliations

P. Ponmuthuramalingam , T. Devi

Keywords

Related Articles

AN EFFICIENT TEXT CLASSIFICATION USING KNN AND NAIVE BAYESIAN

The main objective is to propose a text classification based on the features selection and preprocessing thereby reducing the dimensionality of the Feature vector and increase the classification accuracy. Text classifica...

Cluster level optimization of residual energy consumption in wireless sensor networks for lifetime enhancement

Network lifetime is perhaps the most important metric for the evaluation of sensor networks. In a resource-constrained environment, the consumption of every limited resource must be considered The network can only fulfil...

Cloud Computing: A solution to Geographical Information Systems (GIS) Cloud Computing and GIS

Geographical Information Systems or Geospatial Information Systems (GIS) is a collection of tools that captures, stores, analyzes, manages, and presents data that are linked to geographical locations. GIS plays an essent...

PRE-DIAGNOSIS OF LUNG CANCER USING FEED FORWARD NEURAL NETWORK AND BACK PROPAGATION ALGORITHM

Cancer is the most important cause of death for both men and women. The early detection of cancer can be helpful in curing the disease completely. So the requirement of techniques to detect the occurrence of cancer nodul...

A Bee-Hive Optimization Approach to Improve the Network Lifetime in Wireless Sensor Networks

In Wireless Sensor Networks (WSN), maximizing the lifetime is a challenging problem. The main task of a network is to receive information from node and transmit to base station for processing. If all nodes forward data p...

Download PDF file
  • EP ID EP119030
  • DOI -
  • Views 106
  • Downloads 0

How To Cite

P. Ponmuthuramalingam, T. Devi (2010). Effective Term Based Text Clustering Algorithms. International Journal on Computer Science and Engineering, 2(5), 1665-1673. https://europub.co.uk/articles/-A-119030