Effective Term Based Text Clustering Algorithms
Journal Title: International Journal on Computer Science and Engineering - Year 2010, Vol 2, Issue 5
Abstract
Text clustering methods can be used to group large sets of text documents. Most text clustering methods, however, do not address two central problems of text clustering: the very high dimensionality of the data and the understandability of the cluster descriptions. This paper introduces a frequent term based approach to clustering, which provides a natural way of reducing the high dimensionality of the document vector space. The approach clusters the low-dimensional frequent term sets rather than the high-dimensional vector space. Four algorithms for effective term based text clustering are presented. An experimental evaluation on classical text documents as well as on web documents demonstrates that the proposed algorithms obtain clusterings of comparable quality significantly more efficiently than existing text clustering algorithms.
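The general idea of working with frequent term sets instead of full document vectors can be illustrated with a minimal sketch. This is not one of the four algorithms from the paper, which the abstract does not describe. It is a generic Apriori-style enumeration, and the names `frequent_term_sets`, `min_support`, and `max_size` are hypothetical. Each frequent term set is paired with its cover (the documents containing every term in the set), and that cover can serve as a cluster candidate described by the terms themselves.

```python
from itertools import combinations


def frequent_term_sets(docs, min_support=0.5, max_size=2):
    """Map each frequent term set to its cover (indices of documents containing it).

    Illustrative assumption: terms are whitespace-separated, lowercased tokens,
    and a term set is frequent if it occurs in at least min_support * len(docs)
    documents.
    """
    doc_terms = [set(d.lower().split()) for d in docs]
    threshold = min_support * len(doc_terms)

    # Level 1: cover of every single term.
    cover = {}
    for i, terms in enumerate(doc_terms):
        for t in terms:
            cover.setdefault(frozenset([t]), set()).add(i)
    current = {ts: c for ts, c in cover.items() if len(c) >= threshold}
    result = dict(current)

    # Higher levels: join frequent sets of the previous level (Apriori-style).
    for size in range(2, max_size + 1):
        nxt = {}
        for a, b in combinations(current, 2):
            union = a | b
            if len(union) != size or union in nxt:
                continue
            # The cover of a union is the intersection of the covers.
            shared = current[a] & current[b]
            if len(shared) >= threshold:
                nxt[union] = shared
        result.update(nxt)
        current = nxt
    return result


docs = [
    "apple banana fruit",
    "apple fruit juice",
    "car engine road",
    "car road trip",
]
for term_set, covered in frequent_term_sets(docs).items():
    print(sorted(term_set), sorted(covered))
```

In this toy corpus only a handful of term sets are frequent, so the clustering step would operate on those few candidates rather than on an eight-dimensional term vector per document. The frequent terms also double as a readable description of each candidate cluster.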
Authors and Affiliations
P. Ponmuthuramalingam, T. Devi