Effective Term Based Text Clustering Algorithms
Journal Title: International Journal on Computer Science and Engineering - Year 2010, Vol 2, Issue 5
Abstract
Text clustering methods can be used to group large sets of text documents. Most text clustering methods do not address two central problems of text clustering: the very high dimensionality of the data and the understandability of the cluster descriptions. In this paper, a frequent term based approach to clustering is introduced; it provides a natural way of reducing the large dimensionality of the document vector space. The approach clusters the low-dimensional frequent term sets rather than the high-dimensional vector space. Four algorithms for effective term based text clustering are presented. An experimental evaluation on classical text documents as well as on web documents demonstrates that the proposed algorithms obtain clusterings of comparable quality significantly more efficiently than existing text clustering algorithms.
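To make the idea of clustering over frequent term sets concrete, the following is a minimal Python sketch and not the paper's four algorithms, which the abstract does not specify. It assumes whitespace tokenization, a brute-force enumeration of term sets up to a small size (a real system would likely use an Apriori-style miner), and a simple greedy selection by coverage. The function names `frequent_term_sets` and `greedy_clusters` and the parameters `min_support` and `max_size` are hypothetical.

```python
from itertools import combinations


def frequent_term_sets(docs, min_support, max_size=2):
    """Map each frequent term set to its cover (indices of docs containing all its terms).

    Assumption: brute-force enumeration, only practical for tiny vocabularies.
    """
    doc_terms = [set(d.lower().split()) for d in docs]
    vocab = sorted(set().union(*doc_terms))
    result = {}
    for size in range(1, max_size + 1):
        for combo in combinations(vocab, size):
            cover = frozenset(
                i for i, terms in enumerate(doc_terms) if terms.issuperset(combo)
            )
            if len(cover) >= min_support:
                result[frozenset(combo)] = cover
    return result


def greedy_clusters(docs, min_support, max_size=2):
    """Greedily pick term sets that cover the most not-yet-clustered documents.

    Assumption: coverage-based selection with a preference for longer term
    sets on ties; this is an illustrative heuristic, not the paper's criterion.
    """
    candidates = frequent_term_sets(docs, min_support, max_size)
    uncovered = set(range(len(docs)))
    clusters = []
    while uncovered and candidates:
        terms, cover = max(
            candidates.items(),
            key=lambda kv: (len(kv[1] & uncovered), len(kv[0])),
        )
        newly_covered = cover & uncovered
        if not newly_covered:
            break  # remaining documents contain no frequent term set
        # The term set itself serves as a readable cluster description.
        clusters.append((sorted(terms), sorted(newly_covered)))
        uncovered -= cover
    return clusters


if __name__ == "__main__":
    sample = ["apple banana", "apple banana cherry", "dog cat", "dog cat mouse"]
    for description, members in greedy_clusters(sample, min_support=2):
        print(description, members)
```

In this sketch each cluster is labeled by the term set that produced it, which illustrates why the approach yields understandable cluster descriptions while working with far fewer dimensions than the full document vector space.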
Authors and Affiliations
P. Ponmuthuramalingam, T. Devi
Visual Data Mining in Indian Election System
Good leaders and a good government are basic needs for a country's development. In India, the largest democratic country in the world, people are not fully involved in the process of selecting leaders. On an average there are 60-65...
A Simplified Methodology for Random Topology Generator builds in Ad Hoc Network Test Bed
Emulation that uses iptables commands to create a desired logical topology is a technique that has been widely used in laboratory environments. The iptables commands are input directives for the iptables packet-filtering...
Literature Review on Patient Scheduling Techniques
Patients need to undergo several checkups, tests, surgery and treatments according to their illness. This paper describes the challenges of patient scheduling and patient scheduling techniques. An efficient scheduling te...
Fuzzy Group Decision Making Using Surrogate Worth Trade-Off Method
In this paper, fuzzy theory is applied to the group decision making problem. The surrogate worth trade-off method is discussed and, given the fuzzy nature of human decisions, linguistic variables are applied to state the Decision...
Emerging Requirements Of Reconfigurable Computing Systems For Performance Enhancement
Reconfigurable computing is intended to fill the gap between the non-flexible but high speed application specific integrated circuits based technology and the most flexible but slow speed general purpose processor bas...