An Improved Algorithm for Text Document Clustering

Abstract

With the growth of the internet, the volume of electronic documents available on the web is increasing day by day. Document clustering plays an important role in organizing and summarizing these documents, so developing a fast and effective document clustering algorithm is of great importance. This paper presents an improved document clustering algorithm that enhances the standard k-means algorithm. Experiments conducted to evaluate its performance show that the improved algorithm performs better than standard k-means. Feature selection is also applied to improve clustering effectiveness.
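
For orientation, the standard k-means baseline that the abstract refers to can be sketched as follows. This is a minimal illustration only, not the improved algorithm of the paper, whose details the abstract does not give. The TF-IDF weighting, the whitespace tokenizer, the squared Euclidean distance, the fixed seed indices, and all function names (`tfidf_vectors`, `sq_dist`, `mean_vector`, `kmeans`) are assumptions made for this example.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Map each document to an L2-normalized sparse TF-IDF vector (a dict).

    Assumption: tokens are lowercase whitespace-separated words.
    """
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)
    # Document frequency: in how many documents each term occurs.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        tf = Counter(tokens)
        # Smoothed IDF keeps every weight strictly positive.
        vec = {t: c * (math.log((1 + n_docs) / (1 + df[t])) + 1.0)
               for t, c in tf.items()}
        norm = math.sqrt(sum(w * w for w in vec.values())) or 1.0
        vectors.append({t: w / norm for t, w in vec.items()})
    return vectors


def sq_dist(a, b):
    """Squared Euclidean distance between two sparse vectors."""
    return sum((a.get(t, 0.0) - b.get(t, 0.0)) ** 2 for t in set(a) | set(b))


def mean_vector(vectors):
    """Component-wise mean of a non-empty list of sparse vectors."""
    total = Counter()
    for v in vectors:
        for t, w in v.items():
            total[t] += w
    return {t: w / len(vectors) for t, w in total.items()}


def kmeans(vectors, k, seeds, max_iter=100):
    """Standard k-means; `seeds` lists the k document indices used as
    initial centroids (fixed here so the run is reproducible)."""
    centroids = [dict(vectors[i]) for i in seeds]
    labels = [-1] * len(vectors)
    for _ in range(max_iter):
        # Assignment step: each document goes to its nearest centroid.
        new_labels = [min(range(k), key=lambda j: sq_dist(v, centroids[j]))
                      for v in vectors]
        if new_labels == labels:
            break  # no assignment changed, so the algorithm has converged
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for j in range(k):
            members = [v for v, lab in zip(vectors, labels) if lab == j]
            if members:  # an empty cluster keeps its previous centroid
                centroids[j] = mean_vector(members)
    return labels


docs = [
    "cat dog pet",
    "dog pet animal",
    "stock market trade",
    "market trade price",
]
print(kmeans(tfidf_vectors(docs), k=2, seeds=[0, 2]))  # prints [0, 0, 1, 1]
```

The result of standard k-means depends on the choice of initial centroids, which is why the seeds are passed in explicitly here. A feature selection step, such as dropping very rare terms before weighting, would be applied inside `tfidf_vectors` under the same assumptions.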

Authors and Affiliations

Latika

How To Cite

Latika (2015). An Improved Algorithm for Text Document Clustering. International Journal of Computer Science & Engineering Technology, 6(6), 358-364. https://europub.co.uk/articles/-A-153281