An Efficient Algorithm for Document Clustering in Information Retrieval

Abstract

Document clustering divides a set of documents into groups, called clusters, so that documents within a cluster are more similar to one another than to documents in other clusters. It has been studied intensively because of its wide applicability in areas such as information retrieval, web mining, and search engines like Google. The technique measures the similarity between documents and groups them on that basis. It offers an efficient representation and visualization of the documents and thus supports convenient navigation. The main objective of this research work is to cluster documents into similar groups based on their content. To perform this task, the work uses two existing document clustering algorithms, K-means and DBSCAN, and proposes a new clustering algorithm, E-DBSCAN. The experimental results show that E-DBSCAN gives better clustering accuracy than the other two algorithms.
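The similarity-then-grouping idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation and it is not E-DBSCAN, whose details the abstract does not give. It is plain DBSCAN applied to TF-IDF vectors under cosine distance, and the whitespace tokenizer, the smoothed IDF formula, the sample documents, and the `eps` and `min_pts` values are all assumptions chosen for illustration.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Turn raw strings into sparse TF-IDF dicts (assumed: whitespace tokens)."""
    tokenized = [doc.lower().split() for doc in docs]
    n = len(tokenized)
    # Document frequency: in how many documents each term appears.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        tf = Counter(tokens)
        # Smoothed IDF (an assumed variant) keeps every weight positive.
        vectors.append(
            {t: c * (math.log((1 + n) / (1 + df[t])) + 1.0) for t, c in tf.items()}
        )
    return vectors


def cosine_distance(a, b):
    """1 - cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 1.0
    return 1.0 - dot / (norm_a * norm_b)


def dbscan(vectors, eps, min_pts):
    """Plain DBSCAN. Returns one label per vector; -1 marks noise."""
    labels = [None] * len(vectors)  # None means "not visited yet"

    def neighbors(i):
        # Brute-force region query; the point itself is always included.
        return [
            j
            for j in range(len(vectors))
            if cosine_distance(vectors[i], vectors[j]) <= eps
        ]

    cluster = -1
    for i in range(len(vectors)):
        if labels[i] is not None:
            continue
        seeds = neighbors(i)
        if len(seeds) < min_pts:
            labels[i] = -1  # provisional noise; may become a border point later
            continue
        cluster += 1
        labels[i] = cluster
        queue = [j for j in seeds if j != i]
        while queue:
            j = queue.pop()
            if labels[j] == -1:
                labels[j] = cluster  # noise reached from a core point: border
            if labels[j] is not None:
                continue
            labels[j] = cluster
            reachable = neighbors(j)
            if len(reachable) >= min_pts:
                queue.extend(reachable)  # j is a core point, keep expanding
    return labels


# Hypothetical sample documents, invented for this sketch.
docs = [
    "clustering groups similar documents",
    "document clustering groups similar text documents",
    "search engines rank web pages",
    "web search engines index pages",
    "garden robot plays a game",
]
labels = dbscan(tfidf_vectors(docs), eps=0.5, min_pts=2)
print(labels)  # prints [0, 0, 1, 1, -1]
```

In this toy run the two clustering-related documents form one cluster, the two search-related documents form another, and the unrelated document is marked as noise. Unlike K-means, DBSCAN does not need the number of clusters in advance, but its result depends on the choice of `eps` and `min_pts`.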

Authors and Affiliations

Ms. R. Janani, Dr. S. Vijayarani

How To Cite

Ms. R. Janani, Dr. S. Vijayarani (2016). An Efficient Algorithm for Document Clustering in Information Retrieval. International Journal for Research in Applied Science and Engineering Technology (IJRASET), 4(12), -. https://europub.co.uk/articles/-A-22903