An Improved Hierarchical Technique for Document Clustering

Journal Title: International Journal of Science and Research (IJSR) - Year 2015, Vol 4, Issue 4

Abstract

Data mining is the non-trivial discovery of implicit, previously unknown, and potentially useful information from data in large databases. It is therefore a core element of knowledge discovery, and the two terms are often used synonymously. Clustering is a data mining technique used for grouping similar terms together. Earlier statistical analysis in text mining depended on term frequency. Later, a concept-based text mining model was introduced that analyses terms. Document clustering is useful for efficient document organization, summarization, and information retrieval. Initially, clustering was applied to enhance information retrieval techniques. More recently, clustering techniques have been applied to browsing gathered data and to categorizing the results that search engines return in response to user queries. In this paper, we provide a comprehensive survey of document clustering.

How To Cite

(2015). An Improved Hierarchical Technique for Document Clustering. International Journal of Science and Research (IJSR), 4(4), -. https://europub.co.uk/articles/-A-366339