An Improved Hierarchical Technique for Document Clustering

Journal Title: UNKNOWN - Year 2015, Vol 4, Issue 4

Abstract

Data mining is the process of non-trivial discovery from implied, previously unknown, and potentially useful information from data in large databases. Hence it is a core element in knowledge discovery, often used synonymously. Clustering, one of technique for data mining used for grouping similar terms together. Earlier statistical analysis used in text mining depends on term frequency.Then, new concept based text mining model was introduced which analyses terms. Clustering of document is useful for the purpose of document organization, summarization, and information retrieval in an efficient way. Initially, clustering is applied for enhancing the information retrieval techniques. Of late, clustering techniques have been applied in the areas which involve browsing the gathered data or in categorizing the outcome provided by the search engines for the reply to the query raised by the users. In this paper, we are providing a comprehensive survey over the document clustering.

Authors and Affiliations

Keywords

Related Articles

Effect of Sub Maximal Exercise Training on Exercise Capacity in Patients with Chronic Obstructive Pulmonary Disease

Chronic obstructive pulmonary disease is a heterogeneous condition embracing several overlapping pathological process including chronic bronchitis, chronic Bronchiolitis (small air way disease) and emphysema. Many patien...

Effects of Electrode Polarityon SKD61 Steel Surface Properties in Powder Mixed Electrical Discharge Machining

Metal powder or alloy powder is suspended in a suitable dielectric fluid during electrical machining dischagre (EDM) is very effective in improving the productivity and quality of the machined surface. Thus, the research...

E-Learning-Future of Education

This paper studies the e-learning standards in present and future digital age. Because of rapid growth and development of computers and internet in education a large number of e-learning systems developed. Now e-learning...

Statistical Analysis of Factors that Influence Voter Response Using Factor Analysis and Principal Component Analysis

General elections in any country provides an avenue through which citizens exercise their democratic rights in electing leaders of their choice to lead them through a predefined constitutional term in office. Leaders are...

Successful Pregnancy in a Case of Bicornuate Uterus with Pre Eclampsia and IUGR – A Case Report

Uterus didelphys is a condition of lateral fusion defect causing two hemi uteri and cervices. It constitutes approximately 5% of the mullerian duct anomalies.These malformations are associated with miscarriage, pre...

Download PDF file
  • EP ID EP366339
  • DOI -
  • Views 136
  • Downloads 0

How To Cite

(2015). An Improved Hierarchical Technique for Document Clustering. UNKNOWN, 4(4), -. https://europub.co.uk/articles/-A-366339