A Modified Hierarchical Clustering Algorithm for Document Clustering

Abstract

Clustering is the division of data into groups called as clusters. Document clustering is done to analyse the large number of documents distributed over various sites. The similar documents are grouped together to form a cluster. The success or failure of a clustering method depends on the nature of similarity measure used. The multiviewpoint-based similarity measure or MVS uses different viewpoints unlike the traditional similarity measures that use only a single viewpoint. This increases the accuracy of clustering. A hierarchical clustering algorithm creates a hierarchical tree of the given set of data objects. Depending on the decomposition approach, hierarchical algorithms are classified as agglomerative (merging) or divisive (splitting). This paper focuses on applying multiviewpoint-based similarity measure on hierarchical clustering

Authors and Affiliations

Merin Paul , P Thangam

Keywords

Related Articles

A Low Power Asynchronous FPGA with Autonomous Fine Grain Power Gating and LEDR Encoding

Field Programmable Gate Arrays (FPGAs) are widely used to implement special purpose processors. FPGAs are economically cheaper for low quantity production because its function can be directly reprogrammed by end users. I...

Simulation and evaluation of convolution encoder for different noisy channel over wireless communication network in CDMA environment 

In this paper we simulate and evaluate the performance of physical layer of wireless communication system of CDMA-2000 specification using radio configuration-3 under forward fundamental channel 1x in terms of bit...

An Initial Approach to Provide Security in Cloud Network  

Cloud computing is a flexible, cost-effective and proven delivery platform for providing business or consumer IT services over the Internet. Cloud resources can be rapidly deployed and easily scaled, with all pro...

Radix-4 Encoder & PPG Block for Multiplier Architecture using GDI Techniqu 

A radix-4 encoder & partial product generator circuit is implemented that demand high speed and low energy operation. It is a good approach if we implement the multiplier as a hybrid architecture of the radix-4...

Download PDF file
  • EP ID EP115193
  • DOI -
  • Views 63
  • Downloads 0

How To Cite

Merin Paul, P Thangam (2013). A Modified Hierarchical Clustering Algorithm for Document Clustering. International Journal of Advanced Research in Computer Engineering & Technology(IJARCET), 2(6), 1969-1973. https://europub.co.uk/articles/-A-115193