Empowering Document Clustering Through Multi View-Point Based Similarity Measure

Abstract

Clustering is one of the most important and long-established techniques in data mining, and it follows an unsupervised learning paradigm. The similarity of a document pair can be measured by matching concepts, but finding or extracting the most relevant concepts from the documents is a challenging task. To address this issue, this paper introduces a multi-viewpoint based similarity measure. The proposed method uses multiple points of reference between document pairs to extract more relevant matching concepts, rather than extracting ideas based only on a similarity measure. Using multiple viewpoints gathers more information about a particular topic from many different but relevant sources or concepts. This strategy works well with smaller documents and is especially effective with longer ones. By gathering more relevant concepts from the documents through multiple points of reference, document organization and retrieval can make better use of the documents held in storage and make retrieval of ideas and relevant tasks or concepts easier and faster. Experimental results show that the proposed method efficiently extracts more relevant concepts.
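
The abstract does not state the similarity formula, so the following is only a minimal sketch of one way to read "multiple points of reference". It assumes that documents are numeric term vectors, that each viewpoint is another document vector from the collection, and that the similarity of two documents is the average dot product of their difference vectors as seen from each viewpoint. The function and parameter names (`multi_viewpoint_similarity`, `viewpoints`) are hypothetical and are not taken from the paper.

```python
from typing import List, Sequence

Vector = Sequence[float]


def dot(a: Vector, b: Vector) -> float:
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def sub(a: Vector, b: Vector) -> List[float]:
    """Element-wise difference a - b."""
    return [x - y for x, y in zip(a, b)]


def multi_viewpoint_similarity(
    d_i: Vector, d_j: Vector, viewpoints: Sequence[Vector]
) -> float:
    """Average, over all viewpoints v, of (d_i - v) . (d_j - v).

    Assumption: this averaging form is an illustrative reading of a
    multi-viewpoint measure, not the formula confirmed by the paper.
    """
    if not viewpoints:
        raise ValueError("at least one viewpoint is required")
    total = sum(dot(sub(d_i, v), sub(d_j, v)) for v in viewpoints)
    return total / len(viewpoints)


if __name__ == "__main__":
    # Toy two-term vectors; the values are illustrative only.
    doc_a = [2.0, 0.0]
    doc_b = [0.0, 2.0]
    print(multi_viewpoint_similarity(doc_a, doc_b, [[1.0, 1.0]]))
```

With a single viewpoint at the origin, this sketch reduces to the plain dot product of the two document vectors, which shows how additional viewpoints change the measurement relative to a single point of reference.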

Authors and Affiliations

M. John Basha and Dr. S. Srinivasan

  • EP ID EP27622

How To Cite

M. John Basha and Dr. S. Srinivasan (2013). Empowering Document Clustering Through Multi View-Point Based Similarity Measure. International Journal of Research in Computer and Communication Technology, 2(8), -. https://europub.co.uk/articles/-A-27622