Empowering Document Clustering Through Multi View-Point Based Similarity Measure

Abstract

Among data mining techniques, clustering is one of the most important and long-established concepts, and it is an unsupervised learning paradigm. The similarity of a document pair can be measured by matching concepts. Finding or extracting the most relevant concepts from documents is a challenging task. To address this issue, this paper introduces a multi-viewpoint based similarity measure. The proposed method uses multiple points of reference between document pairs to extract more relevant matching concepts, rather than extracting ideas based only on a similarity measure. Using multiple viewpoints gathers more information about a particular topic from many different but relevant sources or concepts. This strategy works well with smaller documents but is especially effective with longer documents. By gathering more relevant concepts from the documents with multiple points of reference, document organization and retrieval can make better use of the documents held in storage, and retrieval of ideas and of relevant tasks or concepts becomes much easier and faster. Experimental results show that the proposed method efficiently extracts more relevant concepts.
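
The idea of comparing two documents from multiple points of reference can be illustrated with a minimal sketch. The abstract does not state the formula, so the code assumes one formulation found in the multi-viewpoint clustering literature: the similarity of documents d_i and d_j is the average, over a set of viewpoint documents d_h, of the dot product (d_i - d_h) . (d_j - d_h). The function names, the toy term-weight vectors, and the choice of viewpoints are hypothetical and may differ from the authors' method.

```python
from math import sqrt


def normalize(vec):
    """Scale a term-weight vector to unit length (all-zero vectors are returned unchanged)."""
    norm = sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm else list(vec)


def multi_viewpoint_similarity(d_i, d_j, viewpoints):
    """Average of (d_i - d_h) . (d_j - d_h) over all viewpoint vectors d_h.

    Assumed formulation, not taken from the abstract: each viewpoint d_h
    acts as a point of reference from which both documents are observed.
    """
    if not viewpoints:
        raise ValueError("at least one viewpoint is required")
    total = 0.0
    for d_h in viewpoints:
        total += sum((a - h) * (b - h) for a, b, h in zip(d_i, d_j, d_h))
    return total / len(viewpoints)


if __name__ == "__main__":
    # Hypothetical term-weight vectors over a three-term vocabulary.
    doc_a = normalize([3.0, 1.0, 0.0])
    doc_b = normalize([2.0, 2.0, 0.0])
    # Documents outside the pair being compared serve as viewpoints.
    others = [normalize([0.0, 1.0, 4.0]), normalize([1.0, 0.0, 3.0])]
    print(multi_viewpoint_similarity(doc_a, doc_b, others))
```

With a single viewpoint at the origin, this measure reduces to the plain dot product, which for unit-length vectors is the cosine similarity; additional viewpoints are what distinguish the multi-viewpoint variant in this sketch.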

Authors and Affiliations

M. John Basha and Dr. S. Srinivasan

How To Cite

M. John Basha and Dr. S. Srinivasan (2013). Empowering Document Clustering Through Multi View-Point Based Similarity Measure. International Journal of Research in Computer and Communication Technology, 2(8), -. https://europub.co.uk/articles/-A-27622