Efficient Term Frequency and Optimal Similarity Measure of Snippet for Web Search Results

Journal Title: Engineering and Scientific International Journal - Year 2015, Vol 2, Issue 1

Abstract

All clustering methods have to assume some cluster relationship among the data objects that they are applied on. Similarity between a pair of objects can be defined either explicitly or implicitly. In this paper, we introduce a novel multi-viewpoint based similarity measure and two related clustering methods. The major difference between a traditional similarity measure and ours is that the former uses only a multi-viewpoint on clustered, which is the origin, while the latter utilizes many different viewpoints, which are objects, assumed to not be in the same cluster with the two objects being measured. Using multiple viewpoints, more informative assessment of similarity could be achieved. It combines the neighbourhood preservation capability of multidimensional content with the familiar optimal snippet-based representation by employing a multidimensional content to derive two-dimensional layouts of the query search results that preserve text similarity relations, or neighbour hoods. Theoretical analysis and empirical study are conducted to support this claim. Two criterion functions for document clustering are proposed based on this new measure. We compare them with several well-known clustering algorithms that use other popular similarity measures on various document collections to verify the advantages of our proposal.

Authors and Affiliations

Rohini D

Keywords

Related Articles

Physico-chemical parameters of Turori Dam, Turori, Dist., Osmanabad, During the Period Feb 2009 to Jan 2010

During the present study period the water samples collected from the Turori dam with the interval of one month from the selected spots of dam during the period of one year i.e. from Feb 2009 to Jan 2010. The parameters a...

Quasi-stationary thermoelastic problem with moving heat source in unidirectional Dirichlet’s rod

In this paper we deal quasi-stationary, nonhomogeneous thermo elastic problem with Dirichlet’s boundary condition in one dimensional rod of isotropic material occupying the region 0   x a . Initial temperature of the r...

Quasi-stationary thermo elastic problem with moving heat source in unidirectional Robin’s rod

In this paper we deal quasi-stationary, nonhomogeneous thermo elastic problem with Robin’s boundary condition in one dimensional rod of isotropic material occupying the region 0   x a . Initial temperature of the rod i...

Cloud Virtualization : An Overview

Cloud computing is one of today's most exciting technology because of its cost-reducing, flexibility, and scalability. With the fast growing of cloud computing technology, Data security becomes more and more important in...

Secured Smart Card System using PostGreSQL Database

This paper is concentrated on smart card security in increased transmission rate of information. The retrieval of data from the database for the application of smart cards is normally made up with embedded system technol...

Download PDF file
  • EP ID EP631445
  • DOI -
  • Views 124
  • Downloads 0

How To Cite

Rohini D (2015). Efficient Term Frequency and Optimal Similarity Measure of Snippet for Web Search Results. Engineering and Scientific International Journal, 2(1), 19-22. https://europub.co.uk/articles/-A-631445