Efficient Term Frequency and Optimal Similarity Measure of Snippet for Web Search Results

Journal Title: Engineering and Scientific International Journal - Year 2015, Vol 2, Issue 1

Abstract

All clustering methods have to assume some cluster relationship among the data objects that they are applied on. Similarity between a pair of objects can be defined either explicitly or implicitly. In this paper, we introduce a novel multi-viewpoint based similarity measure and two related clustering methods. The major difference between a traditional similarity measure and ours is that the former uses only a multi-viewpoint on clustered, which is the origin, while the latter utilizes many different viewpoints, which are objects, assumed to not be in the same cluster with the two objects being measured. Using multiple viewpoints, more informative assessment of similarity could be achieved. It combines the neighbourhood preservation capability of multidimensional content with the familiar optimal snippet-based representation by employing a multidimensional content to derive two-dimensional layouts of the query search results that preserve text similarity relations, or neighbour hoods. Theoretical analysis and empirical study are conducted to support this claim. Two criterion functions for document clustering are proposed based on this new measure. We compare them with several well-known clustering algorithms that use other popular similarity measures on various document collections to verify the advantages of our proposal.

Authors and Affiliations

Rohini D

Keywords

Related Articles

A Study on E-Smart Card System

Smart card provides detection, verification, data storage space and application processing. Smart cards are openly connected to the volume of information and applications that are automatically used on a card. A single s...

Impact on Thermal Properties of Burnt Clay Bricks using Foundry Sand

Industrial waste such as foundry sand and many others are posing problems to manufacturing industries as strict environmental policies are not allowing open dumping or stacking of these waste materials. To solve the prob...

Efficient Term Frequency and Optimal Similarity Measure of Snippet for Web Search Results

All clustering methods have to assume some cluster relationship among the data objects that they are applied on. Similarity between a pair of objects can be defined either explicitly or implicitly. In this paper, we intr...

Improving Iris Performance using Segmentation with CASIA Database

We can recognize humans each other according to their numerous characteristics of age. Identity verification (authentication) in computer systems has been traditionally based on something like password, key, card, pin an...

Self-Similar Cylindrical Ionizing Shock Waves in a Non-uniform Gas with Radiation Heat-Flux

Self-similar flows in the background, a gasionizing cylindrical shock wave, associated with radiation heat-flux, in an ideal gas are considered. The ionizing shock is considered to be propagating in a medium at rest with...

Download PDF file
  • EP ID EP631445
  • DOI -
  • Views 154
  • Downloads 0

How To Cite

Rohini D (2015). Efficient Term Frequency and Optimal Similarity Measure of Snippet for Web Search Results. Engineering and Scientific International Journal, 2(1), 19-22. https://europub.co.uk/articles/-A-631445