Semantic Similarity Using Web Search Engine

Journal Title: UNKNOWN - Year 2013, Vol 2, Issue 12

Abstract

Measuring the semantic similarity between words is an important component in various tasks on the web such as relation extraction, document clustering, and automatic metadata extraction. Despite the usefulness of semantic similarity measures in these applications, accurately measuring semantic similarity between two words (or entities) is still difficult. We propose a method to estimate semantic similarity using page counts and text snippets retrieved from a web search engine for two words. Specifically, we define various word co-occurrence measures using page counts and integrate those with lexical patterns extracted from text snippets. To identify the numerous semantic relations that exist between two given words, we propose a pattern extraction algorithm and a pattern clustering algorithm. The optimal combination of page counts-based co-occurrence measures and lexical pattern clusters is obtained using support vector machines.

Authors and Affiliations

Keywords

Related Articles

Survey on Cluster Based Routing Protocol in MANETs

" In Mobile Ad-hoc Networks (MANETs), many clustering schemes are proposed. The systematic classification is used. These schemes provides the better understanding and for better improvements. The Cluster Based Routing Pr...

Biceps Brachii with Third Head: A Case Report

The biceps brachii is known to show variations in the number of heads. Unilateral three headed biceps brachii was found in the right upper limb of adult cadaver. The muscle was innervated by a branch from musculocutaneou...

Mitigation of Voltage Sag and Swell by Using Magnitude Tracking Method

Our project proposes a concept to mitigate the voltage sag and swell. There are three types of voltages is being considered. Normal voltage, Lower voltage (sag), Higher voltage (swell) .Normal voltage is being considered...

An Overview on Development of Aluminium Metal Matrix Composites with Hybrid Reinforcement

Aluminum alloys are widely used in aerospace and automobile industries due to their low density and good mechanical properties, better corrosion resistance and wear, low thermal coefficient of expansion as compared to...

Hepatic Lesions Spectrum in Sudanese Patients by using Computed Tomography

Hepatic Lesions Spectrum in Sudanese Patients by using Computed Tomography

Download PDF file
  • EP ID EP339093
  • DOI -
  • Views 79
  • Downloads 0

How To Cite

(2013). Semantic Similarity Using Web Search Engine. UNKNOWN, 2(12), -. https://europub.co.uk/articles/-A-339093