A Comparative Study of Machine Learning Approaches- SVM and LS-SVM using a Web Search Engine Based Application

Journal Title: International Journal on Computer Science and Engineering - Year 2012, Vol 4, Issue 5

Abstract

Semantic similarity refers to the concept by which a set of documents or words within the documents are assigned a weight based on their meaning. The accurate measurement of such similarity plays important roles in Natural language Processing and Information Retrieval tasks such as Query Expansion and Word Sense Disambiguation. Page counts and snippets retrieved by the search engines help to measure the semantic similarity between two words. Different similarity scores are calculated for the queried conjunctive word. Lexical pattern extraction algorithm identifies the patterns from the snippets. Two machine learning approaches- Support Vector Machine and Latent Structural Support Vector Machine are used for measuring semantic similarity between two words by combining the similarity scores from page counts and cluster of patterns retrieved from the snippets. A comparative study is made between the similarity results from both the machines. SVM classifies between synonymous and non-synonymous words using maximum marginal hyper plane. LS-SVM shows a much more accurate result by considering the latent values in the dataset.

Authors and Affiliations

S. S. Arya , S. Lavanya

Keywords

Related Articles

Real-time Image Scene Classification and Segmentation System

An original approach is proposed. Called the Pre-Segmented Region of Interest Classification Scene System (PSROI), this system is able both to classify a scene during a digital camera's pre-capture phase, and to determin...

Building Classification System to Predict Risk factors of Diabetic Retinopathy Using Text mining

This Making medical decisions such as diagnosing the diseases that cause a patient’s illness is often a complex task. The Diabetic retinopathy is one of the complications of iabetes and Diabetic retinopathy is one of th...

Optimal Capacitor For Maximum Output Power Tracking Of Self Excited Induction Generator Using Fuzzy Logic Approach

This paper aims to determine the optimal capacitors required for maximum output power of a single phase self excited induction generator (SEIG). This paper deals with theoretical, fuzzy logic and practical approach in or...

A Study of Image Segmentation and Edge Detection Techniques

Image segmentation is the key behind image understanding. Image segmentation is one of the most important steps leading to the analysis of processed image data. It is the prime area of research in computer vision. A numb...

An Algorithm for Frequent Pattern Mining Based On Apriori

Frequent pattern mining is a heavily researched area in the field of data mining with wide range of applications. Mining frequent patterns from large scale databases has emerged as an important problem in data mining and...

Download PDF file
  • EP ID EP161105
  • DOI -
  • Views 115
  • Downloads 0

How To Cite

S. S. Arya, S. Lavanya (2012). A Comparative Study of Machine Learning Approaches- SVM and LS-SVM using a Web Search Engine Based Application. International Journal on Computer Science and Engineering, 4(5), 816-823. https://europub.co.uk/articles/-A-161105