A Comparative Study of Machine Learning Approaches- SVM and LS-SVM using a Web Search Engine Based Application
Journal Title: International Journal on Computer Science and Engineering - Year 2012, Vol 4, Issue 5
Abstract
Semantic similarity assigns a weight to a pair of words, or to a set of documents, according to how closely their meanings are related. Accurate measurement of this similarity is important in Natural Language Processing and Information Retrieval tasks such as query expansion and word sense disambiguation. Page counts and snippets returned by a web search engine can be used to measure the semantic similarity between two words. Several similarity scores are computed from the page counts returned for the conjunctive query of the two words, and a lexical pattern extraction algorithm identifies patterns in the snippets. Two machine learning approaches, the Support Vector Machine (SVM) and the Latent Structural Support Vector Machine (LS-SVM), measure the semantic similarity between two words by combining the page-count similarity scores with clusters of the patterns extracted from the snippets. The similarity results from the two machines are then compared. The SVM separates synonymous from non-synonymous word pairs using a maximum-margin hyperplane. The LS-SVM gives more accurate results because it takes the latent variables in the dataset into account.
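The abstract does not name the page-count similarity scores it uses. As a minimal sketch, the block below assumes four co-occurrence measures that are common in web-based similarity work (Jaccard, Dice, overlap, and pointwise mutual information). The function names, the index size `n`, and the page counts in the usage note are all hypothetical, chosen only for illustration. `p` and `q` are the page counts for each word alone, and `pq` is the page count for the conjunctive query of both words.

```python
import math


def web_jaccard(p, q, pq):
    """Jaccard coefficient: pages with both words / pages with either word."""
    union = p + q - pq
    return pq / union if union > 0 else 0.0


def web_dice(p, q, pq):
    """Dice coefficient: twice the joint count over the sum of single counts."""
    total = p + q
    return 2 * pq / total if total > 0 else 0.0


def web_overlap(p, q, pq):
    """Overlap coefficient: joint count over the smaller single count."""
    smaller = min(p, q)
    return pq / smaller if smaller > 0 else 0.0


def web_pmi(p, q, pq, n):
    """Pointwise mutual information (base 2); n is an assumed index size."""
    if min(p, q, pq) <= 0:
        return 0.0
    return math.log2((pq / n) / ((p / n) * (q / n)))


def similarity_features(p, q, pq, n):
    """Collect the four scores into one feature vector for a classifier."""
    return [
        web_jaccard(p, q, pq),
        web_dice(p, q, pq),
        web_overlap(p, q, pq),
        web_pmi(p, q, pq, n),
    ]
```

Calling `similarity_features(100, 50, 25, 1000)` with these made-up counts yields one four-element vector. In a pipeline like the one the abstract describes, such a vector would be combined with the pattern-cluster features before being passed to the SVM or LS-SVM.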
Authors and Affiliations
S. S. Arya, S. Lavanya