Efficient calculation of sentence semantic similarity: a proposed scheme based on machine learning approaches and NLP te
Journal Title: Scientific Journal of Review - Year 2014, Vol 3, Issue 3
Abstract
Sentence semantic similarity plays a crucial role in applications such as Machine Translation, Information Retrieval, Question Answering and Multi-document Summarization. Because natural language allows the same meaning to be expressed in many ways, detecting sentence semantic similarity is not a trivial task. This paper combines Natural Language Processing (NLP) and machine learning techniques to propose a scheme for sentence semantic similarity. In the first part of the scheme, the NLP section, several sets of linguistic features are extracted: string-based, semantic-based, Named Entity-based and syntax-based. In the second part, machine learning algorithms are used to build classification models on the extracted features. Experimental results for the first part indicate that the extracted features are valid for sentence semantic similarity. Comparing the performance of different classification algorithms in the second part, KNN appears to be the most successful. Overall, the experimental results indicate that the proposed approach can improve the performance of sentence semantic similarity detection, especially in terms of accuracy.
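The two-part scheme described in the abstract (feature extraction followed by a classifier) can be illustrated with a minimal sketch. The abstract does not specify the exact features, data, or tooling, so everything here is an assumption made for illustration: the two string-based features (word-set Jaccard overlap and token length ratio), the toy sentence pairs and their labels, and the helper names `features` and `knn_predict`. The semantic-based, Named Entity-based and syntax-based features are not covered.

```python
import math
from collections import Counter


def tokens(sentence):
    # Naive whitespace tokenizer (an assumption; a real NLP pipeline would do more).
    return sentence.lower().split()


def jaccard(a, b):
    # Word-set overlap: |A ∩ B| / |A ∪ B|.
    sa, sb = set(tokens(a)), set(tokens(b))
    if not sa and not sb:
        return 1.0
    return len(sa & sb) / len(sa | sb)


def length_ratio(a, b):
    # Ratio of the shorter to the longer sentence, measured in tokens.
    la, lb = len(tokens(a)), len(tokens(b))
    if max(la, lb) == 0:
        return 1.0
    return min(la, lb) / max(la, lb)


def features(a, b):
    # Part 1: map a sentence pair to a small string-based feature vector.
    return (jaccard(a, b), length_ratio(a, b))


def knn_predict(train, query, k=3):
    # Part 2: k-nearest-neighbour majority vote using Euclidean distance.
    nearest = sorted(train, key=lambda item: math.dist(item[0], query))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


# Hypothetical toy data: 1 = similar pair, 0 = dissimilar pair.
PAIRS = [
    ("the cat sat on the mat", "the cat sat on a mat", 1),
    ("he bought a new car", "he bought a new car yesterday", 1),
    ("stocks fell sharply today", "stocks fell sharply on monday", 1),
    ("the cat sat on the mat", "stocks fell sharply today", 0),
    ("he bought a new car", "rain is expected tomorrow", 0),
    ("she plays the violin", "the meeting starts at noon", 0),
]
train = [(features(a, b), label) for a, b, label in PAIRS]

similar = knn_predict(
    train,
    features("a dog barked at the mailman", "a dog barked loudly at the mailman"),
)
dissimilar = knn_predict(
    train,
    features("a dog barked at the mailman", "interest rates rose again"),
)
print(similar, dissimilar)
```

On this toy data the first query pair is classified as similar and the second as dissimilar. The sketch says nothing about the accuracy reported in the paper.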
Authors and Affiliations
M. Roostaee | Department of Computer Science and Engineering and IT, School of Electrical Engineering and Computer, Shiraz, Iran
S. M. Fakhrahmad | Department of Computer Science and Engineering and IT, School of Electrical Engineering and Computer, Shiraz, Iran
M. H. Sadreddini* | Department of Computer Science and Engineering and IT, School of Electrical Engineering and Computer, Shiraz, Iran
A. Khalili | Department of Computer Science and Engineering and IT, School of Electrical Engineering and Computer, Shiraz, Iran