Efficient calculation of sentence semantic similarity: a proposed scheme based on machine learning approaches and NLP te

Journal Title: Scientific Journal of Review - Year 2014, Vol 3, Issue 3

Abstract

Sentence semantic similarity plays a crucial role in a variety of applications such as Machine Translation, Information Retrieval, Question Answering and Multi-document Summarization. Considering the variability of natural language expression, sentence semantic similarity detection is not a trivial task. This paper tries to make use of Natural Language Processing (NLP) as well as machine learning techniques in order to propose a scheme for sentence semantic similarity. In the first part of the proposed scheme, i.e., the NLP section, different sets of linguistic features including string-based, semantic-based, Named Entity-based and syntax-based features are extracted. In the second part, machine learning algorithms are used to construct classification models on the extracted set of features. Experimental results in the first part indicate that extracted features are valid for sentence semantic similarity. Moreover, by comparing the performance of different classification algorithms in the second part, KNN seems to be the most successful algorithm. Overall, experimental results indicate that the proposed approach can be used to improve the performance of sentence semantic similarity detection especially in terms of accuracy.

Authors and Affiliations

M. Roostaee| Department of Computer Science and Engineering and IT, School of Electrical Engineering and Computer, Shiraz, Iran., S. M. Fakhrahmad| Department of Computer Science and Engineering and IT, School of Electrical Engineering and Computer, Shiraz, Iran., M. H. Sadreddini*| Department of Computer Science and Engineering and IT, School of Electrical Engineering and Computer, Shiraz, Iran., A. Khalili| Department of Computer Science and Engineering and IT, School of Electrical Engineering and Computer, Shiraz, Iran.

Keywords

Related Articles

The link between utopia and nostalgia and its reflection on literature

Nostalgia technically refers to human feeling of missing the past and those things s/he has lost. Different factors contribute to this feeling, including the human's social and political situation and, in general, his/he...

Ineffective and invalid wills from the holy quran perspective with a curdory look at its relevant law and regulations

Will means that a person recommends that after his death such and such things should be done, or such and such thing out of his property will be the property of such and such person, or will be spent for charitable purpo...

Impact of information technology on customer satisfaction in the economics and finance organization (a case study of Zah

The topic of this study is the impact of Information Technology on customer satisfaction in the Economics and Finance Organization. In this study, the impact of two important components of information technology (Intern...

Immunoglobulin in colostrum and health of newborn Calves

Cow’s colostrum contains the basic alimentary constituents; fat, protein, carbohydrate, minerals and vitamins, in addition to immunoglobulin, biological factors, hormones and other biological particles. These constitue...

Social cohesion, among ethnic groups of Iran, in urban and rural areas

Due to the existence of subcultures, and ethnic and religious diversity of the characteristics of the community; This article is a follow - up study of factors amplifier (consensus -causing), and threatening (consensus d...

Download PDF file
  • EP ID EP89
  • DOI 10.14196/sjr.v3i3.1259
  • Views 567
  • Downloads 39

How To Cite

M. Roostaee, S. M. Fakhrahmad, M. H. Sadreddini*, A. Khalili (2014). Efficient calculation of sentence semantic similarity: a proposed scheme based on machine learning approaches and NLP te. Scientific Journal of Review, 3(3), 94-106. https://europub.co.uk/articles/-A-89