COMPARABLE EVALUATION OF CONTEMPORARY CORPUS-BASED AND KNOWLEDGE-BASED SEMANTIC SIMILARITY MEASURES OF SHORT TEXTS

Journal Title: Journal of Information Technology and Application (JITA) - Year 2011, Vol 1, Issue 1

Abstract

This paper presents methods for measuring the semantic similarity of texts, where we evaluated different approaches based on existing similarity measures. On one side word similarity was calculated by processing large text corpuses and on the other, commonsense knowledgebase was used. Given that a large fraction of the information available today, on the Web and elsewhere, consists of short text snippets (e.g. abstracts of scientifi c documents, image captions or product descriptions), where commonsense knowledge has an important role, in this paper we focus on computing the similarity between two sentences or two short paragraphs by extending existing measures with information from the ConceptNet knowledgebase. On the other hand, an extensive research has been done in the fi eld of corpus-based semantic similarity, so we also evaluated existing solutions by imposing some modifi cations. Through experiments performed on a paraphrase data set, we demonstrate that some of proposed approaches can improve the semantic similarity measurement of short text.

Authors and Affiliations

Bojan Furlan, Vladimir Sivački, Davor Jovanović, Boško Nikolić

Keywords

Related Articles

T he Use of Energy Storage Devices of Uncontrolled Type on the Mosc ow Metro (Theory and Practice)

The problem of increasing energy saving and energy efficiency in the system of traction power supply of the Moscow Metro is considered due to the use of energy storage devices of uncontrolled type. The results of simulat...

APPLICATIONS OF SMARTPHONES FOR UBIQUITOUS HEALTH MONITORING AND WELLBEING MANAGEMENT

Advances in smartphone technology and data communications facilitate the use of ubiquitous health monitoring and mobile health application as a solution of choice for the overwhelming problems of the healthcare system. I...

MULTIDIMENSIONAL NUMBERS AND SEMANTIC NUMERATION SYSTEMS:THEORETICAL FOUNDATION AND APPLICATION

In this article, we present a new class of numeration systems, namely Semantic Numeration Systems. The methodological background and theoretical foundations of such systems are considered. The concepts of abstract entity...

EXPERT SYSTEMS IN A CLOUD COMPUTING ENVIRONMENT MODEL FOR FAST-PACED DECISION MAKING

In this paper the use of cloud computing technologies and expert systems will be analyzed. Furthermore, the use of expert systems in a cloud computing environment will be addressed. Speci􀏐ically a Cloud-Based Expert Syst...

WATER TREEING IN EXTRUDED CABLE INSULATION AS REHBINDER ELECTRICAL EFFECT

The paper contains systematic comparison of signs and properties of the water treeing phenomenon (the basic mechanism of degradation of medium voltage electric cable extrudered insulation which develops under combined ac...

Download PDF file
  • EP ID EP244668
  • DOI -
  • Views 102
  • Downloads 0

How To Cite

Bojan Furlan, Vladimir Sivački, Davor Jovanović, Boško Nikolić (2011). COMPARABLE EVALUATION OF CONTEMPORARY CORPUS-BASED AND KNOWLEDGE-BASED SEMANTIC SIMILARITY MEASURES OF SHORT TEXTS. Journal of Information Technology and Application (JITA), 1(1), 65-71. https://europub.co.uk/articles/-A-244668