COMPARABLE EVALUATION OF CONTEMPORARY CORPUS-BASED AND KNOWLEDGE-BASED SEMANTIC SIMILARITY MEASURES OF SHORT TEXTS
Journal Title: Journal of Information Technology and Application (JITA) - Year 2011, Vol 1, Issue 1
Abstract
This paper presents methods for measuring the semantic similarity of texts, where we evaluated different approaches based on existing similarity measures. On one side word similarity was calculated by processing large text corpuses and on the other, commonsense knowledgebase was used. Given that a large fraction of the information available today, on the Web and elsewhere, consists of short text snippets (e.g. abstracts of scientifi c documents, image captions or product descriptions), where commonsense knowledge has an important role, in this paper we focus on computing the similarity between two sentences or two short paragraphs by extending existing measures with information from the ConceptNet knowledgebase. On the other hand, an extensive research has been done in the fi eld of corpus-based semantic similarity, so we also evaluated existing solutions by imposing some modifi cations. Through experiments performed on a paraphrase data set, we demonstrate that some of proposed approaches can improve the semantic similarity measurement of short text.
Authors and Affiliations
Bojan Furlan, Vladimir Sivački, Davor Jovanović, Boško Nikolić
T he Use of Energy Storage Devices of Uncontrolled Type on the Mosc ow Metro (Theory and Practice)
The problem of increasing energy saving and energy efficiency in the system of traction power supply of the Moscow Metro is considered due to the use of energy storage devices of uncontrolled type. The results of simulat...
APPLICATIONS OF SMARTPHONES FOR UBIQUITOUS HEALTH MONITORING AND WELLBEING MANAGEMENT
Advances in smartphone technology and data communications facilitate the use of ubiquitous health monitoring and mobile health application as a solution of choice for the overwhelming problems of the healthcare system. I...
MULTIDIMENSIONAL NUMBERS AND SEMANTIC NUMERATION SYSTEMS:THEORETICAL FOUNDATION AND APPLICATION
In this article, we present a new class of numeration systems, namely Semantic Numeration Systems. The methodological background and theoretical foundations of such systems are considered. The concepts of abstract entity...
EXPERT SYSTEMS IN A CLOUD COMPUTING ENVIRONMENT MODEL FOR FAST-PACED DECISION MAKING
In this paper the use of cloud computing technologies and expert systems will be analyzed. Furthermore, the use of expert systems in a cloud computing environment will be addressed. Speciically a Cloud-Based Expert Syst...
WATER TREEING IN EXTRUDED CABLE INSULATION AS REHBINDER ELECTRICAL EFFECT
The paper contains systematic comparison of signs and properties of the water treeing phenomenon (the basic mechanism of degradation of medium voltage electric cable extrudered insulation which develops under combined ac...