THE DETERMINATION METHOD FOR CONTEXTUAL MEANINGS OF WORDS AND DOCUMENTS

Abstract

Problems and methods are considered for program context recognition of words and text documents. Survey of existent text processing methods is provided, simple numeric algorithm is given for determination of words and documents context with a help of semantic net, having a form of tree type graph. Semantic net structure is described in detail. Given semantic net is needed to fix basic word W1 context by means of words-meaning W2 coupled with it. Words W2 represent possible W1 context meanings. For every word W2 correspond some words-characteristics W3. At the context calculation the distances between words W2 and W3 are taken into account. The distances are measured in words between. Every word W3 has metrics, according to the concept proximity to W2. There is a table of words W1,W2 and W3 with their metrics values. At context document analyses there was taken into account case or number words variations. Simple formula for context calculation is presented. Method of results proofing with a help of Chebyshev inequality is also provided. The context analyses method was checked by Monte-Carlo simulations. Tables of investigation results are provided and some recommendation for algorithm parameters tuning and optimization are also given. The analyses showed that proposed method is quite effective for context estimation at text analyses, and for any systems, where one needs computer recognition of context.

Authors and Affiliations

Elizaveta Dorenskaya, Yuri Semenov

Keywords

Related Articles

SIMPLE HEURISTIC ALGORITHM FOR DYNAMIC VM REALLOCATION IN IAAS CLOUDS

The rapid development of cloud technologies and its high prevalence in both commercial and academic areas have stimulated active research in the domain of optimal cloud resource management. One of the most active researc...

ABOUT PERSONIFICATION OF TEACHING SCHOOLCHILDREN PROGRAMMING

Information education of the personality is one of the most mobile types of education depending on the dominating paradigm of development of society, degree of development and the prospects of further development of econ...

INFORMATION TECHNOLOGY FORECAST THE EMERGENCY OF AIR POLLUTION EXHAUST GASES OF SHIPS AND VEHICLE

Information technology of monitoring of air environment quality based on the solution of a differential equation of atmospheric diffusion, measurements of concentrations of pollutants, the intensity and structure of tran...

JINR CLOUD SERVICE FOR SCIENTIFIC AND ENGINEERING COMPUTATIONS

Pretty often small research scientific groups do not have access to powerful enough computational resources required for their research work to be productive. Global computational infrastructures used by large scientific...

APPLICATION OF INFORMATION TECHNOLOGIES IN INTERNATIONAL CARGO CARRIAGE

Railway carriage is the main type of long-haul traffic in international carriage, thus, the key part of cargo turnover accounts for railway traffic. This makes it relevant to develop and actualize the Russian export pote...

Download PDF file
  • EP ID EP523561
  • DOI 10.25559/SITITO.14.201804.896-902
  • Views 114
  • Downloads 0

How To Cite

Elizaveta Dorenskaya, Yuri Semenov (2018). THE DETERMINATION METHOD FOR CONTEXTUAL MEANINGS OF WORDS AND DOCUMENTS. Современные информационные технологии и ИТ-образование, 14(4), 896-902. https://europub.co.uk/articles/-A-523561