Implantation of indexing optimization technology for highly specialized terms based on Metaphone phonetical algorithm

Abstract

When databases are compiled, for example for healthcare institutions, a common problem arises in entering and subsequently processing the names and surnames of doctors and patients, which are highly specific in both pronunciation and spelling. The reason is that personal names are not unique, their spelling does not follow any phonetic rules, and their length may differ from one language to another. With the advent of the Internet the problem has become critical; it can lead, for instance, to multiple copies of an e-mail being sent to the same address. The problem can be addressed with phonetic word-comparison algorithms (Daitch-Mokotoff, Soundex, NYSIIS, Polyphone, and Metaphone), together with the Levenshtein and Jaro algorithms and Q-gram-based algorithms, which compute distances between words. The most widely used of these are Soundex and Metaphone, which index words by their sound, taking pronunciation rules into account. Using the Metaphone algorithm, an attempt has been made to optimize phonetic search for fuzzy-matching tasks, such as data deduplication in databases and registries, in order to reduce the number of errors caused by incorrectly entered surnames. An analysis of the most common surnames shows that some of them are of Ukrainian or Russian origin. However, the rules by which names are pronounced and written in Ukrainian differ radically from those assumed by the basic algorithms for English, and differ considerably from those for Russian. A phonetic algorithm should therefore account, first of all, for the peculiarities of how Ukrainian surnames are formed, which is especially relevant now.
The paper reports the results of an experiment on generating phonetic indexes, as well as the performance gain obtained when the generated indexes are used. A method for adapting the search to other domains and to several related languages is presented separately, using a search for medical preparations as an example.
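
The idea of a phonetic index for deduplication can be illustrated with a minimal sketch. It is not the authors' adapted Metaphone: it uses classic American Soundex, which the abstract names as a related algorithm, and it assumes surnames have already been transliterated to Latin script. The function names `soundex` and `build_index` and the sample surnames are hypothetical and chosen only for illustration.

```python
from collections import defaultdict

# Classic American Soundex digit groups (an illustration only; the paper
# itself adapts Metaphone to Ukrainian surnames).
_CODES = {}
for _letters, _digit in (("BFPV", "1"), ("CGJKQSXZ", "2"), ("DT", "3"),
                         ("L", "4"), ("MN", "5"), ("R", "6")):
    for _ch in _letters:
        _CODES[_ch] = _digit


def soundex(name: str) -> str:
    """Return the 4-character Soundex key of a Latin-script name."""
    letters = [ch for ch in name.upper() if "A" <= ch <= "Z"]
    if not letters:
        return ""
    key = letters[0]
    prev = _CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "HW":
            continue  # H and W do not break a run of equal codes
        code = _CODES.get(ch, "")  # vowels and Y map to "" and act as separators
        if code and code != prev:
            key += code
        prev = code
    return (key + "000")[:4]  # pad with zeros, truncate to 4 characters


def build_index(names):
    """Group names by phonetic key so that likely duplicates share a bucket."""
    index = defaultdict(list)
    for name in names:
        index[soundex(name)].append(name)
    return index


# Hypothetical sample data: spelling variants of the same surnames.
names = ["Kovalenko", "Kovalenco", "Kovalienko",
         "Shevchenko", "Schevchenko", "Bondarenko"]
index = build_index(names)

print(soundex("Kovalenko"))          # K145
print(index[soundex("Schevchenko")])  # ['Shevchenko', 'Schevchenko']
```

A lookup then compares a query only against the names in its own bucket instead of the whole table, which is where the performance gain of an index comes from. Variants that Soundex fails to merge (for example, spellings that differ in the first letter) are the kind of case a language-aware algorithm is meant to handle.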

Authors and Affiliations

Volodymyr Buriachok, Matin Hadzhyiev, Pavlo Skladannyi, Lidiia Kuzmenko

  • EP ID EP667102
  • DOI 10.15587/1729-4061.2019.181943

How To Cite

Volodymyr Buriachok, Matin Hadzhyiev, Pavlo Skladannyi, Lidiia Kuzmenko (2019). Implantation of indexing optimization technology for highly specialized terms based on Metaphone phonetical algorithm. Eastern-European Journal of Enterprise Technologies, 5(2), 43-50. https://europub.co.uk/articles/-A-667102