Implantation of indexing optimization technology for highly specialized terms based on Metaphone phonetical algorithm
Journal Title: Eastern-European Journal of Enterprise Technologies (Восточно-Европейский журнал передовых технологий) - Year 2019, Vol 5, Issue 2
Abstract
When compiling databases, for example to meet the needs of healthcare establishments, a common problem arises with entering and further processing the names and surnames of doctors and patients, which are highly specialized in both pronunciation and spelling. This is because names and surnames are not unique, their notation does not follow any rules of phonetics, and their length may differ between languages. With the advent of the Internet, this situation has become critical and can lead to multiple copies of e-mails being sent to one address. The problem can be solved by using phonetic algorithms for comparing words (Daitch-Mokotoff, SoundEx, NYSIIS, Polyphone, and Metaphone), as well as the Levenshtein and Jaro algorithms and Q-gram-based algorithms, which make it possible to find distances between words. The most widespread among them are the SoundEx and Metaphone algorithms, which index words based on their sound, taking into account the rules of pronunciation. By applying the Metaphone algorithm, an attempt has been made to optimize phonetic search for fuzzy-matching tasks, for example data deduplication in various databases and registries, in order to reduce the number of errors caused by incorrect input of surnames. An analysis of the most common surnames reveals that some of them are of Ukrainian or Russian origin. At the same time, the rules by which names are pronounced and written, for example in Ukrainian, differ radically from those underlying the basic algorithms for English and differ quite significantly from those for the Russian language. That is why a phonetic algorithm should first of all take into account the peculiarities of how Ukrainian surnames are formed, which is of special relevance now.
The paper reports results from an experiment to generate phonetic indexes, as well as the performance gains obtained when using the generated indexes. A method for adapting the search to other areas and several related languages is presented separately, using the search for medical preparations as an example.
Authors and Affiliations
Volodymyr Buriachok, Matin Hadzhyiev, Pavlo Skladannyi, Lidiia Kuzmenko