Analysis of the developed quantitative method for automatic attribution of scientific and technical text content written in Ukrainian

Abstract

A formal approach to text content attribution was proposed. The study was conducted on Ukrainian scientific and technical texts. The results of applying the developed algorithms for automatic attribution of text content, based on NLP and stylometry methods, were analyzed. The prospects and specifics of applying stylometric information technologies to text content attribution were considered. Quantitative content analysis of scientific and technical text draws on content monitoring and text content analysis based on NLP, Web-Mining and stylometry methods to identify the set of authors whose writing style is similar to that of the analyzed text fragment. This narrows the search range for the subsequent stylometry methods, which determine the degree to which the analyzed text belongs to a particular author.

The attribution method was decomposed based on an analysis of speech coefficients: lexical diversity, degree (measure) of syntactic complexity, speech coherence, and the indexes of exclusivity and concentration of the text. The following parameters of the author's style were also analyzed: the number of words in a given text, the total number of words in that text, the number of sentences, the number of prepositions, the number of conjunctions, the number of words with occurrence frequency 1, and the number of words with occurrence frequency 10 or more. Further experimental research is needed to test the proposed method for identifying keywords in texts of other categories: scholarly texts in the humanities, fiction, journalism, etc.
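The abstract names the style parameters and coefficients but does not give their formulas. The Python sketch below illustrates one way such quantities might be computed. The formulas are simple ratio forms commonly used in stylometry and are assumptions here, not necessarily the authors' definitions. The reading of "the number of words in a given text" as the count of distinct words is also an assumption. The function names, the tokenizer, and the short preposition and conjunction lists are hypothetical and deliberately incomplete.

```python
import re
from collections import Counter

# Illustrative, deliberately incomplete Ukrainian word lists (assumptions,
# not taken from the paper).
PREPOSITIONS = {"в", "у", "на", "з", "до", "від", "за", "для", "про",
                "під", "над", "при", "по", "без", "через"}
CONJUNCTIONS = {"і", "й", "та", "а", "але", "або", "чи", "що", "як", "бо"}

# A word is a run of letters, optionally joined by an apostrophe (as in "п'ять").
WORD_RE = re.compile(r"[^\W\d_]+(?:['’ʼ][^\W\d_]+)*")
# Naive sentence boundary: one or more terminal punctuation marks.
SENTENCE_END_RE = re.compile(r"[.!?…]+")


def style_parameters(text):
    """Count the raw style parameters listed in the abstract."""
    words = WORD_RE.findall(text.lower())
    freq = Counter(words)
    sentences = [s for s in SENTENCE_END_RE.split(text) if s.strip()]
    return {
        "total_words": len(words),
        "distinct_words": len(freq),  # assumed meaning of "number of words"
        "sentences": len(sentences),
        "prepositions": sum(freq[w] for w in PREPOSITIONS),
        "conjunctions": sum(freq[w] for w in CONJUNCTIONS),
        "freq_1_words": sum(1 for c in freq.values() if c == 1),
        "freq_10_plus_words": sum(1 for c in freq.values() if c >= 10),
    }


def style_coefficients(p):
    """Derive coefficients from the counts (assumed ratio forms)."""
    n, w, s = p["total_words"], p["distinct_words"], p["sentences"]
    if n == 0 or s == 0:
        raise ValueError("text must contain at least one word and one sentence")
    return {
        "lexical_diversity": w / n,
        "syntactic_complexity": 1 - s / n,
        "coherence": (p["prepositions"] + p["conjunctions"]) / (3 * s),
        "exclusivity_index": p["freq_1_words"] / w,
        "concentration_index": p["freq_10_plus_words"] / w,
    }


if __name__ == "__main__":
    sample = "Кіт сидить на столі. Пес і кіт сплять у саду."
    params = style_parameters(sample)
    print(params)
    print(style_coefficients(params))
```

A sketch like this only produces a feature vector for one text fragment. Comparing such vectors across candidate authors, and the choice of distance or threshold, is outside what the abstract specifies.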

Authors and Affiliations

Vasyl Lytvyn, Victoria Vysotska, Petro Pukach, Zinovii Nytrebych, Ihor Demkiv, Andriy Senyk, Oksana Malanchuk, Svitlana Sachenko, Roman Kovalchuk, Nadiia Huzyk

  • EP ID EP528250
  • DOI 10.15587/1729-4061.2018.149596

How To Cite

Vasyl Lytvyn, Victoria Vysotska, Petro Pukach, Zinovii Nytrebych, Ihor Demkiv, Andriy Senyk, Oksana Malanchuk, Svitlana Sachenko, Roman Kovalchuk, Nadiia Huzyk (2018). Analysis of the developed quantitative method for automatic attribution of scientific and technical text content written in Ukrainian. Восточно-Европейский журнал передовых технологий (Eastern-European Journal of Enterprise Technologies), 6(2), 19-31. https://europub.co.uk/articles/-A-528250