Analysis of the developed quantitative method for automatic attribution of scientific and technical text content written in Ukrainian

Abstract

<p>A formal approach was proposed to implement text content attribution. The study was conducted with Ukrainian scientific and technical texts. The results of application of the designed algorithms of automatic attribution of the text content based on the NLP and stylemetry methods were analyzed. Prospects and features of application of stylemetry information technologies for attribution of the text content were considered. Quantitative content analysis of scientific and technical text content takes advantage of content monitoring and text content analysis based on NLP, Web-Mining and stylemetry methods to identify the multitude of authors whose talking style is similar to that of the analyzed text fragment. This narrows the range of search for further use in the stylemetry methods to determine the degree of belonging of the analyzed text to a particular author.</p><p>Decomposition of the attribution method was carried out based on analysis of such talking coefficients as lexical diversity, degree (measure) of syntactic complexity, talking coherence, indexes of exclusivity and concentration of the text. At the same time, author's style parameters such as the number of words in a certain text, the total number of words of this text, the number of sentences, the number of prepositions, the number of conjunctions, the number of words with occurrence frequency 1, the number of words with occurrence frequency 10 or more were analyzed. Further experimental study requires testing of the proposed method in identifying keywords of texts of other categories: scientific humanitarian, artistic, journalistic, etc.</p>

Authors and Affiliations

Vasyl Lytvyn, Victoria Vysotska, Petro Pukach, Zinovii Nytrebych, Ihor Demkiv, Andriy Senyk, Oksana Malanchuk, Svitlana Sachenko, Roman Kovalchuk, Nadiia Huzyk

Keywords

Related Articles

Development of geo­model for concentration determination of hazardous chemicals in the atmosphere

A critical analysis of the approaches to the development of a model for determining the concentration of hazardous chemicals (HC) in the atmosphere, which are the basis of computer­aided environmental monitoring systems...

Simulation of structure formation in the Fe–C–Cr–Ni–Si surfacing materials

<p>The paper investigates the formation of equilibrium phase state in the surfacing materials 300Cr25Ni3Si3 and 500Cr40Ni40Si2BZr, obtained by electric arc surfacing that employs powder tapes PL AN-101 and PL AN-111. Thi...

Development of knowledge­based control systems with built­in functions of rules verification and correction

<p>Two improved models of control rules were proposed. A model in a form of AND/OR graph; in contrast to the known graphical model of general rules, is based on dividing the rules into groups based on the controlled obje...

Determining features of application of functional electrochemical coatings in technologies of surface treatment

<p>Approaches to the use of electrochemical coatings in surface treatment technologies are analyzed. It is shown that directed surface modification allows expanding the functional properties of the treated material, in p...

Development of method of multifactor classification of transport and logistic processes

<p class="a">A method of classification of a set of objects and/or processes in transport and logistics systems on the basis of a multifactor analysis was proposed.</p><p class="a">Combination of the methods of statistic...

Download PDF file
  • EP ID EP528250
  • DOI 10.15587/1729-4061.2018.149596
  • Views 44
  • Downloads 0

How To Cite

Vasyl Lytvyn, Victoria Vysotska, Petro Pukach, Zinovii Nytrebych, Ihor Demkiv, Andriy Senyk, Oksana Malanchuk, Svitlana Sachenko, Roman Kovalchuk, Nadiia Huzyk (2018). Analysis of the developed quantitative method for automatic attribution of scientific and technical text content written in Ukrainian. Восточно-Европейский журнал передовых технологий, 6(2), 19-31. https://europub.co.uk/articles/-A-528250