Analysis of the developed quantitative method for automatic attribution of scientific and technical text content written in Ukrainian
Journal Title: Восточно-Европейский журнал передовых технологий - Year 2018, Vol 6, Issue 2
Abstract
<p>A formal approach was proposed to implement text content attribution. The study was conducted with Ukrainian scientific and technical texts. The results of application of the designed algorithms of automatic attribution of the text content based on the NLP and stylemetry methods were analyzed. Prospects and features of application of stylemetry information technologies for attribution of the text content were considered. Quantitative content analysis of scientific and technical text content takes advantage of content monitoring and text content analysis based on NLP, Web-Mining and stylemetry methods to identify the multitude of authors whose talking style is similar to that of the analyzed text fragment. This narrows the range of search for further use in the stylemetry methods to determine the degree of belonging of the analyzed text to a particular author.</p><p>Decomposition of the attribution method was carried out based on analysis of such talking coefficients as lexical diversity, degree (measure) of syntactic complexity, talking coherence, indexes of exclusivity and concentration of the text. At the same time, author's style parameters such as the number of words in a certain text, the total number of words of this text, the number of sentences, the number of prepositions, the number of conjunctions, the number of words with occurrence frequency 1, the number of words with occurrence frequency 10 or more were analyzed. Further experimental study requires testing of the proposed method in identifying keywords of texts of other categories: scientific humanitarian, artistic, journalistic, etc.</p>
Authors and Affiliations
Vasyl Lytvyn, Victoria Vysotska, Petro Pukach, Zinovii Nytrebych, Ihor Demkiv, Andriy Senyk, Oksana Malanchuk, Svitlana Sachenko, Roman Kovalchuk, Nadiia Huzyk
Development of the method for rapid detection of hazardous atmospheric pollution of cities with the help of recurrence measures
The method for rapid detection of hazardous pollution of the atmosphere of cities, which is based on dynamic measures of recurrence (repeatability) of the states of the pollution concentration vector, was developed. The...
Development of a mathematical model for cost distribution of maintenance and repair of electrical equipment
The research is devoted to the development of a model for cost distribution of maintenance and repair of electrical equipment when making decisions on the management of the electric power system state. The decrease in th...
Preparation and preliminary analysis of data on energy consumption by municipal buildings
<p class="a">Systematization of data on energy consumption by buildings of different purposes makes it possible to investigate processes from the standpoint of efficient use of energy resources in order to ensure comfort...
Development of chemical methods for individual decontamination of organophosphorus compounds
<p>The methods of individual decontamination of organophosphorus esters of paralytic action were studied using the decontamination of paraoxon (O, O-diethyl-O-4-nitrophenylphosphate) and methyl parathion (O, O-dimethyl-O...
Development of cleaning methods complex of industrial gas pipelines based on the analysis of their hydraulic efficiency
<p>The majority of gas and gas condensate fields of Ukraine are developed by pressure depletion, which makes it possible to stabilize production only in conditions of low working pressures at the wellhead. In turn, the w...