Analysis of the developed quantitative method for automatic attribution of scientific and technical text content written in Ukrainian
Journal Title: Восточно-Европейский журнал передовых технологий - Year 2018, Vol 6, Issue 2
Abstract
<p>A formal approach was proposed to implement text content attribution. The study was conducted with Ukrainian scientific and technical texts. The results of application of the designed algorithms of automatic attribution of the text content based on the NLP and stylemetry methods were analyzed. Prospects and features of application of stylemetry information technologies for attribution of the text content were considered. Quantitative content analysis of scientific and technical text content takes advantage of content monitoring and text content analysis based on NLP, Web-Mining and stylemetry methods to identify the multitude of authors whose talking style is similar to that of the analyzed text fragment. This narrows the range of search for further use in the stylemetry methods to determine the degree of belonging of the analyzed text to a particular author.</p><p>Decomposition of the attribution method was carried out based on analysis of such talking coefficients as lexical diversity, degree (measure) of syntactic complexity, talking coherence, indexes of exclusivity and concentration of the text. At the same time, author's style parameters such as the number of words in a certain text, the total number of words of this text, the number of sentences, the number of prepositions, the number of conjunctions, the number of words with occurrence frequency 1, the number of words with occurrence frequency 10 or more were analyzed. Further experimental study requires testing of the proposed method in identifying keywords of texts of other categories: scientific humanitarian, artistic, journalistic, etc.</p>
Authors and Affiliations
Vasyl Lytvyn, Victoria Vysotska, Petro Pukach, Zinovii Nytrebych, Ihor Demkiv, Andriy Senyk, Oksana Malanchuk, Svitlana Sachenko, Roman Kovalchuk, Nadiia Huzyk
The use of golden flax seeds and oats sourbread in the production of wheat bread
<p>In the course of development of bakery products enriched with physiologically active substances of non-traditional types of raw materials, cereal and oil-bearing crops enjoy popularity. The actual direction can be a c...
Studying the efficiency of soil decontamination when using a device with the biosorbent “econadin
<p class="a">We have investigated the efficiency of soil decontamination from petroleum products using the patented perforated device of cylindrical shape with a diameter of 0.04 m, with an area of openings of 0.04 m<sup...
A comparative analysis of the assessment results of the competence of technical experts by methods of analytic hierarchy process and with using the Rasch model
<p>Known scales (criteria) for assessing the competence of experts in the field of technical regulation using the method of analytical hierarchy process (AHP) and Rasch model are investigated. The main features of constr...
Development of the method for geometric modeling of S-shaped camber line of the profile of an axial compressor blade
The method for geometric modeling of the S-shaped camber line of the profile of an axial compressor blade, which is a compound curve formed from three sections, was developed. Each of these sections is modeled in the nat...
Chemical deposition of CDS films from ammoniac-thiourea solutions
<p>This paper investigates the process of chemical deposition of CdS films from ammonia-thiourea solutions. It was established that a change in the turbidity of solution occurs in the process of chemical deposition. The...