Statistics of words occurrences in natural and random texts
Journal Title: Vìsnik Nacìonalʹnogo unìversitetu "Lʹvìvsʹka polìtehnìka". Serìâ Ìnformacìjnì sistemi ta merežì - Year 2017, Vol 872, Issue
Abstract
We experimentally study the statistical distributions that describe the occurrence of words in a number of natural texts, as well as in random texts derived from them. The probability mass function of the intervals between occurrences of words is shown to be practically the same for the natural and random texts and to exhibit a fat tail, which is inconsistent with a purely stochastic character of those intervals. Significant deviations of the vocabulary growth dynamics found for the natural and random texts from the dynamics predicted by the power-law Heaps' law, together with a crossover found in the dictionary of one of the natural texts, confirm the need to generalize that law.
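The two quantities named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' procedure: the function names, the whitespace tokenization, the toy sentence, and the use of a plain shuffle to build a "random text" are all assumptions made for illustration, and "interval" is read here as the distance in tokens between successive occurrences of the same word. The vocabulary growth curve V(n) is the quantity that Heaps' law approximates by a power function of the form V(n) ≈ K·n^β.

```python
import random


def vocabulary_growth(tokens):
    """Return V(n), the number of distinct words among the first n tokens, for n = 1..N."""
    seen = set()
    growth = []
    for tok in tokens:
        seen.add(tok)
        growth.append(len(seen))
    return growth


def intervals(tokens, word):
    """Distances (in tokens) between successive occurrences of `word`."""
    positions = [i for i, tok in enumerate(tokens) if tok == word]
    return [b - a for a, b in zip(positions, positions[1:])]


def shuffled(tokens, seed=0):
    """Assumed construction of a 'random text': the same tokens in random order."""
    rng = random.Random(seed)
    out = list(tokens)
    rng.shuffle(out)
    return out


# Toy input (hypothetical); a real study would use a full tokenized text.
text = "the cat sat on the mat and the dog sat on the log"
tokens = text.split()

natural_growth = vocabulary_growth(tokens)
random_growth = vocabulary_growth(shuffled(tokens))
natural_intervals = intervals(tokens, "the")
```

Comparing `natural_growth` with `random_growth`, and the interval histograms of the two token sequences, mirrors the natural-versus-random comparison described above. Shuffling preserves word frequencies and the final vocabulary size, so any differences come only from word order.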
Authors and Affiliations
Oleg Kushnir, Mykola Alfavitskyi, Viktor Dzikovskyi, Lyubomyr Ivanitskyi, Sergiy Rykhlyuk, Volodymyr Sokulskyi
Ontology data cleansing
This article describes the steps for cleansing data in the DSS. Ontology concepts for data cleansing are proposed and described. Data cleansing methods and technologies are analyzed at every stage of the pro...
Method for building intelligent agents based on adaptive ontologies
The article solves the problem of building an intelligent agent whose knowledge base core is an ontology. Such systems are classified according to their functioning. For each class appropriate mat...
Selection of methods for Searching Some or Similar Images
The article describes the research of image analysis methods. The methods of indexing images for the search of duplicate images, as well as methods for finding similar images based on the definition of key points are des...
Game Method of Coalitions Formation in Multi-agent Systems
A stochastic game method of coalition formation in multi-agent systems is proposed. An adaptive algorithm for solving the stochastic game is developed. Computer modelling of the stochastic game is carried out. The parameter influenc...
Modeling behavioral strategies of competitive companies in the market of tourist services
We have considered the peculiarities of constructing mathematical models that describe different behavioral strategies of competitive companies in the market of tourist services. We have found that at the market of...