CLUSTERING MODEL OF LOW-STRUCTURED TEXT DATA

Abstract

The article proposes a clustering model for collections of news text messages, as well as the corresponding bubble trap clustering algorithm. The essence of the proposed approach is to divide the entire vector space of text documents into shells of semantic clusters with minimal restrictions on the selection criteria in such a way that the volume of the semantic cluster and the position of its center remain unchanged in the process of adding new vectors to it, and the criterion of affiliation is a given constant accuracy metric.

Authors and Affiliations

Konstantin Otradnov, Dmitry Zhukov, Olga Novikova

Keywords

Related Articles

ALIEN AND SUPERCOMPUTER TITAN INTERACTION TECHNOLOGY

The next launch of the LHC involves using much more resources than GRID can provide. To solve this problem, ALICE is engaged in a project to expand the existing computing model in order to include additional resources in...

A METHOD OF CONSTRUCTING A BLOCK CIPHERS ROUND FUNCTION’S POLYNOMIAL OVER A FINITE FIELD

The work outlines the method of construction of round function as a polynomial of one variable over the finite field. The proposed method is based on the calculation of the initial cryptographic transformation at special...

ON IMPROVEMENT OF THE SYSTEM OF HIGHER PROFESSIONAL EDUCATION IN THE LIGHT OF THE NEW DOCTRINE OF INFORMATION SECURITY OF RUSSIA

The article analyzes the main provisions of the new doctrine of information security of Russia and the basic ways of its realization in the development of science and education in the sphere information security of socie...

CALCULATION OF HYDRODYNAMIC INDICATORS OF VORTEX GRANULATORS WORKING: PROGRAM IMPLEMENTATION OF THE MATHEMATICAL MODEL

The article deals with the software implementation of the author's mathematical model for calculating the trajectory of granule motion in a free and straitened mode, the residence time of granules in the working space of...

CLOUD SERVICES FOR NATURAL LANGUAGE PROCESSING

The paper presents the results of experiments conducted with the aim of a comparative analysis of the performance of the existing cloud services for natural language processing in Russian. The article provides an overvie...

Download PDF file
  • EP ID EP266420
  • DOI 10.25559/SITITO.2017.3.439
  • Views 130
  • Downloads 0

How To Cite

Konstantin Otradnov, Dmitry Zhukov, Olga Novikova (2017). CLUSTERING MODEL OF LOW-STRUCTURED TEXT DATA. Современные информационные технологии и ИТ-образование, 13(3), 100-115. https://europub.co.uk/articles/-A-266420