CLUSTERING MODEL OF LOW-STRUCTURED TEXT DATA

Abstract

The article proposes a clustering model for collections of news text messages, as well as the corresponding bubble trap clustering algorithm. The essence of the proposed approach is to divide the entire vector space of text documents into shells of semantic clusters with minimal restrictions on the selection criteria in such a way that the volume of the semantic cluster and the position of its center remain unchanged in the process of adding new vectors to it, and the criterion of affiliation is a given constant accuracy metric.

Authors and Affiliations

Konstantin Otradnov, Dmitry Zhukov, Olga Novikova

Keywords

Related Articles

DYNAMIC ONTOLOGY BASED RETRIEVAL AND ANALYSIS OF INFORMATION ON SCIENCE AND ENGINEERING

RDF ontologies are proposed as modelling pattern for associations of innovation cycles being performed by science-intensive technical products. An appropriate ontology is applied as framework of personalized archives of...

THE FORMATION OF THE COMPONENTS OF THE FUZZY KNOWLEDGE BASE FOR DIGITAL PLAN-SCHEMES OF THE RESULTS OF SATELLITE MONITORING OF AGRICULTURAL LANDS

The methods of forming the components of fuzzy knowledge base in the form of basic digital plan-scheme of territories determined by the morphology of satellite images, natural data and the results of subjective assessmen...

MULTILAYER PARAMETRIC MODELS OF PROCESSES IN A POROUS CATALYST PELLET

In this paper, we perform a comparative analysis of new methods for constructing approximate solutions of differential equations. As a test problem, we chose the boundary value problem for a substantially nonlinear secon...

THE EXPERIENCE OF UPDATING THE EDUCATIONAL STANDARDS OF HIGHER EDUCATION IN THE FIELD OF ICT

The article describes the experience of Russian universities in integrating professional and educational standards in the design of higher education educational programs in the field of information and communication tech...

EVALUATION OF EXPERT JUDGEMENTS CONSISTENCY WHEN CONSTRUCTING A MEMBERSHIP FUNCTION OF FUZZY SET USING THE METHOD OF LEVEL SETS

The article deals with one of the expert methods for construction of a membership function of fuzzy set – method of level sets, developed by R. Yager. The feasibility of improving of this method, by adding a procedure fo...

Download PDF file
  • EP ID EP266420
  • DOI 10.25559/SITITO.2017.3.439
  • Views 109
  • Downloads 0

How To Cite

Konstantin Otradnov, Dmitry Zhukov, Olga Novikova (2017). CLUSTERING MODEL OF LOW-STRUCTURED TEXT DATA. Современные информационные технологии и ИТ-образование, 13(3), 100-115. https://europub.co.uk/articles/-A-266420