ALGORITHM OF THE DETECTION OF THE OUTDATED INFORMATION ON THE BASIS OF ANALYSIS OF DATA SITES

Abstract

The paper proposes an algorithm that implements a method for identifying outdated information by analyzing the text data of a site's pages. It describes the algorithm of a software application that searches the site's pages for outdated information, together with the application's settings. The criteria for finding outdated information and the order in which they are checked are defined. Search queries can also be executed on nested pages. The main criteria for the relevance of site information are defined for several indicators. The process of running search queries is described; it is governed by separate settings: the page editing date, the start date of the search query, and the periodicity of the search query. After the preparatory steps for finding outdated information in a page section, a query is run against the database to select the pages whose texts will be searched for outdated information. In the course of its operation, the algorithm uses templates that convert text data into a single unified representation. The scientific novelty of the results is an algorithm for the automatic detection of outdated information based on analysis of the site's data. It differs from existing algorithms in that it detects outdated information not only from the timestamps of page creation and updating but also directly from the text content of the pages. The operating principle of the function in the software is described in detail, as are all the regular expressions the function uses to identify date markers in the text of the analyzed pages. The proposed algorithm is intended for use by the site's system administrators.
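The abstract does not reproduce the regular expressions or the checking criteria, so the following Python sketch only illustrates the general idea of finding date markers in page text, converting them to one unified representation, and comparing them with a reference date. The two patterns, the function names `extract_dates` and `is_outdated`, and the rule "the latest date in the text is older than `max_age_days`" are all assumptions made for illustration, not the author's implementation.

```python
import re
from datetime import date

# Hypothetical date-marker patterns (the paper's actual expressions are not
# given in the abstract). Each pattern is paired with the order of its groups.
DATE_PATTERNS = [
    (re.compile(r"\b(\d{4})-(\d{2})-(\d{2})\b"), ("y", "m", "d")),        # 2018-03-15
    (re.compile(r"\b(\d{2})[./](\d{2})[./](\d{4})\b"), ("d", "m", "y")),  # 15.03.2018
]


def extract_dates(text):
    """Return every valid date marker found in text as a datetime.date."""
    found = []
    for pattern, order in DATE_PATTERNS:
        for match in pattern.finditer(text):
            parts = dict(zip(order, map(int, match.groups())))
            try:
                # Unified representation: every marker becomes a date object.
                found.append(date(parts["y"], parts["m"], parts["d"]))
            except ValueError:
                # Skip impossible dates such as 31.02.2018.
                continue
    return found


def is_outdated(text, today, max_age_days=365):
    """Assumed criterion: the latest date in the text is older than the limit."""
    dates = extract_dates(text)
    if not dates:
        return False  # no date markers, nothing to judge by content
    return (today - max(dates)).days > max_age_days
```

A page text such as "Registration closes on 15.03.2018" would be flagged when checked against a reference date more than a year later, regardless of when the page itself was last edited. That is the content-based check the abstract contrasts with checks based only on page timestamps.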

Authors and Affiliations

Andrii Aronov

How To Cite

Andrii Aronov (2018). ALGORITHM OF THE DETECTION OF THE OUTDATED INFORMATION ON THE BASIS OF ANALYSIS OF DATA SITES. Телекомунікаційні та інформаційні технології, 123(2), 40-45. https://europub.co.uk/articles/-A-468495