ALGORITHM OF THE DETECTION OF THE OUTDATED INFORMATION ON THE BASIS OF ANALYSIS OF DATA SITES

Abstract

The paper proposes an algorithm for implementing the method of identifying outdated information on the basis of the analysis of text data of the pages of the site. The algorithm of the software application for the search of outdated information on the pages of the site, which describes its settings. The criteria for finding outdated information and the sequence of their checks are determined. It is foreseen to execute search queries in nested pages. The main criteria of relevance of the site information on different indicators are determined. Describes the process of running search queries, which is governed by separate settings: date editing pages; start date of the search query; periodicity of the search query. After performing the preparatory steps to find out the outdated information in the page section, a search query is performed in the database to select the pages in which the texts will search for outdated information. As a result of the operation of the algorithm, templates are used that convert text data into a single unified representation. The scientific novelty of the results obtained is that an algorithm for the automatic detection of outdated information on the basis of information analytical analysis of the site's data, which differs from the existing ones, that the detection of outdated information is analyzed not only using time indices of the time of creation / updating of pages of the site, but directly the content of text page. The principle of the function in the software is described in detail, all regular expressions are described, which is used by the function to identify date markers in the text data of the analyzed pages. The proposed algorithm is intended for use by system administrators of the site.

Authors and Affiliations

Andrii Aronov

Keywords

Related Articles

THE QUANTITATIVE OPTIMIZATION OF INFORMATION SYSTEM RESOURCES FOR EFFECTIVE DECISION SUPPORT

The article is devoted to the optimization of information system resources by quantitative factor. This optimization is carried out in the local issue of calculating rational volumes of information. For this purpose, the...

PECULIARITIES OF USING THE LAGRANGIAN POINTS IN MODERN SPACEFLIEGHT

The article discusses the possibilitу of using the Lagrangian points (libration points ) in outer space ‒ starts of rockets, spacecraft for various purposes, emergencу situations, disposal of space debris. The advantages...

DEFASIFICATION IN INFORMATIONAL TECHNOLOGIES OF FUZZY CONTROL ON THE BASIS OF MEMBERSHIP FUNCTIONS OF SEVERAL ARGUMENTS

Using the information technologies for the fuzzy control of complex systems with depended characteristics and restrictions on control variables requires the applying of fuzzy logic with the membership functions of severa...

Modern approach to the glimmer discharge elementary processes mechanism of radio-techical devices functional elements

It was shown an innovative approach to the glimmer discharge elementary processes mechanism aimed to improve the functioning of the devices in radio engineering and telecommunication systems. A row of contradictions in t...

DIAGNOSTIC MODEL OF THE GENERALIZED MODULE OF THE DIGITAL DEVICE BASED ON THE IMPROVED ENERGY DYNAMICAL SPECTRAL METHOD

In the article the method of constructing a diagnostic model of the logic elements of a digital device for the advanced spectral method of diagnosing on the basis of transient processes in the power bus is developed. The...

Download PDF file
  • EP ID EP468495
  • DOI -
  • Views 90
  • Downloads 0

How To Cite

Andrii Aronov (2018). ALGORITHM OF THE DETECTION OF THE OUTDATED INFORMATION ON THE BASIS OF ANALYSIS OF DATA SITES. Телекомунікаційні та інформаційні технології, 123(2), 40-45. https://europub.co.uk/articles/-A-468495