ON APPROACHES TO ANALYZING DEMOGRAPHIC DATA USING MACHINE LEARNING
Journal Title: Современные информационные технологии и ИТ-образование (Modern Information Technologies and IT-Education) - Year 2018, Vol 14, Issue 4
Abstract
Demographic data sets are fairly accessible and can be analyzed with modern artificial intelligence (AI) and machine learning (ML) technologies. However, they cannot be used for these purposes without special preparation. Preparation includes working with features, handling missing data, normalization, and feature engineering. Using the data set "Distribution of the population by age groups" as an example, the article shows the characteristics of demographic data and suggests approaches for preparing them for subsequent analysis with AI and ML technologies. The study produced the following results. Demographic data have a number of characteristics that can and should be used to improve the quality of data sets before they are processed with AI and ML technologies. These characteristics are, first, temporal ordering; second, predictable limits of change, which are determined by socio-economic factors; and third, the absence of significant differences between adjacent observed values. Demographic data are influenced by sociopolitical and economic processes in society in different historical periods, which must be taken into account when working with them. Data that can be attributed to particular historical periods deserve special attention, since their values can either improve the quality of the data set for machine processing or cause systematic and random errors to appear and grow. The proposed approaches can be applied in practice to population forecasting, determining the structure and composition of age groups, estimating life expectancy, determining the size and composition of the working-age (economically active) population, and a number of other tasks.
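The preparation steps named in the abstract can be illustrated with a minimal sketch. It is not the authors' procedure: the function names, the sample values, and the choice of linear interpolation and min-max scaling are all assumptions made for illustration. The sketch relies only on the characteristics the abstract lists: the series is ordered in time, adjacent values differ little, and changes stay within predictable limits.

```python
from typing import List, Optional


def interpolate_missing(series: List[Optional[float]]) -> List[float]:
    """Fill gaps in a time-ordered series by linear interpolation.

    Assumes adjacent observations differ little, so a straight line between
    the nearest known neighbours is a reasonable guess. Leading and trailing
    gaps are filled with the nearest observed value.
    """
    known = [i for i, v in enumerate(series) if v is not None]
    if not known:
        raise ValueError("series has no observed values")
    filled: List[float] = []
    for i, v in enumerate(series):
        if v is not None:
            filled.append(float(v))
            continue
        left = max((k for k in known if k < i), default=None)
        right = min((k for k in known if k > i), default=None)
        if left is None:
            filled.append(float(series[right]))
        elif right is None:
            filled.append(float(series[left]))
        else:
            weight = (i - left) / (right - left)
            filled.append(series[left] + weight * (series[right] - series[left]))
    return filled


def min_max_normalize(values: List[float]) -> List[float]:
    """Scale values to the range [0, 1]; a constant series maps to zeros."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


def year_over_year_change(values: List[float]) -> List[float]:
    """Engineered feature: difference between consecutive observations."""
    return [b - a for a, b in zip(values, values[1:])]


def flag_large_jumps(values: List[float], max_step: float) -> List[int]:
    """Return indices whose change from the previous value exceeds max_step.

    max_step is a hypothetical, analyst-chosen limit of plausible change.
    """
    changes = year_over_year_change(values)
    return [i + 1 for i, d in enumerate(changes) if abs(d) > max_step]


# Hypothetical share (%) of one age group over six consecutive years;
# None marks a missing observation.
raw_share = [20.0, None, 22.0, 23.0, None, 25.0]

filled_share = interpolate_missing(raw_share)
scaled_share = min_max_normalize(filled_share)
changes = year_over_year_change(filled_share)
suspicious = flag_large_jumps(filled_share, max_step=3.0)
```

Flagged indices are candidates for review rather than automatic removal, since a large jump may reflect a real historical event rather than an error.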
Authors and Affiliations
Anatolii Solovev, Stefan Solovev
FEATURES OF PROGRAMMING IN DSSP FOR THE TERNARY MACHINE
In article characteristic properties of the Dialogue System for Structured Programming (DSSP) in which it significantly differs from the traditional languages (Pascal, C) which are usually used for development of a basic...
THE DETERMINATION METHOD FOR CONTEXTUAL MEANINGS OF WORDS AND DOCUMENTS
Problems and methods of programmatic context recognition for words and text documents are considered. A survey of existing text processing methods is provided, and a simple numeric algorithm is given for determination of words and do...
RISK ESTIMATION FOR VK.COM ACCOUNTS EXPOSED TO SUICIDE-THEMED QUESTS
This report addresses the problem of internet terrorism prevention. The main focus is on the suicide-themed quest «Blue Whale» (also known as «Siniy Kit») in the vk.com social network and a method for exposed accounts lo...
DESIGN-THINKING: PRACTICE OF CUSTOMER EXPERIENCE RESEARCH
The purpose of our research is to find new tools for analysing hidden client needs (pains), regulating the sequence of reasoning when making decisions during the creation of innovative projects. We propose an approach of...
INTELLECTUAL METHODS OF ANALYSIS OF GEOGRAPHIC INFORMATION INFRASTRUCTURE OF THE REGION
Fuzzy methods of exploration of geo-informational space as a system of systems are considered. The areas of application of digital specialized plan-schemes as fuzzy projections of geo-informational space are discussed. I...