“Big Data” as an Information Source and a Toolkit for Official Statistics: Capacities, Problems, Prospects
Journal Title: Статистика України - Year 2016, Vol 75, Issue 4
Abstract
Issues are discussed, related with potential use by official statistics of the so called “Big Data”, which refers to data extracted from websites, mobile phones, cash machines in retail sales networks, traffic surveillance cameras etc. These data are nicknamed as “big” mainly due to large scopes, not enabling for their processing by standard statistical tools but requiring special software and techniques. It is argued that “Big Data” have advantages such as timeliness, wide coverage of targeted population segments; their collection does not require special questionnaires or surveys, training or recruiting numerous paid personnel like supervisors or interviewers. When “Big Data” are used, accuracy requirements can be loosened, analysis of phenomena and processes can be made by quite simple procedures. As scopes of these data are increasing incessantly, often second by second, the only thing to do is to process them in a proper way, to analyze and use the output information. It is emphasized that use of “Big Data” is complicated due to the need to address problems like indeterminacy of the covered data sets; bias of estimates; accessibility of data, because they are mostly collected by private companies or belong to them; protection of private data, storage of large scopes of “Big Data” and their processing; statistical incorporation of numerous large data sets; risks of potential manipulation with data etc. Arguments are given that applied and official statistics have prototypes of tools capable to solve a major part of the above problems, once properly developed and adapted. They include methods for calibration of survey results, statistical aggregation of data, or model-based assessment of data. As regard “cloud” technologies for data storage and processing, their use can solve the problems of weak capacity of data carriers in statistical offices, and the problems of storage of private and confidential data. Results of studies conducted by leading statisticians of our days demonstrate that official statistics has no alternatives to use of “Bid Data”. The sooner this advanced field of statistics and information technologies comes in focus of the State Statistics Service, universities and research institutions, the easier new information sources and new statistical toolkit can be integrated in the official statistics within the forthcoming ten or fifteen years.
Authors and Affiliations
V. Н. Sarioglo
Звіти як основне джерело інформування користувачів про якість статистичної інформації про адміністративні правопорушення
У статті розглянуто окремі питання підготовки звіту з якості інформації державного статистичного спостереження про адміністративні правопорушення. Обґрунтовано необхідність звітування з якості цієї інформації, висвітлено...
Incomes and Expenditures of Ukrainian Households from 2015 till Earlier Half of 2016
Principles of organization and practice of the surveys devoted to living conditions of households, conducted by the State Statistics Service of Ukraine, are studied. Change in the households’ incomes and expenditures fro...
Совершенствование мониторинга реализации государственной стратегии регионального развития в Украине
Рассмотрены особенности реализации Государственной стратегии регионального развития в Укра ине, а также ее мониторинга. Сформулированы методологические и организационно-экономические предложения по совершенствованию разр...
“Big Data” as an Information Source and a Toolkit for Official Statistics: Capacities, Problems, Prospects
Issues are discussed, related with potential use by official statistics of the so called “Big Data”, which refers to data extracted from websites, mobile phones, cash machines in retail sales networks, traffic surveillan...
Tools and Indicators for Data Quality Assessment for Official Statistical Observation of Foreign Trade in Services
The quality of statistics is the most valuable asset that statistics agencies can offer to users, because it largely determines the validity of decision-making in government and business enterprise sector, promotes stati...