Integration and imputation of survey data in R: the StatMatch package

Journal Title: Revista Romana de Statistica - Year 2015, Vol 63, Issue 2

Abstract

Statistical matching methods make it possible to integrate two or more data sources in order to investigate the relationship between variables that are not jointly observed. These methods have recently received much attention as a valid alternative for producing new statistical outputs. The paper provides an overview of the statistical matching methods implemented in the StatMatch package for the R environment, focusing on the most widespread methods and how they have been improved. Particular attention is devoted to hot deck matching methods, which are closely related to those developed for the imputation of missing values. The corresponding functions in StatMatch are powerful and flexible enough to be applied to the imputation of missing values in a survey. The paper also tackles the problem of matching data from complex sample surveys, a very important topic in National Statistical Institutes. Finally, it describes the concept of uncertainty that characterizes the statistical matching framework and how this alternative approach can be exploited for different purposes.
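
As a rough illustration of the hot deck idea summarized in the abstract, the sketch below fills in a variable that was never observed in a recipient file by copying it from the closest record in a donor file, using the variables the two files share. It is written in plain Python rather than R and does not reproduce the StatMatch interface. The function names, variable names (age, income, hours, spend) and records are all hypothetical, and the Manhattan distance on unscaled variables is an assumption made for brevity.

```python
# Sketch only: nearest-neighbour distance hot deck in plain Python.
# This is NOT the StatMatch API; all names and data are invented.

def manhattan(a, b, keys):
    """Sum of absolute differences over the shared matching variables."""
    return sum(abs(a[k] - b[k]) for k in keys)

def nnd_hot_deck(recipients, donors, match_vars, z_var):
    """For each recipient, copy z_var from the closest donor record."""
    fused = []
    for rec in recipients:
        # min() returns the first donor found when distances are tied
        donor = min(donors, key=lambda d: manhattan(rec, d, match_vars))
        fused.append({**rec, z_var: donor[z_var]})
    return fused

# Recipient file observes (age, income, hours); "spend" is missing.
recipients = [
    {"age": 30, "income": 20, "hours": 38},
    {"age": 52, "income": 35, "hours": 40},
]
# Donor file observes (age, income, spend); "hours" is missing.
donors = [
    {"age": 29, "income": 22, "spend": 15},
    {"age": 50, "income": 33, "spend": 27},
    {"age": 41, "income": 28, "spend": 21},
]

fused = nnd_hot_deck(recipients, donors, ["age", "income"], "spend")
for row in fused:
    print(row)
```

The same routine can act as an imputation step when the donors are the complete records of a single survey and the recipients are its incomplete records, which is the connection between matching and imputation that the abstract points to. In practice the matching variables would usually be rescaled before distances are computed.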

Authors and Affiliations

Marcello D’Orazio

How To Cite

Marcello D’Orazio (2015). Integration and imputation of survey data in R: the StatMatch package. Revista Romana de Statistica, 63(2), 57-68. https://europub.co.uk/articles/-A-89693