Fehlende Daten beim Record Linkage von Prozess- und Befragungsdaten : ein empirischer Vergleich ausgewählter Missing Data Techniken (Missing data in the record linkage of process and survey data : An empirical comparison of selected missing data techniques)
Abstract"To compare different missing data techniques, in this paper I use a survey where participants were among other things asked permission for combining the survey with administrative data (record linkage). For those who refuse their permission I set their survey answers to missing, creating pseudo-missing data due to an empirical relevant but unknown mechanism (compared to the statistical simulation of a missing data process). OLS Regression is performed using Complete Case Analysis (CCA), Multiple Imputation (MI) and two versions of Heckman's Sample Selection Model (SSM) to correct for the pseudo-missing data. Their results are compared to a regression based on the complete data set (Benchmark), that gives us the 'true' regression parameters. Results: All missing data techniques under analysis show only small deviations from the benchmark. If only one independent variable contains missing values, MI performs best. If the dependent variable has missing information, CCA and the Two-Step SSM perform better than MI. If missing data is a problem in many or all independent variables, all techniques except for the Maximum likelihood SSM perform equally well." (Author's abstract, IAB-Doku) ((en))
Download InfoIf you experience problems downloading a file, check if you have the proper application to view it first. In case of further problems read the IDEAS help page. Note that these files are not on the IDEAS site. Please be patient as the files may be large.
Bibliographic InfoPaper provided by Institut für Arbeitsmarkt- und Berufsforschung (IAB), Nürnberg [Institute for Employment Research, Nuremberg, Germany] in its series IAB Discussion Paper with number 200907.
Length: 29 pages
Date of creation: 12 May 2009
Date of revision:
Missing Data-Technik; Befragung; Erhebungsmethode; Forschungsansatz; empirische Sozialforschung; prozessproduzierte Daten; Datengewinnung; Stichprobenfehler; Fehler; Imputationsverfahren;
Find related papers by JEL classification:
- C81 - Mathematical and Quantitative Methods - - Data Collection and Data Estimation Methodology; Computer Programs - - - Methodology for Collecting, Estimating, and Organizing Microeconomic Data; Data Access
- J20 - Labor and Demographic Economics - - Demand and Supply of Labor - - - General
This paper has been announced in the following NEP Reports:
You can help add them by filling out this form.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: (IAB, Geschäftsbereich Dokumentation und Bibliothek).
If references are entirely missing, you can add them using this form.