The impact of cleansing procedures for overlaps on estimation results : evidence for German administrative data
Abstract"Process-generated and administrative datasets have become increasingly important for labor market research over the past ten years. Major advantages of these data are large sample sizes as well as absence of retrospective gaps and unit non-responses. Nevertheless, the quality and validity of the information remains unclear and a lot of preparation and data cleansing is necessary before the data are analyzable. Unfortunately, only few researchers provide access to their cleansing procedures and therefore, also the impact of them on the results of the analyses is unidentified. This paper contributes to this subject and focuses on the variation of research results due to alternative data cleansing procedures. In particular, the paper uses the framework for data preparation suggested in an evaluation study by Wunsch and Lechner (2008) as a benchmark and then induces variation by developing different cleansing procedures for overlapping and parallel observations. The descriptive results show that the differences between the data sets (based on the different procedures) show various magnitudes on some attributes concerning time and personal characteristics. Similar results appear for the subsequent analysis of the treatment effects, which do not vary in the overall shape but in the magnitude especially during the lock-in effect. In sum the results of the analysis indicate that the empirical findings of the evaluation method are fairly robust to variations in the underlying cleansing procedure." (Author's abstract, IAB-Doku) ((en))
Download InfoIf you experience problems downloading a file, check if you have the proper application to view it first. In case of further problems read the IDEAS help page. Note that these files are not on the IDEAS site. Please be patient as the files may be large.
Bibliographic InfoPaper provided by Institut für Arbeitsmarkt- und Berufsforschung (IAB), Nürnberg [Institute for Employment Research, Nuremberg, Germany] in its series FDZ Methodenreport with number 201004_en.
Length: 32 pages
Date of creation: 23 Apr 2010
Date of revision:
Publication status: published in: Schmollers Jahrbuch. Zeitschrift für Wirtschafts- und Sozialwissenschaften, Jg. 130, H. 4 (2010), p. 485-512
Datenqualität; prozessproduzierte Daten; Datenaufbereitung; Integrierte Erwerbsbiografien;
This paper has been announced in the following NEP Reports:
- NEP-ALL-2010-05-02 (All new papers)
You can help add them by filling out this form.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: (IAB, Geschäftsbereich Dokumentation und Bibliothek).
If references are entirely missing, you can add them using this form.