IDEAS home Printed from
   My bibliography  Save this paper

The impact of cleansing procedures for overlaps on estimation results : evidence for German administrative data


  • Scioch, Patrycja

    (Institut für Arbeitsmarkt- und Berufsforschung (IAB), Nürnberg [Institute for Employment Research, Nuremberg, Germany])


"Process-generated and administrative datasets have become increasingly important for labor market research over the past ten years. Major advantages of these data are large sample sizes as well as absence of retrospective gaps and unit non-responses. Nevertheless, the quality and validity of the information remains unclear and a lot of preparation and data cleansing is necessary before the data are analyzable. Unfortunately, only few researchers provide access to their cleansing procedures and therefore, also the impact of them on the results of the analyses is unidentified. This paper contributes to this subject and focuses on the variation of research results due to alternative data cleansing procedures. In particular, the paper uses the framework for data preparation suggested in an evaluation study by Wunsch and Lechner (2008) as a benchmark and then induces variation by developing different cleansing procedures for overlapping and parallel observations. The descriptive results show that the differences between the data sets (based on the different procedures) show various magnitudes on some attributes concerning time and personal characteristics. Similar results appear for the subsequent analysis of the treatment effects, which do not vary in the overall shape but in the magnitude especially during the lock-in effect. In sum the results of the analysis indicate that the empirical findings of the evaluation method are fairly robust to variations in the underlying cleansing procedure." (Author's abstract, IAB-Doku) ((en))

Suggested Citation

  • Scioch, Patrycja, 2010. "The impact of cleansing procedures for overlaps on estimation results : evidence for German administrative data," FDZ Methodenreport 201004_en, Institut für Arbeitsmarkt- und Berufsforschung (IAB), Nürnberg [Institute for Employment Research, Nuremberg, Germany].
  • Handle: RePEc:iab:iabfme:201004_en

    Download full text from publisher

    File URL:
    Download Restriction: no

    More about this item


    Datenqualität; prozessproduzierte Daten; Datenaufbereitung; Integrierte Erwerbsbiografien;

    NEP fields

    This paper has been announced in the following NEP Reports:


    Access and download statistics


    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:iab:iabfme:201004_en. See general information about how to correct material in RePEc.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: (IAB, Geschäftsbereich Dokumentation und Bibliothek). General contact details of provider: .

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service hosted by the Research Division of the Federal Reserve Bank of St. Louis . RePEc uses bibliographic data supplied by the respective publishers.