IDEAS home Printed from https://ideas.repec.org/p/zbw/zewdip/1378.html
   My bibliography  Save this paper

Microdata Disclosure by Resampling: Empirical Findings for Business Survey Data

Author

Listed:
  • Gottschalk, Sandra

Abstract

A problem statistical offices and research institutes are faced with by releasing micro-data is the preservation of confidentiality. Traditional methods to avoid disclosure often destroy the structure of data, i.e., information loss is potentially high. In this paper I discuss an alternative technique of creating scientific-use-files, which reproduce the characteristics of the original data quite well. It is based on Fienberg?s (1997 and 1994) [5], [6] idea to estimate and resample from the empirical multivariate cumulative distribution function of the data in order to get synthetic data. The procedure creates datasets - the resample - which have the same characteristics as the original survey data. In this paper I present some applications of this method with (a) simulated data and (b) innovation survey data, the Mannheim Innovation Panel (MIP), and compare resampling with a common method of disclosure control, i.e. disturbance with multiplicative error, concerning confidentiality on the one hand and the appropriateness of the disturbed data for different kinds of analyses on the other. The results show that univariate distributions can be better reproduced by unweighted resampling. Parameter estimates can be reproduced quite well if (a) the resampling procedure implements the correlation structure of the original data as a scale and (b) the data is multiplicative perturbed and a correction term is used. On average, anonymized data with multiplicative perturbed values better protect against re?identification as the various resampling methods used.

Suggested Citation

  • Gottschalk, Sandra, 2003. "Microdata Disclosure by Resampling: Empirical Findings for Business Survey Data," ZEW Discussion Papers 03-55, ZEW - Leibniz Centre for European Economic Research.
  • Handle: RePEc:zbw:zewdip:1378
    as

    Download full text from publisher

    File URL: https://www.econstor.eu/bitstream/10419/23989/1/dp0355.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Almus, Matthias & Engel, Dirk & Prantl, Susanne, 2000. "The Mannheim Foundation Panels of the Centre for European Economic Research (ZEW)," ZEW Dokumentationen 00-02, ZEW - Leibniz Centre for European Economic Research.
    2. Gottschalk, Sandra, 2002. "Anonymisierung von Unternehmensdaten: Ein Überblick und beispielhafte Darstellung anhand des Mannheimer Innovationspanels," ZEW Discussion Papers 02-23, ZEW - Leibniz Centre for European Economic Research.
    3. Martin Rosemann, 2003. "Erste Ergebnisse von vergleichenden Untersuchungen mit anonymisierten und nicht anonymisierten Einzeldaten am Beispiel der Kostenstrukturerhebung und der Umsatzsteuerstatistik," IAW Discussion Papers 09, Institut für Angewandte Wirtschaftsforschung (IAW).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Engel, Dirk, 2002. "The Impact of Venture Capital on Firm Growth: An Empirical Investigation," ZEW Discussion Papers 02-02, ZEW - Leibniz Centre for European Economic Research.
    2. Rammer, Christian & Metzger, Georg, 2010. "Unternehmensdynamik in der Wissenswirtschaft in Deutschland und im internationalen Vergleich," Studien zum deutschen Innovationssystem 10-2010, Expertenkommission Forschung und Innovation (EFI) - Commission of Experts for Research and Innovation, Berlin.
    3. David B. Audretsch & Dirk Dohse & Annekatrin Niebuhr, 2015. "Regional unemployment structure and new firm formation," Papers in Regional Science, Wiley Blackwell, vol. 94, pages 115-138, November.
    4. Gottschalk, Sandra & Greene, Francis J. & Höwer, Daniel & Müller, Bettina, 2014. "If you don't succeed, should you try again? The role of entrepreneurial experience in venture survival," ZEW Discussion Papers 14-009, ZEW - Leibniz Centre for European Economic Research.
    5. Dirk Engel & Oliver Heneric, 2006. "Stimuliert der BioRegio-Wettbewerb die Ansiedlung neuer Biotechnologieunternehmen? —Ergebnisse einer ökonometrischen Analyse," Review of Regional Research: Jahrbuch für Regionalwissenschaft, Springer;Gesellschaft für Regionalforschung (GfR), vol. 26(1), pages 75-102, March.
    6. Helmut Fryges & Sandra Gottschalk & Karsten Kohn, 2010. "The KfW/ZEW Start-up Panel: Design and Research Potential," Schmollers Jahrbuch : Journal of Applied Social Science Studies / Zeitschrift für Wirtschafts- und Sozialwissenschaften, Duncker & Humblot, Berlin, vol. 130(1), pages 117-132.
    7. Andrew A. Toole & Dirk Czarnitzki & Christian Rammer, 2015. "University research alliances, absorptive capacity, and the contribution of startups to employment growth," Economics of Innovation and New Technology, Taylor & Francis Journals, vol. 24(5), pages 532-549, July.
    8. Metzger, Georg, 2006. "Once bitten, twice shy? The performance of entrepreneurial restarts," ZEW Discussion Papers 06-083, ZEW - Leibniz Centre for European Economic Research.
    9. Ronning Gerd, 2008. "Measuring Research Intensity from Anonymized Data: Does Multiplicative Noise with Factor Structure Save Results Regarding Quotients?," Journal of Economics and Statistics (Jahrbuecher fuer Nationaloekonomie und Statistik), De Gruyter, vol. 228(5-6), pages 644-653, October.
    10. Brixy, Udo & Murmann, Martin, 2016. "The growth and human capital structure of new firms over the business cycle," IAB-Discussion Paper 201642, Institut für Arbeitsmarkt- und Berufsforschung (IAB), Nürnberg [Institute for Employment Research, Nuremberg, Germany].
    11. Michael Fritsch, 2007. "The Geography and the Effect of Creative People in Germany," Jena Economics Research Papers 2007-001, Friedrich-Schiller-University Jena.
    12. Fier, Andreas & Heger, Diana & Hussinger, Katrin, 2005. "Die Wirkungsanalyse staatlicher Förderprogramme durch den Einsatz von Matching- und Selektionsmodellen am Beispiel der Fertigungstechnik," ZEW Discussion Papers 05-09, ZEW - Leibniz Centre for European Economic Research.
    13. Dirk Engel & Oliver Heneric, 2013. "Localization of knowledge and entrepreneurs’ mobility: the case of Germany’s biotechnology industry," Review of Regional Research: Jahrbuch für Regionalwissenschaft, Springer;Gesellschaft für Regionalforschung (GfR), vol. 33(2), pages 173-192, October.
    14. Schwiebacher, Franz, 2012. "Complementary assets, patent thickets and hold-up threats: Do transaction costs undermine investments in innovation?," ZEW Discussion Papers 12-015, ZEW - Leibniz Centre for European Economic Research.
    15. Martin Rosemann, 2003. "Erste Ergebnisse von vergleichenden Untersuchungen mit anonymisierten und nicht anonymisierten Einzeldaten am Beispiel der Kostenstrukturerhebung und der Umsatzsteuerstatistik," IAW Discussion Papers 09, Institut für Angewandte Wirtschaftsforschung (IAW).
    16. Engel, Dirk & Keilbach, Max, 2007. "Firm-level implications of early stage venture capital investment -- An empirical investigation," Journal of Empirical Finance, Elsevier, vol. 14(2), pages 150-167, March.
    17. Müller, Bettina & Bersch, Johannes & Gottschalk, Sandra, 2015. "Unternehmensdynamik in der Wissenswirtschaft in Deutschland 2013: Gründungen und Schließungen von Unternehmen – Gründungsdynamik in den Bundesländern – Internationaler Vergleich," Studien zum deutschen Innovationssystem 4-2015, Expertenkommission Forschung und Innovation (EFI) - Commission of Experts for Research and Innovation, Berlin.
    18. Fritsch, Michael & Kritikos, Alexander S. & Rusakova, Alina, 2012. "Who Starts a Business and Who is Self-Employed in Germany," IZA Discussion Papers 6326, Institute of Labor Economics (IZA).
    19. Paic, Peter, 2006. "Informationelle Zugänge für die empirische Untersuchung freiberuflicher Existenzgründungen?," MPRA Paper 5744, University Library of Munich, Germany.
    20. Rammer, Christian & Ohmstedt, Jörg & Binz, Hanna L. & Heneric, Oliver, 2006. "Unternehmensgründungen in der Biotechnologie in Deutschland 1991 bis 2004," ZEW Dokumentationen 06-03, ZEW - Leibniz Centre for European Economic Research.

    More about this item

    Keywords

    resampling; multiplicative data perturbation; Monte Carlo studies; business survey data;
    All these keywords.

    JEL classification:

    • C81 - Mathematical and Quantitative Methods - - Data Collection and Data Estimation Methodology; Computer Programs - - - Methodology for Collecting, Estimating, and Organizing Microeconomic Data; Data Access
    • C13 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Estimation: General
    • C15 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Statistical Simulation Methods: General

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:zbw:zewdip:1378. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ZBW - Leibniz Information Centre for Economics (email available below). General contact details of provider: https://edirc.repec.org/data/zemande.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.