This file is part of IDEAS, which uses RePEc data


[ Papers | Articles | Software | Books | Chapters | Authors | Institutions | JEL Classification | NEP reports | Search | New papers by email | Author registration | Rankings | Volunteers | FAQ | Blog | Help! ]

Distribution-Preserving Statistical Disclosure Limitation

Author info | Abstract | Publisher info | Download info | Related research | Statistics
Author Info
Simon D. Woodcock () (Simon Fraser University)
Gary Benedetto () (US Census Bureau)

Additional information is available for the following registered author(s):

Abstract

One approach to limiting disclosure risk in public-use microdata is to release multiply-imputed, partially synthetic data sets. These are data on actual respondents, but with con dential data replaced by multiply-imputed synthetic values. When imputing confidential values, a mis-specified model can invalidate inferences, because the distribution of synthetic data is determined by the model used to generate them. We present a practical method to generate synthetic values when the imputer has only limited information about the true data generating process. We combine a simple imputation model (such as regression) with a series of density-based transformations to pre- serve the distribution of the con dential data, up to sampling error, on speci ed subdomains. We demonstrate through simulation and a large scale application that our approach preserves important statistical properties of the con dential data, including higher moments, with low disclosure risk.

Download Info
To download:

If you experience problems downloading a file, check if you have the proper application to view it first. Information about this may be contained in the File-Format links below. In case of further problems read the IDEAS help page. Note that these files are not on the IDEAS site. Please be patient as the files may be large.

File URL: http://www.econ.sfu.ca/research/RePEc/sfu/sfudps/dp07-15.pdf
File Format: application/pdf
File Function:
Download Restriction: no

Publisher Info
Paper provided by Department of Economics, Simon Fraser University in its series Discussion Papers with number dp07-15.

Download reference. The following formats are available: HTML (with abstract), plain text (with abstract), BibTeX, RIS (EndNote, RefMan, ProCite), ReDIF
Length: 39
Date of creation: Sep 2007
Date of revision:
Handle: RePEc:sfu:sfudps:dp07-15

Contact details of provider:
Postal: Department of Economics, Simon Fraser University, 8888 University Drive, Burnaby, BC, V5A 1S6, Canada
Phone: (778)782-3508
Fax: (778)782-5944
Web page: http://www.econ.sfu.ca/
More information through EDIRC

Order Information:
Postal: Working Paper Coordinator, Department of Economics, Simon Fraser University, 8888 University Drive, Burnaby, BC, V5A 1S6, Canada
Email:
Web: http://www.econ.sfu.ca/Research_and_Seminars/Publications/index.html

For technical questions regarding this item, or to correct its listing, contact: (Working Paper Coordinator).

Related research
Keywords: statistical disclosure limitation; confidentiality; privacy; multiple imputation; partially synthetic data;

Other versions of this item:

Find related papers by JEL classification:
C1 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods: General
C4 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods: Special Topics
C5 - Mathematical and Quantitative Methods - - Econometric Modeling

This paper has been announced in the following NEP Reports:

References listed on IDEAS
Please report citation or reference errors to , or , if you are the registered author of the cited work, log in to your RePEc Author Service profile, click on "citations" and make appropriate adjustments.:
  1. Reiter, Jerome P., 2005. "Estimating Risks of Identification Disclosure in Microdata," Journal of the American Statistical Association, American Statistical Association, vol. 100, pages 1103-1112, December. [Downloadable!] (restricted)
  2. John J. Abowd & John Haltiwanger & Julia Lane, 2004. "Integrated Longitudinal Employer-Employee Data for the United States," American Economic Review, American Economic Association, vol. 94(2), pages 224-229, May. [Downloadable!]
    Other versions:
  3. John Abowd & Bryce Stephens & Lars Vilhuber, 2006. "The LEHD Infrastructure Files and the Creation of the Quarterly Workforce Indicators," Technical Papers 2006-01, Longitudinal Employer-Household Dynamics, Center for Economic Studies, U.S. Census Bureau. [Downloadable!]
    Other versions:
Full references

Statistics
Access and download statistics

Did you know? Springer Verlag was the first commercial publisher to be listed on RePEc.

This page was last updated on 2009-11-19.


This information is provided to you by IDEAS at the Department of Economics, College of Liberal Arts and Sciences, University of Connecticut using RePEc data on a server sponsored by the Society for Economic Dynamics.