Avoiding disclosure of individually identifiable health information: a literature review
AbstractAchieving data and information dissemination without arming anyone is a central task of any entity in charge of collecting data. In this article, the authors examine the literature on data and statistical confidentiality. Rather than comparing the theoretical properties of specific methods, they emphasize the main themes that emerge from the ongoing discussion among scientists regarding how best to achieve the appropriate balance between data protection, data utility, and data dissemination. They cover the literature on de-identification and reidentification methods with emphasis on health care data. The authors also discuss the benefits and limitations for the most common access methods. Although there is abundant theoretical and empirical research, their review reveals lack of consensus on fundamental questions for empirical practice: How to assess disclosure risk, how to choose among disclosure methods, how to assess reidentification risk, and how to measure utility loss.
Download InfoIf you experience problems downloading a file, check if you have the proper application to view it first. In case of further problems read the IDEAS help page. Note that these files are not on the IDEAS site. Please be patient as the files may be large.
Bibliographic InfoPaper provided by University Library of Munich, Germany in its series MPRA Paper with number 35463.
Date of creation: 14 Dec 2011
Date of revision:
public use files; disclosure avoidance; reidentification; de-identification; data utility;
Find related papers by JEL classification:
- I18 - Health, Education, and Welfare - - Health - - - Government Policy; Regulation; Public Health
- C46 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods: Special Topics - - - Specific Distributions
This paper has been announced in the following NEP Reports:
Please report citation or reference errors to , or , if you are the registered author of the cited work, log in to your RePEc Author Service profile, click on "citations" and make appropriate adjustments.:
- Skinner, Chris & Shlomo, Natalie, 2008. "Assessing Identification Risk in Survey Microdata Using Log-Linear Models," Journal of the American Statistical Association, American Statistical Association, vol. 103(483), pages 989-1001.
- John M. Abowd & Julia I. Lane, 2004. "New Approaches to Confidentiality Protection Synthetic Data, Remote Access and Research Data Centers," Longitudinal Employer-Household Dynamics Technical Papers 2004-03, Center for Economic Studies, U.S. Census Bureau.
- J. Trent Alexander & Michael Davern & Betsey Stevenson, 2010.
"Inaccurate age and sex data in the Census PUMS files: evidence and implications,"
Working Paper Series
2010-03, Federal Reserve Bank of San Francisco.
- J. Trent Alexander & Michael Davern & Betsey Stevenson, 2010. "Inaccurate age and sex data in the Census PUMS files: Evidence and Implications," NBER Working Papers 15703, National Bureau of Economic Research, Inc.
- J. Trent Alexander & Michael Davern & Betsey Stevenson, 2010. "Inaccurate Age and Sex Data in the Census PUMS Files: Evidence and Implications," CESifo Working Paper Series 2929, CESifo Group Munich.
- Daniel Weinberg & John Abowd & Sandra Rowland & Philip Steel & Laura Zayatz, 2007. "Access Methods for United States Microdata," Working Papers 07-25, Center for Economic Studies, U.S. Census Bureau.
- C. J. Skinner, 2007. "The probability of identification: applying ideas from forensic statistics to disclosure risk assessment," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 170(1), pages 195-212.
- Julia Lane & Claudia Schur, 2009. "Balancing Access to Data And Privacy. A review of the issues and approaches for the future," Working Paper Series of the German Council for Social and Economic Data 113, German Council for Social and Economic Data (RatSWD).
- Erdem, Erkan & Prada, Sergio I, 2011. "Creation of public use files: lessons learned from the comparative effectiveness research public use files data pilot project," MPRA Paper 35478, University Library of Munich, Germany.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: (Ekkehart Schlicht).
If references are entirely missing, you can add them using this form.