Synthetic Data For Small Area Estimation In The American Community Survey
Small area estimates provide a critical source of information used to study local populations. Statistical agencies regularly collect data from small areas but are prevented from releasing detailed geographical identifiers in public-use data sets due to disclosure concerns. Alternative data dissemination methods used in practice include releasing summary or aggregate tables, suppressing detailed geographic information in public-use data sets, and providing access to restricted data through Research Data Centers. This research examines an alternative method for disseminating microdata that contain more geographic detail than is currently released in public-use data files. Specifically, the method replaces the observed survey values with imputed, or synthetic, values simulated from a hierarchical Bayesian model. Confidentiality protection is enhanced because no actual values are released. The method is demonstrated using restricted data from the 2005-2009 American Community Survey. The analytic validity of the synthetic data is assessed by comparing small area estimates obtained from the synthetic data with those obtained from the observed data.
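As a rough illustration of the idea described in the abstract, the following minimal sketch simulates synthetic values from a simple two-level normal model and compares area-level means between the observed and synthetic data. It is not the authors' model: the normal-normal structure, the variances `sigma` and `tau` treated as known, the plug-in overall mean, the function name `synthesize`, and the simulated "observed" data are all assumptions made for illustration.

```python
import random
import statistics


def synthesize(areas, sigma, tau, rng):
    """Draw one synthetic data set from a toy normal-normal hierarchical model.

    areas: dict mapping area name -> list of observed values
    sigma: within-area standard deviation (assumed known)
    tau:   between-area standard deviation (assumed known)
    rng:   random.Random instance used for all draws
    """
    # Plug-in estimate of the overall mean (an empirical-Bayes shortcut,
    # not a full Bayesian treatment of the hyperparameters).
    all_values = [v for values in areas.values() for v in values]
    mu = statistics.fmean(all_values)

    synthetic = {}
    for name, values in areas.items():
        n = len(values)
        ybar = statistics.fmean(values)
        # Conjugate posterior of the area mean: precision-weighted shrinkage
        # of the area's sample mean toward the overall mean.
        post_prec = n / sigma**2 + 1 / tau**2
        post_mean = (n * ybar / sigma**2 + mu / tau**2) / post_prec
        post_sd = post_prec ** -0.5
        theta = rng.gauss(post_mean, post_sd)
        # Every released value is simulated; no observed value is copied.
        synthetic[name] = [rng.gauss(theta, sigma) for _ in range(n)]
    return synthetic


# Hypothetical "observed" data for two areas of different sizes.
rng = random.Random(0)
observed = {
    "area_A": [rng.gauss(50, 10) for _ in range(200)],
    "area_B": [rng.gauss(60, 10) for _ in range(30)],
}
synthetic = synthesize(observed, sigma=10.0, tau=5.0, rng=rng)

# Compare small area estimates (here, area means) from both data sets.
for name in observed:
    obs_mean = statistics.fmean(observed[name])
    syn_mean = statistics.fmean(synthetic[name])
    print(f"{name}: observed mean {obs_mean:.1f}, synthetic mean {syn_mean:.1f}")
```

In this toy setting the smaller area's mean is shrunk more strongly toward the overall mean, so its synthetic estimate would typically deviate more from the observed one. In practice, several synthetic data sets would usually be drawn and their estimates combined rather than relying on a single draw.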
Date of creation: Apr 2013
Contact details of provider:
Phone: (301) 763-6460
Fax: (301) 763-5935
Web page: http://www.census.gov/ces
Handle: RePEc:cen:wpaper:13-19