Plant-Level Productivity and Imputation of Missing Data in the Census of Manufactures
In the U.S. Census of Manufactures, the Census Bureau imputes missing values using a combination of mean imputation, ratio imputation, and conditional mean imputation. It is wellknown that imputations based on these methods can result in underestimation of variability and potential bias in multivariate inferences. We show that this appears to be the case for the existing imputations in the Census of Manufactures. We then present an alternative strategy for handling the missing data based on multiple imputation. Specifically, we impute missing values via sequences of classification and regression trees, which offer a computationally straightforward and flexible approach for semi-automatic, large-scale multiple imputation. We also present an approach to evaluating these imputations based on posterior predictive checks. We use the multiple imputations, and the imputations currently employed by the Census Bureau, to estimate production function parameters and productivity dispersions. The results suggest that the two approaches provide quite different answers about productivity.
|Date of creation:||Jan 2011|
|Date of revision:|
|Contact details of provider:|| Postal: |
Phone: (301) 763-6460
Fax: (301) 763-5935
Web page: http://www.census.gov/cesEmail:
More information through EDIRC
When requesting a correction, please mention this item's handle: RePEc:cen:wpaper:11-02. See general information about how to correct material in RePEc.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: (Fariha Kamal)
If references are entirely missing, you can add them using this form.