Univariate Multiple Imputation for Coarse Employee Income Data
his paper is concerned with conducting univariate multiple imputation for employee income data that is comprised of continuously distributed observations, observations that are bounded by consecutive income brackets, and observations that are missing. A variable with this mixture of data types is a form of coarsening in the data. An interval-censored regression imputation procedure is utilised to generate plausible draws for the bounded and nonresponse subsets of income. We test the sensitivity of results to mis-specification in the prediction equations of the imputation algorithm, and we test the stability of the results as the number of imputations increase from two to five to twenty. We find that for missing data, imputed draws are very different for respondents who state that they don't know their income compared to those who refuse. The upper tail of the income distribution is most sensitive to mis-specification in the imputation algorithm, and we discuss how best to conduct multiple imputation to take this into account. Lastly, stability in parameter estimates of the income distribution is achieved with as little as two multiple imputations, due largely to (a) the small fraction of missing data, in combination with (b) reduced within- and between-imputation components of variance for imputed draws of the bracketed income subset, a function of the defined lower and upper bounds of the brackets that restrict the range of plausibility for imputed draws. This is a joint SALDRU and DataFirst working paper
|Date of creation:||2012|
|Date of revision:|
|Contact details of provider:|| Postal: |
Phone: +27 21 650 5696
Fax: +27 21 650 5697
Web page: http://www.saldru.uct.ac.za/
More information through EDIRC
Please report citation or reference errors to , or , if you are the registered author of the cited work, log in to your RePEc Author Service profile, click on "citations" and make appropriate adjustments.:
- Patrick Royston, 2005. "MICE for multiple imputation of missing values," United Kingdom Stata Users' Group Meetings 2005 02, Stata Users Group.
- Patrick Royston, 2005. "Multiple imputation of missing values: update," Stata Journal, StataCorp LP, vol. 5(2), pages 188-201, June.
- Patrick Royston, 2005. "Multiple imputation of missing values: Update of ice," Stata Journal, StataCorp LP, vol. 5(4), pages 527-536, December.
- Martin Wittenberg, 2008. "Nonparametric estimation when income is reported in bands and at points," Working Papers 94, Economic Research Southern Africa.
- Reza Daniels, 2008. "The income distribution with coarse data," Working Papers 82, Economic Research Southern Africa.
- Reza C. Daniels, 2012. "Questionnaire Design and Response Propensities for Employee Income Micro Data," SALDRU Working Papers 89, Southern Africa Labour and Development Research Unit, University of Cape Town.
- White, Ian R. & Daniel, Rhian & Royston, Patrick, 2010. "Avoiding bias due to perfect prediction in multiple imputation of incomplete categorical variables," Computational Statistics & Data Analysis, Elsevier, vol. 54(10), pages 2267-2275, October.
When requesting a correction, please mention this item's handle: RePEc:ldr:wpaper:88. See general information about how to correct material in RePEc.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: (Alison Siljeur)
If references are entirely missing, you can add them using this form.