Univariate Multiple Imputation for Coarse Employee Income Data
Abstracthis paper is concerned with conducting univariate multiple imputation for employee income data that is comprised of continuously distributed observations, observations that are bounded by consecutive income brackets, and observations that are missing. A variable with this mixture of data types is a form of coarsening in the data. An interval-censored regression imputation procedure is utilised to generate plausible draws for the bounded and nonresponse subsets of income. We test the sensitivity of results to mis-specification in the prediction equations of the imputation algorithm, and we test the stability of the results as the number of imputations increase from two to five to twenty. We find that for missing data, imputed draws are very different for respondents who state that they don't know their income compared to those who refuse. The upper tail of the income distribution is most sensitive to mis-specification in the imputation algorithm, and we discuss how best to conduct multiple imputation to take this into account. Lastly, stability in parameter estimates of the income distribution is achieved with as little as two multiple imputations, due largely to (a) the small fraction of missing data, in combination with (b) reduced within- and between-imputation components of variance for imputed draws of the bracketed income subset, a function of the defined lower and upper bounds of the brackets that restrict the range of plausibility for imputed draws. This is a joint SALDRU and DataFirst working paper
Download InfoIf you experience problems downloading a file, check if you have the proper application to view it first. In case of further problems read the IDEAS help page. Note that these files are not on the IDEAS site. Please be patient as the files may be large.
Bibliographic InfoPaper provided by Southern Africa Labour and Development Research Unit, University of Cape Town in its series SALDRU Working Papers with number 88.
Date of creation: 2012
Date of revision:
Contact details of provider:
Postal: Leslie Social Science Building, Private Bag, Rondebosch, 7701
Phone: +27 21 650 5696
Fax: +27 21 650 5697
Web page: http://www.saldru.uct.ac.za/
More information through EDIRC
Multiple Imputation; Coarse Data; Income Distribution;
Find related papers by JEL classification:
- C15 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Statistical Simulation Methods: General
- C83 - Mathematical and Quantitative Methods - - Data Collection and Data Estimation Methodology; Computer Programs - - - Survey Methods; Sampling Methods
- D31 - Microeconomics - - Distribution - - - Personal Income and Wealth Distribution
This paper has been announced in the following NEP Reports:
Please report citation or reference errors to , or , if you are the registered author of the cited work, log in to your RePEc Author Service profile, click on "citations" and make appropriate adjustments.:
- Patrick Royston, 2005. "Multiple imputation of missing values: update," Stata Journal, StataCorp LP, vol. 5(2), pages 188-201, June.
- Reza Daniels, 2008. "The income distribution with coarse data," Working Papers 82, Economic Research Southern Africa.
- Patrick Royston, 2005. "MICE for multiple imputation of missing values," United Kingdom Stata Users' Group Meetings 2005 02, Stata Users Group.
- Patrick Royston, 2005. "Multiple imputation of missing values: Update of ice," Stata Journal, StataCorp LP, vol. 5(4), pages 527-536, December.
- Reza C. Daniels, 2012. "Questionnaire Design and Response Propensities for Employee Income Micro Data," SALDRU Working Papers 89, Southern Africa Labour and Development Research Unit, University of Cape Town.
- Martin Wittenberg, 2008. "Nonparametric estimation when income is reported in bands and at points," Working Papers 94, Economic Research Southern Africa.
- White, Ian R. & Daniel, Rhian & Royston, Patrick, 2010. "Avoiding bias due to perfect prediction in multiple imputation of incomplete categorical variables," Computational Statistics & Data Analysis, Elsevier, vol. 54(10), pages 2267-2275, October.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: (Alison Siljeur).
If references are entirely missing, you can add them using this form.