IDEAS home Printed from https://ideas.repec.org/p/ldr/wpaper/88.html
   My bibliography  Save this paper

Univariate Multiple Imputation for Coarse Employee Income Data

Author

Listed:
  • Reza C. Daniels

    () (SALDRU, School of Economics, University of Cape Town)

Abstract

his paper is concerned with conducting univariate multiple imputation for employee income data that is comprised of continuously distributed observations, observations that are bounded by consecutive income brackets, and observations that are missing. A variable with this mixture of data types is a form of coarsening in the data. An interval-censored regression imputation procedure is utilised to generate plausible draws for the bounded and nonresponse subsets of income. We test the sensitivity of results to mis-specification in the prediction equations of the imputation algorithm, and we test the stability of the results as the number of imputations increase from two to five to twenty. We find that for missing data, imputed draws are very different for respondents who state that they don't know their income compared to those who refuse. The upper tail of the income distribution is most sensitive to mis-specification in the imputation algorithm, and we discuss how best to conduct multiple imputation to take this into account. Lastly, stability in parameter estimates of the income distribution is achieved with as little as two multiple imputations, due largely to (a) the small fraction of missing data, in combination with (b) reduced within- and between-imputation components of variance for imputed draws of the bracketed income subset, a function of the defined lower and upper bounds of the brackets that restrict the range of plausibility for imputed draws. This is a joint SALDRU and DataFirst working paper

Suggested Citation

  • Reza C. Daniels, 2012. "Univariate Multiple Imputation for Coarse Employee Income Data," SALDRU Working Papers 88, Southern Africa Labour and Development Research Unit, University of Cape Town.
  • Handle: RePEc:ldr:wpaper:88
    as

    Download full text from publisher

    File URL: http://opensaldru.uct.ac.za/bitstream/handle/11090/179/2012_88.pdf?sequence=1
    File Function: Full text
    Download Restriction: no

    References listed on IDEAS

    as
    1. White, Ian R. & Daniel, Rhian & Royston, Patrick, 2010. "Avoiding bias due to perfect prediction in multiple imputation of incomplete categorical variables," Computational Statistics & Data Analysis, Elsevier, vol. 54(10), pages 2267-2275, October.
    2. Patrick Royston, 2005. "Multiple imputation of missing values: update," Stata Journal, StataCorp LP, vol. 5(2), pages 188-201, June.
    3. Patrick Royston, 2005. "Multiple imputation of missing values: Update of ice," Stata Journal, StataCorp LP, vol. 5(4), pages 527-536, December.
    4. Reza C. Daniels, 2012. "Questionnaire Design and Response Propensities for Employee Income Micro Data," SALDRU Working Papers 89, Southern Africa Labour and Development Research Unit, University of Cape Town.
    5. Patrick Royston, 2005. "MICE for multiple imputation of missing values," United Kingdom Stata Users' Group Meetings 2005 02, Stata Users Group.
    6. Martin Wittenberg, 2008. "Nonparametric estimation when income is reported in bands and at points," Working Papers 94, Economic Research Southern Africa.
    7. Reza Daniels, 2008. "The income distribution with coarse data," Working Papers 82, Economic Research Southern Africa.
    Full references (including those not matched with items on IDEAS)

    More about this item

    Keywords

    Multiple Imputation; Coarse Data; Income Distribution;

    JEL classification:

    • C15 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Statistical Simulation Methods: General
    • C83 - Mathematical and Quantitative Methods - - Data Collection and Data Estimation Methodology; Computer Programs - - - Survey Methods; Sampling Methods
    • D31 - Microeconomics - - Distribution - - - Personal Income and Wealth Distribution

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:ldr:wpaper:88. See general information about how to correct material in RePEc.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: (Alison Siljeur). General contact details of provider: http://edirc.repec.org/data/sauctza.html .

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service hosted by the Research Division of the Federal Reserve Bank of St. Louis . RePEc uses bibliographic data supplied by the respective publishers.