IDEAS home Printed from https://ideas.repec.org/a/sae/somere/v42y2013i1p105-138.html
   My bibliography  Save this article

Should a Normal Imputation Model be Modified to Impute Skewed Variables?

Author

Listed:
  • Paul T. von Hippel

Abstract

Researchers often impute continuous variables under an assumption of normality–yet many incomplete variables are skewed. We find that imputing skewed continuous variables under a normal model can lead to bias. The bias is usually mild for popular estimands such as means, standard deviations, and linear regression coefficients, but the bias can be severe for more shape-dependent estimands such as percentiles or the coefficient of skewness. We test several methods for adapting a normal imputation model to accommodate skewness, including methods that transform, truncate, or censor (round) normally imputed values as well as methods that impute values from a quadratic or truncated regression. None of these modifications reliably reduces the biases of the normal model, and some modifications can make the biases much worse. We conclude that, if one has to impute a skewed variable under a normal model, it is usually safest to do so without modifications–unless you are more interested in estimating percentiles and shape than in estimating means, variances, and regressions. In the conclusion, we briefly discuss promising developments in the area of continuous imputation models that do not assume normality.

Suggested Citation

  • Paul T. von Hippel, 2013. "Should a Normal Imputation Model be Modified to Impute Skewed Variables?," Sociological Methods & Research, , vol. 42(1), pages 105-138, February.
  • Handle: RePEc:sae:somere:v:42:y:2013:i:1:p:105-138
    DOI: 10.1177/0049124112464866
    as

    Download full text from publisher

    File URL: https://journals.sagepub.com/doi/10.1177/0049124112464866
    Download Restriction: no

    File URL: https://libkey.io/10.1177/0049124112464866?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Goldberger, Arthur S., 1981. "Linear regression after selection," Journal of Econometrics, Elsevier, vol. 15(3), pages 357-366, April.
    2. Yulei He & Trivellore E. Raghunathan, 2012. "Multiple imputation using multivariate gh transformations," Journal of Applied Statistics, Taylor & Francis Journals, vol. 39(10), pages 2177-2198, June.
    3. Jeffrey M Wooldridge, 2010. "Econometric Analysis of Cross Section and Panel Data," MIT Press Books, The MIT Press, edition 2, volume 1, number 0262232588, December.
    4. Horton N.J. & Lipsitz S.R. & Parzen M., 2003. "A Potential for Bias When Rounding in Multiple Imputation," The American Statistician, American Statistical Association, vol. 57, pages 229-232, November.
    5. Hakan Demirtas & Donald Hedeker, 2008. "Imputing continuous data under some non‐Gaussian distributions," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 62(2), pages 193-205, May.
    6. Rebecca R. Andridge & Roderick J. A. Little, 2010. "A Review of Hot Deck Imputation for Survey Non‐response," International Statistical Review, International Statistical Institute, vol. 78(1), pages 40-64, April.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Paul T. von Hippel, 2020. "How Many Imputations Do You Need? A Two-stage Calculation Using a Quadratic Rule," Sociological Methods & Research, , vol. 49(3), pages 699-718, August.
    2. Antonio Zinilli, 2021. "Imputation methods for estimating public R&D funding: evidence from longitudinal data," Quality & Quantity: International Journal of Methodology, Springer, vol. 55(2), pages 707-729, April.
    3. Marco Geraci & Alexander McLain, 2018. "Multiple Imputation for Bounded Variables," Psychometrika, Springer;The Psychometric Society, vol. 83(4), pages 919-940, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Zhixin Lun & Ravindra Khattree, 2021. "Imputation for Skewed Data: Multivariate Lomax Case," Sankhya B: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 83(1), pages 86-113, May.
    2. Robert Moffitt & John Abowd & Christopher Bollinger & Michael Carr & Charles Hokayem & Kevin McKinney & Emily Wiemers & Sisi Zhang & James Ziliak, 2022. "Reconciling Trends in U.S. Male Earnings Volatility: Results from Survey and Administrative Data," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 41(1), pages 1-11, December.
    3. Marco Geraci & Alexander McLain, 2018. "Multiple Imputation for Bounded Variables," Psychometrika, Springer;The Psychometric Society, vol. 83(4), pages 919-940, December.
    4. Averi Chakrabarti & Karen A Grépin & Stéphane Helleringer, 2019. "The impact of supplementary immunization activities on routine vaccination coverage: An instrumental variable analysis in five low-income countries," PLOS ONE, Public Library of Science, vol. 14(2), pages 1-11, February.
    5. Harold Alderman & John Hoddinott & Bill Kinsey, 2006. "Long term consequences of early childhood malnutrition," Oxford Economic Papers, Oxford University Press, vol. 58(3), pages 450-474, July.
    6. Raymundo M. Campos-Vázquez, 2013. "Efectos de los ingresos no reportados en el nivel y tendencia de la pobreza laboral en México," Ensayos Revista de Economia, Universidad Autonoma de Nuevo Leon, Facultad de Economia, vol. 0(2), pages 23-54, November.
    7. Huh, Yesol & Kim, You Suk, 2023. "Cheapest-to-deliver pricing, optimal MBS securitization, and welfare implications," Journal of Financial Economics, Elsevier, vol. 150(1), pages 68-93.
    8. Fan Li & Prashant Loyalka & Hongmei Yi & Yaojiang Shi & Natalie Johnson & Scott Rozelle, 2016. "Ability Tracking and Social Capital in China’s Rural Secondary School System," LICOS Discussion Papers 37916, LICOS - Centre for Institutions and Economic Performance, KU Leuven.
    9. Ji Yan & Sally Brocksen, 2013. "Adolescent risk perception, substance use, and educational attainment," Journal of Risk Research, Taylor & Francis Journals, vol. 16(8), pages 1037-1055, September.
    10. Sènakpon Fidèle A. Dedehouanou & Luca Tiberti & Hilaire G. Houeninvo & Djohodo Inès Monwanou, 2019. "Working while studying: Employment premium or penalty for youth in Benin?," Working Papers PMMA 2019-03, PEP-PMMA.
    11. Mengyuan Zhou, 2022. "Does the Source of Inheritance Matter in Bequest Attitudes? Evidence from Japan," Journal of Family and Economic Issues, Springer, vol. 43(4), pages 867-887, December.
    12. Sandra Müllbacher & Wolfgang Nagl, 2017. "Labour supply in Austria: an assessment of recent developments and the effects of a tax reform," Empirica, Springer;Austrian Institute for Economic Research;Austrian Economic Association, vol. 44(3), pages 465-486, August.
    13. Campbell, Randall C. & Nagel, Gregory L., 2016. "Private information and limitations of Heckman's estimator in banking and corporate finance research," Journal of Empirical Finance, Elsevier, vol. 37(C), pages 186-195.
    14. Maurice Mutisya & Moses W. Ngware & Caroline W. Kabiru & Ngianga-bakwin Kandala, 2016. "The effect of education on household food security in two informal urban settlements in Kenya: a longitudinal analysis," Food Security: The Science, Sociology and Economics of Food Production and Access to Food, Springer;The International Society for Plant Pathology, vol. 8(4), pages 743-756, August.
    15. Giuliani, Elisa & Martinelli, Arianna & Rabellotti, Roberta, 2016. "Is Co-Invention Expediting Technological Catch Up? A Study of Collaboration between Emerging Country Firms and EU Inventors," World Development, Elsevier, vol. 77(C), pages 192-205.
    16. Ilona Babenko & Benjamin Bennett & John M Bizjak & Jeffrey L Coles & Jason J Sandvik, 2023. "Clawback Provisions and Firm Risk," The Review of Corporate Finance Studies, Society for Financial Studies, vol. 12(2), pages 191-239.
    17. Şahan, Duygu & Tuna, Okan, 2018. "Environmental innovation of transportation sector in OECD countries," Chapters from the Proceedings of the Hamburg International Conference of Logistics (HICL), in: Kersten, Wolfgang & Blecker, Thorsten & Ringle, Christian M. (ed.), The Road to a Digitalized Supply Chain Management: Smart and Digital Solutions for Supply Chain Management. Proceedings of the Hamburg International C, volume 25, pages 157-170, Hamburg University of Technology (TUHH), Institute of Business Logistics and General Management.
    18. Mehzabin Tuli, Farzana & Mitra, Suman & Crews, Mariah B., 2021. "Factors influencing the usage of shared E-scooters in Chicago," Transportation Research Part A: Policy and Practice, Elsevier, vol. 154(C), pages 164-185.
    19. Ruomeng Cui & Dennis J. Zhang & Achal Bassamboo, 2019. "Learning from Inventory Availability Information: Evidence from Field Experiments on Amazon," Management Science, INFORMS, vol. 65(3), pages 1216-1235, March.
    20. Peppel-Srebrny, Jemima, 2021. "Not all government budget deficits are created equal: Evidence from advanced economies' sovereign bond markets," Journal of International Money and Finance, Elsevier, vol. 118(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:sae:somere:v:42:y:2013:i:1:p:105-138. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: SAGE Publications (email available below). General contact details of provider: .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.