IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2205.07388.html
   My bibliography  Save this paper

Inference with Imputed Data: The Allure of Making Stuff Up

Author

Listed:
  • Charles F. Manski

Abstract

Incomplete observability of data generates an identification problem. There is no panacea for missing data. What one can learn about a population parameter depends on the assumptions one finds credible to maintain. The credibility of assumptions varies with the empirical setting. No specific assumptions can provide a realistic general solution to the problem of inference with missing data. Yet Rubin has promoted random multiple imputation (RMI) as a general way to deal with missing values in public-use data. This recommendation has been influential to empirical researchers who seek a simple fix to the nuisance of missing data. This paper adds to my earlier critiques of imputation. It provides a transparent assessment of the mix of Bayesian and frequentist thinking used by Rubin to argue for RMI. It evaluates random imputation to replace missing outcome or covariate data when the objective is to learn a conditional expectation. It considers steps that might help combat the allure of making stuff up.

Suggested Citation

  • Charles F. Manski, 2022. "Inference with Imputed Data: The Allure of Making Stuff Up," Papers 2205.07388, arXiv.org.
  • Handle: RePEc:arx:papers:2205.07388
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2205.07388
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Charles F. Manski, 1989. "Anatomy of the Selection Problem," Journal of Human Resources, University of Wisconsin Press, vol. 24(3), pages 343-360.
    2. Charles F Manski & Michael Gmeiner & Anat Tamburc, 2021. "Misguided Use of Observed Covariates to Impute Missing Covariates in Conditional Prediction: A Shrinkage Problem," Papers 2102.11334, arXiv.org.
    3. Charles F. Manski, 2011. "Policy Analysis with Incredible Certitude," Economic Journal, Royal Economic Society, vol. 121(554), pages 261-289, August.
    4. Richard Blundell & Amanda Gosling & Hidehiko Ichimura & Costas Meghir, 2007. "Changes in the Distribution of Male and Female Wages Accounting for Employment Composition Using Bounds," Econometrica, Econometric Society, vol. 75(2), pages 323-363, March.
    5. Charles F. Manski & Anat R. Tambur & Michael Gmeiner, 2019. "Predicting kidney transplant outcomes with partial knowledge of HLA mismatch," Proceedings of the National Academy of Sciences, Proceedings of the National Academy of Sciences, vol. 116(41), pages 20339-20345, October.
    6. Horowitz, Joel L. & Manski, Charles F., 1998. "Censoring of outcomes and regressors due to survey nonresponse: Identification and estimation using weights and imputations," Journal of Econometrics, Elsevier, vol. 84(1), pages 37-58, May.
    7. Horowitz, Joel L & Manski, Charles F, 1995. "Identification and Robustness with Contaminated and Corrupted Data," Econometrica, Econometric Society, vol. 63(2), pages 281-302, March.
    8. Manski, Charles F., 2016. "Credible interval estimates for official statistics with survey nonresponse," Journal of Econometrics, Elsevier, vol. 191(2), pages 293-301.
    9. Jeff Dominitz & Charles F. Manski, 2017. "More Data or Better Data? A Statistical Decision Problem," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 84(4), pages 1583-1605.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Charles F. Manski, 2015. "Communicating Uncertainty in Official Economic Statistics: An Appraisal Fifty Years after Morgenstern," Journal of Economic Literature, American Economic Association, vol. 53(3), pages 631-653, September.
    2. Nicoletti, Cheti & Peracchi, Franco & Foliano, Francesca, 2007. "Estimating income poverty in the presence of measurement error and missing data problems," ISER Working Paper Series 2007-15, Institute for Social and Economic Research.
    3. Christian Bontemps & Thierry Magnac & Eric Maurin, 2012. "Set Identified Linear Models," Econometrica, Econometric Society, vol. 80(3), pages 1129-1155, May.
    4. Guido W. Imbens & Charles F. Manski, 2004. "Confidence Intervals for Partially Identified Parameters," Econometrica, Econometric Society, vol. 72(6), pages 1845-1857, November.
    5. Francesca Molinari, 2020. "Microeconometrics with Partial Identi?cation," CeMMAP working papers CWP15/20, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    6. Nicoletti, Cheti & Peracchi, Franco & Foliano, Francesca, 2011. "Estimating Income Poverty in the Presence of Missing Data and Measurement Error," Journal of Business & Economic Statistics, American Statistical Association, vol. 29(1), pages 61-72.
    7. Rosalia Vazquez-Alvarez, 2003. "Anchoring Bias and Covariate Nonresponse," University of St. Gallen Department of Economics working paper series 2003 2003-19, Department of Economics, University of St. Gallen.
    8. Ivan A. Canay & Azeem M. Shaikh, 2016. "Practical and theoretical advances in inference for partially identified models," CeMMAP working papers CWP05/16, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    9. Christoph Feichter & Frank Moers & Oscar Timmermans, 2022. "Relative Performance Evaluation and Competitive Aggressiveness," Journal of Accounting Research, Wiley Blackwell, vol. 60(5), pages 1859-1913, December.
    10. Fakih, Ali & Makdissi, Paul & Marrouch, Walid & Tabri, Rami V. & Yazbeck, Myra, 2022. "A stochastic dominance test under survey nonresponse with an application to comparing trust levels in Lebanese public institutions," Journal of Econometrics, Elsevier, vol. 228(2), pages 342-358.
    11. Manski, Charles F., 2016. "Credible interval estimates for official statistics with survey nonresponse," Journal of Econometrics, Elsevier, vol. 191(2), pages 293-301.
    12. Molinari, Francesca, 2008. "Partial identification of probability distributions with misclassified data," Journal of Econometrics, Elsevier, vol. 144(1), pages 81-117, May.
    13. Magnac, Thierry, 2013. "Identification partielle : méthodes et conséquences pour les applications empiriques," L'Actualité Economique, Société Canadienne de Science Economique, vol. 89(4), pages 233-258, Décembre.
    14. Martin Huber & Giovanni Mellace, 2015. "Sharp Bounds on Causal Effects under Sample Selection," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 77(1), pages 129-151, February.
    15. Oliver R. Cutbill & Rami V. Tabri, 2022. "The Impossibility of Testing for Dependence Using Kendall’s Ƭ Under Missing Data of Unknown Form," Working Papers 2022-03, University of Sydney, School of Economics.
    16. Charles F. Manski, 2003. "Identification Problems in the Social Sciences and Everyday Life," Southern Economic Journal, John Wiley & Sons, vol. 70(1), pages 11-21, July.
    17. Charles F. Manski, 2014. "Communicating Uncertainty in Official Economic Statistics," NBER Working Papers 20098, National Bureau of Economic Research, Inc.
    18. Francesca Molinari, 2019. "Econometrics with Partial Identification," CeMMAP working papers CWP25/19, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    19. Molinari, Francesca, 2010. "Missing Treatments," Journal of Business & Economic Statistics, American Statistical Association, vol. 28(1), pages 82-95.
    20. Lothar Essig & Joachim K. Winter, 2009. "Item Non-Response to Financial Questions in Household Surveys: An Experimental Study of Interviewer and Mode Effects," Fiscal Studies, Institute for Fiscal Studies, vol. 30(Special I), pages 367-390, December.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2205.07388. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.