Diagnosing and Handling Common Violations of Missing at Random

Author

Listed:

Feng Ji
(University of California, Berkeley; University of Toronto)
Sophia Rabe-Hesketh
(University of California, Berkeley)
Anders Skrondal
(Norwegian Institute of Public Health; University of Oslo; University of California, Berkeley)

Registered:

Sophia Rabe-Hesketh

Abstract

Ignorable likelihood (IL) approaches are often used to handle missing data when estimating a multivariate model, such as a structural equation model. In this case, the likelihood is based on all available data, and no model is specified for the missing data mechanism. Inference proceeds via maximum likelihood or Bayesian methods, including multiple imputation without auxiliary variables. Such IL approaches are valid under a missing at random (MAR) assumption. Rabe-Hesketh and Skrondal (Ignoring non-ignorable missingness, Presidential Address at the International Meeting of the Psychometric Society, Beijing, China, 2015; Psychometrika, 2023) consider a violation of MAR in which a variable A can affect missingness of another variable B even when A is not observed. They show that this case can be handled by discarding more data before proceeding with IL approaches. This data-deletion approach is similar to the sequential estimation of Mohan et al. (Advances in Neural Information Processing Systems, 2013), which is based on their ordered factorization theorem, but is preferable for parametric models. Which kind of data deletion or ordered factorization to employ depends on the nature of the MAR violation. In this article, we therefore propose two diagnostic tests: a likelihood-ratio test for a heteroscedastic regression model and a kernel conditional independence test. We also develop a test-based estimator that first uses the diagnostic tests to determine which MAR violation appears to be present and then proceeds with the corresponding data-deletion estimator. Simulations show that the test-based estimator outperforms IL when the missing data problem is severe and performs similarly otherwise.
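
The idea of "discarding more data" before applying IL can be made concrete with a small sketch. The abstract does not state the deletion rule, so the code below assumes one plausible reading: when A is missing, the corresponding value of B is also treated as missing. The function names and the toy records are hypothetical and are not taken from the article.

```python
from typing import List, Optional, Tuple

# One record per unit: (A, B), with None marking a missing value.
Row = Tuple[Optional[float], Optional[float]]


def discard_b_where_a_missing(rows: List[Row]) -> List[Row]:
    """Return a copy in which B is set to missing whenever A is missing.

    ASSUMPTION: this deletion rule is an illustrative reading of
    "discarding more data"; it is not taken from the article.
    """
    return [(a, b if a is not None else None) for a, b in rows]


def count_discarded(before: List[Row], after: List[Row]) -> int:
    """Count B values that were observed before deletion but not after."""
    return sum(
        1
        for (_, b_old), (_, b_new) in zip(before, after)
        if b_old is not None and b_new is None
    )


# Toy data: the second unit has B observed but A missing.
rows: List[Row] = [(1.0, 2.0), (None, 3.0), (0.5, None), (None, None)]
reduced = discard_b_where_a_missing(rows)
n_discarded = count_discarded(rows, reduced)
```

Under this assumed rule, the reduced data set would then be passed to whichever IL routine is in use (for example, full-information maximum likelihood); that step is not shown here.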

Suggested Citation

Feng Ji & Sophia Rabe-Hesketh & Anders Skrondal, 2023. "Diagnosing and Handling Common Violations of Missing at Random," Psychometrika, Springer;The Psychometric Society, vol. 88(4), pages 1123-1143, December.

Handle: RePEc:spr:psycho:v:88:y:2023:i:4:d:10.1007_s11336-022-09896-0
DOI: 10.1007/s11336-022-09896-0

References listed on IDEAS

Geert Molenberghs & Caroline Beunckens & Cristina Sotto & Michael G. Kenward, 2008. "Every missingness not at random model has a missingness at random counterpart with equal fit," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 70(2), pages 371-388, April.
P. Diggle & M. G. Kenward, 1994. "Informative Drop‐Out in Longitudinal Data Analysis," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 43(1), pages 49-73, March.
Iavor I Bojinov & Natesh S Pillai & Donald B Rubin, 2020. "Diagnosing missing always at random in multivariate data," Biometrika, Biometrika Trust, vol. 107(1), pages 246-253.
Oberski, Daniel, 2014. "lavaan.survey: An R Package for Complex Survey Analysis of Structural Equation Models," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 57(i01).
Sophia Rabe-Hesketh & Anders Skrondal, 2023. "Ignoring Non-ignorable Missingness," Psychometrika, Springer;The Psychometric Society, vol. 88(1), pages 31-50, March.
James Heckman, 2013. "Sample selection bias as a specification error," Applied Econometrics, Russian Presidential Academy of National Economy and Public Administration (RANEPA), vol. 31(3), pages 129-137.
- Heckman, James J, 1979. "Sample Selection Bias as a Specification Error," Econometrica, Econometric Society, vol. 47(1), pages 153-161, January.
Karthika Mohan & Judea Pearl, 2021. "Graphical Models for Processing Missing Data," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 116(534), pages 1023-1037, April.
van der Wal, Willem M. & Geskus, Ronald B., 2011. "ipw: An R Package for Inverse Probability Weighting," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 43(i13).
Hausman, Jerry A & Wise, David A, 1979. "Attrition Bias in Experimental and Panel Data: The Gary Income Maintenance Experiment," Econometrica, Econometric Society, vol. 47(2), pages 455-473, March.
A. Skrondal & S. Rabe-Hesketh, 2014. "Protective estimation of mixed-effects logistic regression when data are not missing at random," Biometrika, Biometrika Trust, vol. 101(1), pages 175-188.
Roderick J. Little & Nanhua Zhang, 2011. "Subsample ignorable likelihood for regression analysis with missing data," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 60(4), pages 591-605, August.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Simon Calmar Andersen & Louise Beuchert & Phillip Heiler & Helena Skyt Nielsen, 2023. "A Guide to Impact Evaluation under Sample Selection and Missing Data: Teacher's Aides and Adolescent Mental Health," Papers 2308.04963, arXiv.org.
Anders Skrondal & Sophia Rabe-Hesketh, 2022. "The Role of Conditional Likelihoods in Latent Variable Modeling," Psychometrika, Springer;The Psychometric Society, vol. 87(3), pages 799-834, September.
Betsy J. Feldman & Sophia Rabe-Hesketh, 2012. "Modeling Achievement Trajectories When Attrition Is Informative," Journal of Educational and Behavioral Statistics, vol. 37(6), pages 703-736, December.
Sophia Rabe-Hesketh & Anders Skrondal, 2023. "Ignoring Non-ignorable Missingness," Psychometrika, Springer;The Psychometric Society, vol. 88(1), pages 31-50, March.
D'Addio, Anna Cristina & De Greef, Isabelle & Rosholm, Michael, 2002. "Assessing Unemployment Traps in Belgium Using Panel Data Sample Selection Models," IZA Discussion Papers 669, Institute of Labor Economics (IZA).
- Anna Cristina d'Addio & Isabelle De Greef & Michael Rosholm, 2002. "Assessing Unemployment Traps in Belgium using Panel Data Sample Selection models," 10th International Conference on Panel Data, Berlin, July 5-6, 2002 C1-3, International Conferences on Panel Data.
Meyer, Maximilian & Hulke, Carolin & Kamwi, Jonathan & Kolem, Hannah & Börner, Jan, 2022. "Spatially heterogeneous effects of collective action on environmental dependence in Namibia’s Zambezi region," World Development, Elsevier, vol. 159(C).
Verbeek, M.J.C.M. & Nijman, T.E., 1992. "Incomplete panels and selection bias : A survey," Discussion Paper 1992-7, Tilburg University, Center for Economic Research.
- Verbeek, M. & Nijman, T., 1992. "Incomplete Panels and Selection Bias: A Survey," Papers 9207, Tilburg - Center for Economic Research.
Keisuke Hirano & Guido W. Imbens & Geert Ridder & Donald B. Rubin, 2001. "Combining Panel Data Sets with Attrition and Refreshment Samples," Econometrica, Econometric Society, vol. 69(6), pages 1645-1659, November.
- Keisuke Hirano & Guido W. Imbens & Geert Ridder & Donald B. Rubin, 1998. "Combining Panel Data Sets with Attrition and Refreshment Samples," Tinbergen Institute Discussion Papers 98-033/4, Tinbergen Institute.
- Keisuke Hirano & Guido W. Imbens & Geert Ridder & Donald B. Rubin, 1998. "Combining Panel Data Sets with Attrition and Refreshment Samples," NBER Technical Working Papers 0230, National Bureau of Economic Research, Inc.
Martin Huber, 2012. "Identification of Average Treatment Effects in Social Experiments Under Alternative Forms of Attrition," Journal of Educational and Behavioral Statistics, vol. 37(3), pages 443-474, June.
Deniz Dutz & Ingrid Huitfeldt & Santiago Lacouture & Magne Mogstad & Alexander Torgovitsky & Winnie van Dijk, 2021. "Selection in Surveys: Using Randomized Incentives to Detect and Account for Nonresponse Bias," NBER Working Papers 29549, National Bureau of Economic Research, Inc.
- Deniz Dutz & Ingrid Huitfeldt & Santiago Lacouture & Magne Mogstad & Alexander Torgovitsky & Winnie van Dijk, 2025. "Selection in Surveys: Using Randomized Incentives to Detect and Account for Nonresponse Bias," Cowles Foundation Discussion Papers 2451, Cowles Foundation for Research in Economics, Yale University.
E. Michael Foster & Grace Y. Fang, 2004. "Alternative Methods for Handling Attrition," Evaluation Review, vol. 28(5), pages 434-464, October.
Hajivassiliou, Vassilis A. & Ruud, Paul A., 1986. "Classical estimation methods for LDV models using simulation," Handbook of Econometrics, in: R. F. Engle & D. McFadden (ed.), Handbook of Econometrics, edition 1, volume 4, chapter 40, pages 2383-2441, Elsevier.
- Vassilis A. Hajivassiliou & Paul A. Ruud, 1993. "Classical Estimation Methods for LDV Models Using Simulation," Cowles Foundation Discussion Papers 1051, Cowles Foundation for Research in Economics, Yale University.
- V.A. Hajivassiliou & P. A. Ruud, 1993. "Classical Estimation Methods for LDV Models Using Simulation," Econometrics 9311002, University Library of Munich, Germany.
- Vassilis A. Hajivassiliou and Paul A. Ruud., 1993. "Classical Estimation Methods for LDV Models Using Simulation," Economics Working Papers 93-219, University of California at Berkeley.
John Fitzgerald & Peter Gottschalk & Robert Moffitt, 1998. "An Analysis of Sample Attrition in Panel Data: The Michigan Panel Study of Income Dynamics," Journal of Human Resources, University of Wisconsin Press, vol. 33(2), pages 251-299.
- J. Fitzgerald & P. Gottschalk & R. Moffitt, "undated". "An Analysis of Sample Attrition in Panel Data: The Michigan Panel Study of Income Dynamics," Institute for Research on Poverty Discussion Papers 1156-98, University of Wisconsin Institute for Research on Poverty.
- John Fitzgerald & Peter Gottschalk & Robert Moffitt, 1998. "An Analysis of Sample Attrition in Panel Data: The Michigan Panel Study of Income Dynamics," NBER Technical Working Papers 0220, National Bureau of Economic Research, Inc.
- John Fitzgerald & Peter Gottschalk & Robert Moffitt, 1998. "An Analysis of Sample Attrition in Panel Data: The Michigan Panel Study of income Dynamics," Economics Working Paper Archive 379, The Johns Hopkins University,Department of Economics.
- John Fitzgerald & Peter Gottschalk & Robert Moffitt, 1997. "An Analysis of Sample Attrition in Panel Data: The Michigan Panel Study of Income Dynamics," Boston College Working Papers in Economics 394, Boston College Department of Economics.
Michael Fertig & Stefanie Schurer, 2007. "Earnings Assimilation of Immigrants in Germany: The Importance of Heterogeneity and Attrition Bias," SOEPpapers on Multidisciplinary Panel Data Research 30, DIW Berlin, The German Socio-Economic Panel (SOEP).
Shin, Jaeun & Moon, Sangho, 2006. "Fertility, relative wages, and labor market decisions: A case of female teachers," Economics of Education Review, Elsevier, vol. 25(6), pages 591-604, December.
Evans, Lawrance & Schwartz, Jeremy, 2014. "The effect of concentration and regulation on audit fees: An application of panel data techniques," Journal of Empirical Finance, Elsevier, vol. 27(C), pages 130-144.
ter Horst, Jenke R. & Nijman, Theo E. & Verbeek, Marno, 2001. "Eliminating look-ahead bias in evaluating persistence in mutual fund performance," Journal of Empirical Finance, Elsevier, vol. 8(4), pages 345-373, September.
- Ter Horst, J.R. & Nijman, T.E. & Verbeek, M.J.C.M., 2001. "Eliminating look-ahead bias in evaluating persistence in mutual fund performance," Other publications TiSEM 144f0bd4-7142-4af6-aeda-0, Tilburg University, School of Economics and Management.
Hübler, Olaf, 2005. "Panel Data Econometrics: Modelling and Estimation," Hannover Economic Papers (HEP) dp-319, Leibniz Universität Hannover, Wirtschaftswissenschaftliche Fakultät.
Shu Xu & Shelley A. Blozis, 2011. "Sensitivity Analysis of Mixed Models for Incomplete Longitudinal Data," Journal of Educational and Behavioral Statistics, vol. 36(2), pages 237-256, April.
Bian, Yuan & Yi, Grace Y. & He, Wenqing, 2024. "A unified framework of analyzing missing data and variable selection using regularized likelihood," Computational Statistics & Data Analysis, Elsevier, vol. 194(C).
