IDEAS home Printed from https://ideas.repec.org/a/spr/psycho/v80y2015i3p707-726.html
   My bibliography  Save this article

A Nonparametric Test of Missing Completely at Random for Incomplete Multivariate Data

Author

Listed:
  • Jun Li
  • Yao Yu

Abstract

Missing data occur in many real world studies. Knowing the type of missing mechanisms is important for adopting appropriate statistical analysis procedure. Many statistical methods assume missing completely at random (MCAR) due to its simplicity. Therefore, it is necessary to test whether this assumption is satisfied before applying those procedures. In the literature, most of the procedures for testing MCAR were developed under normality assumption which is sometimes difficult to justify in practice. In this paper, we propose a nonparametric test of MCAR for incomplete multivariate data which does not require distributional assumptions. The proposed test is carried out by comparing the distributions of the observed data across different missing-pattern groups. We prove that the proposed test is consistent against any distributional differences in the observed data. Simulation shows that the proposed procedure has the Type I error well controlled at the nominal level for testing MCAR and also has good power against a variety of non-MCAR alternatives. Copyright The Psychometric Society 2015

Suggested Citation

  • Jun Li & Yao Yu, 2015. "A Nonparametric Test of Missing Completely at Random for Incomplete Multivariate Data," Psychometrika, Springer;The Psychometric Society, vol. 80(3), pages 707-726, September.
  • Handle: RePEc:spr:psycho:v:80:y:2015:i:3:p:707-726
    DOI: 10.1007/s11336-014-9410-4
    as

    Download full text from publisher

    File URL: http://hdl.handle.net/10.1007/s11336-014-9410-4
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1007/s11336-014-9410-4?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Kevin Kim & Peter Bentler, 2002. "Tests of homogeneity of means and covariance matrices for multivariate incomplete data," Psychometrika, Springer;The Psychometric Society, vol. 67(4), pages 609-623, December.
    2. Mortaza Jamshidian & Siavash Jalal, 2010. "Tests of Homoscedasticity, Normality, and Missing Completely at Random for Incomplete Multivariate Data," Psychometrika, Springer;The Psychometric Society, vol. 75(4), pages 649-674, December.
    3. Annie Qu, 2002. "Testing ignorable missingness in estimating equation approaches for longitudinal data," Biometrika, Biometrika Trust, vol. 89(4), pages 841-850, December.
    4. Szekely, Gábor J. & Rizzo, Maria L., 2005. "A new test for multivariate normality," Journal of Multivariate Analysis, Elsevier, vol. 93(1), pages 58-80, March.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Hairu Wang & Zhiping Lu & Yukun Liu, 2023. "Score test for missing at random or not under logistic missingness models," Biometrics, The International Biometric Society, vol. 79(2), pages 1268-1279, June.
    2. Nobumichi Shutoh & Takahiro Nishiyama & Masashi Hyodo, 2017. "Bartlett correction to the likelihood ratio test for MCAR with two-step monotone sample," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 71(3), pages 184-199, August.
    3. Shen‐Ming Lee & Wen‐Han Hwang & Jean de Dieu Tapsoba, 2016. "Estimation in closed capture–recapture models when covariates are missing at random," Biometrics, The International Biometric Society, vol. 72(4), pages 1294-1304, December.
    4. Ke-Hai Yuan & Mortaza Jamshidian & Yutaka Kano, 2018. "Missing Data Mechanisms and Homogeneity of Means and Variances–Covariances," Psychometrika, Springer;The Psychometric Society, vol. 83(2), pages 425-442, June.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ke-Hai Yuan & Mortaza Jamshidian & Yutaka Kano, 2018. "Missing Data Mechanisms and Homogeneity of Means and Variances–Covariances," Psychometrika, Springer;The Psychometric Society, vol. 83(2), pages 425-442, June.
    2. Hairu Wang & Zhiping Lu & Yukun Liu, 2023. "Score test for missing at random or not under logistic missingness models," Biometrics, The International Biometric Society, vol. 79(2), pages 1268-1279, June.
    3. Chassan, Malika & Concordet, Didier, 2023. "How to test the missing data mechanism in a hidden Markov model," Computational Statistics & Data Analysis, Elsevier, vol. 182(C).
    4. Jamshidian, Mortaza & Jalal, Siavash & Jansen, Camden, 2014. "MissMech: An R Package for Testing Homoscedasticity, Multivariate Normality, and Missing Completely at Random (MCAR)," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 56(i06).
    5. John Lawrence, 2023. "Moments of the Noncentral Chi Distribution," Sankhya A: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 85(2), pages 1243-1259, August.
    6. Gábor J. Székely & Maria L. Rizzo, 2020. "Comments on: Tests for multivariate normality—a critical review with emphasis on weighted $$L^{2}$$ L 2 -statistics," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 29(4), pages 907-910, December.
    7. Constandina Koki & Loukia Meligkotsidou & Ioannis Vrontos, 2020. "Forecasting under model uncertainty: Non‐homogeneous hidden Markov models with Pòlya‐Gamma data augmentation," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 39(4), pages 580-598, July.
    8. Norbert Henze & María Dolores Jiménez-Gamero, 2019. "A new class of tests for multinormality with i.i.d. and garch data based on the empirical moment generating function," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 28(2), pages 499-521, June.
    9. Quessy, Jean-François, 2021. "A Szekely–Rizzo inequality for testing general copula homogeneity hypotheses," Journal of Multivariate Analysis, Elsevier, vol. 186(C).
    10. Florian Ziel & Kevin Berk, 2019. "Multivariate Forecasting Evaluation: On Sensitive and Strictly Proper Scoring Rules," Papers 1910.07325, arXiv.org.
    11. Philip Dörr & Bruno Ebner & Norbert Henze, 2021. "A new test of multivariate normality by a double estimation in a characterizing PDE," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 84(3), pages 401-427, April.
    12. Ravi Kashyap, 2016. "The Perfect Marriage and Much More: Combining Dimension Reduction, Distance Measures and Covariance," Papers 1603.09060, arXiv.org, revised Jul 2019.
    13. Tomasz Górecki & Lajos Horváth & Piotr Kokoszka, 2020. "Tests of Normality of Functional Data," International Statistical Review, International Statistical Institute, vol. 88(3), pages 677-697, December.
    14. Araújo, Tanya & Dias, João & Eleutério, Samuel & Louçã, Francisco, 2013. "A measure of multivariate kurtosis for the identification of the dynamics of a N-dimensional market," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 392(17), pages 3708-3714.
    15. Kashyap, Ravi, 2019. "The perfect marriage and much more: Combining dimension reduction, distance measures and covariance," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 536(C).
    16. Soha Saad & Florence Ossart & Jean Bigeon & Etienne Sourdille & Harold Gance, 2021. "Global Sensitivity Analysis Applied to Train Traffic Rescheduling: A Comparative Study," Energies, MDPI, vol. 14(19), pages 1-29, October.
    17. Rizzo, Maria L. & Haman, John T., 2016. "Expected distances and goodness-of-fit for the asymmetric Laplace distribution," Statistics & Probability Letters, Elsevier, vol. 117(C), pages 158-164.
    18. Chen, Xiao & Feng, Zhenghui & Peng, Heng, 2023. "Estimation and order selection for multivariate exponential power mixture models," Journal of Multivariate Analysis, Elsevier, vol. 195(C).
    19. He, Yaoyao & Cao, Chaojin & Wang, Shuo & Fu, Hong, 2022. "Nonparametric probabilistic load forecasting based on quantile combination in electrical power systems," Applied Energy, Elsevier, vol. 322(C).
    20. Cees Diks & Valentyn Panchenko, 2005. "Nonparametric Tests for Serial Independence Based on Quadratic Forms," Tinbergen Institute Discussion Papers 05-076/1, Tinbergen Institute.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:psycho:v:80:y:2015:i:3:p:707-726. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.