IDEAS home Printed from https://ideas.repec.org/a/bpj/ijbist/v13y2017i1p20n6.html
   My bibliography  Save this article

Empirical Likelihood in Nonignorable Covariate-Missing Data Problems

Author

Listed:
  • Xie Yanmei
  • Zhang Biao

    (Department of Mathematics and Statistics, The University of Toledo, Toledo, OH 43606, USA)

Abstract

Missing covariate data occurs often in regression analysis, which frequently arises in the health and social sciences as well as in survey sampling. We study methods for the analysis of a nonignorable covariate-missing data problem in an assumed conditional mean function when some covariates are completely observed but other covariates are missing for some subjects. We adopt the semiparametric perspective of Bartlett et al. (Improving upon the efficiency of complete case analysis when covariates are MNAR. Biostatistics 2014;15:719–30) on regression analyses with nonignorable missing covariates, in which they have introduced the use of two working models, the working probability model of missingness and the working conditional score model. In this paper, we study an empirical likelihood approach to nonignorable covariate-missing data problems with the objective of effectively utilizing the two working models in the analysis of covariate-missing data. We propose a unified approach to constructing a system of unbiased estimating equations, where there are more equations than unknown parameters of interest. One useful feature of these unbiased estimating equations is that they naturally incorporate the incomplete data into the data analysis, making it possible to seek efficient estimation of the parameter of interest even when the working regression function is not specified to be the optimal regression function. We apply the general methodology of empirical likelihood to optimally combine these unbiased estimating equations. We propose three maximum empirical likelihood estimators of the underlying regression parameters and compare their efficiencies with other existing competitors. We present a simulation study to compare the finite-sample performance of various methods with respect to bias, efficiency, and robustness to model misspecification. The proposed empirical likelihood method is also illustrated by an analysis of a data set from the US National Health and Nutrition Examination Survey (NHANES).

Suggested Citation

  • Xie Yanmei & Zhang Biao, 2017. "Empirical Likelihood in Nonignorable Covariate-Missing Data Problems," The International Journal of Biostatistics, De Gruyter, vol. 13(1), pages 1-20, May.
  • Handle: RePEc:bpj:ijbist:v:13:y:2017:i:1:p:20:n:6
    DOI: 10.1515/ijb-2016-0053
    as

    Download full text from publisher

    File URL: https://doi.org/10.1515/ijb-2016-0053
    Download Restriction: For access to full text, subscription to the journal or payment for the individual article is required.

    File URL: https://libkey.io/10.1515/ijb-2016-0053?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Jing Qin & Biao Zhang, 2007. "Empirical‐likelihood‐based inference in missing response problems and its application in observational studies," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 69(1), pages 101-122, February.
    2. Qihua Wang, 2002. "Empirical likelihood-based inference in linear errors-in-covariables models with validation data," Biometrika, Biometrika Trust, vol. 89(2), pages 345-358, June.
    3. Hua Liang & Suojin Wang & Raymond J. Carroll, 2007. "Partially linear models with missing response variables and error-prone covariates," Biometrika, Biometrika Trust, vol. 94(1), pages 185-198.
    4. Qihua Wang & Pengjie Dai, 2008. "Semiparametric model-based inference in the presence of missing responses," Biometrika, Biometrika Trust, vol. 95(3), pages 721-734.
    5. Joseph G. Ibrahim & Ming-Hui Chen & Stuart R. Lipsitz & Amy H. Herring, 2005. "Missing-Data Methods for Generalized Linear Models: A Comparative Review," Journal of the American Statistical Association, American Statistical Association, vol. 100, pages 332-346, March.
    6. Chen S.X. & Leung D.H.Y. & Qin J., 2003. "Information Recovery in a Study With Surrogate Endpoints," Journal of the American Statistical Association, American Statistical Association, vol. 98, pages 1052-1062, January.
    7. Stute, Winfried & Xue, Liugen & Zhu, Lixing, 2007. "Empirical Likelihood Inference in Nonlinear Errors-in-Covariables Models With Validation Data," Journal of the American Statistical Association, American Statistical Association, vol. 102, pages 332-346, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Biao Zhang, 2016. "Empirical Likelihood in Causal Inference," Econometric Reviews, Taylor & Francis Journals, vol. 35(2), pages 201-231, February.
    2. Wang, Qihua & Lai, Peng, 2011. "Empirical likelihood calibration estimation for the median treatment difference in observational studies," Computational Statistics & Data Analysis, Elsevier, vol. 55(4), pages 1596-1609, April.
    3. Zhao, Yichuan & Chen, Feiming, 2008. "Empirical likelihood inference for censored median regression model via nonparametric kernel estimation," Journal of Multivariate Analysis, Elsevier, vol. 99(2), pages 215-231, February.
    4. Xue, Liugen, 2009. "Empirical likelihood for linear models with missing responses," Journal of Multivariate Analysis, Elsevier, vol. 100(7), pages 1353-1366, August.
    5. Liugen Xue, 2009. "Empirical Likelihood Confidence Intervals for Response Mean with Data Missing at Random," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 36(4), pages 671-685, December.
    6. Xue, Liugen & Xue, Dong, 2011. "Empirical likelihood for semiparametric regression model with missing response data," Journal of Multivariate Analysis, Elsevier, vol. 102(4), pages 723-740, April.
    7. Tang, Linjun & Zhou, Zhangong & Wu, Changchun, 2013. "Testing the linear errors-in-variables model with randomly censored data," Statistics & Probability Letters, Elsevier, vol. 83(3), pages 875-884.
    8. Peisong Han, 2016. "Combining Inverse Probability Weighting and Multiple Imputation to Improve Robustness of Estimation," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 43(1), pages 246-260, March.
    9. Boumahdi, Mounir & Ouassou, Idir & Rachdi, Mustapha, 2023. "Estimation in nonparametric functional-on-functional models with surrogate responses," Journal of Multivariate Analysis, Elsevier, vol. 198(C).
    10. Yang, Yiping & Li, Gaorong & Peng, Heng, 2014. "Empirical likelihood of varying coefficient errors-in-variables models with longitudinal data," Journal of Multivariate Analysis, Elsevier, vol. 127(C), pages 1-18.
    11. Wang, Qihua & Su, Miaomiao & Wang, Ruoyu, 2021. "A beyond multiple robust approach for missing response problem," Computational Statistics & Data Analysis, Elsevier, vol. 155(C).
    12. Lu, Xuewen, 2009. "Empirical likelihood for heteroscedastic partially linear models," Journal of Multivariate Analysis, Elsevier, vol. 100(3), pages 387-396, March.
    13. Denis Heng Yan Leung & Ken Yamada & Biao Zhang, 2015. "Enriching Surveys with Supplementary Data and its Application to Studying Wage Regression," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 42(1), pages 155-179, March.
    14. Baojiang Chen & Xiao-Hua Zhou, 2013. "Generalized Partially Linear Models for Incomplete Longitudinal Data In the Presence of Population-Level Information," Biometrics, The International Biometric Society, vol. 69(2), pages 386-395, June.
    15. Zhong Guan & Jing Qin, 2017. "Empirical likelihood method for non-ignorable missing data problems," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 23(1), pages 113-135, January.
    16. Xue, Liugen & Zhang, Jinghua, 2020. "Empirical likelihood for partially linear single-index models with missing observations," Computational Statistics & Data Analysis, Elsevier, vol. 144(C).
    17. Zheng, Ming & Yu, Wen, 2011. "An empirical likelihood approach to data analysis under two-stage sampling designs," Statistics & Probability Letters, Elsevier, vol. 81(8), pages 947-956, August.
    18. Wang, Xuan & Wang, Qihua, 2015. "Semiparametric linear transformation model with differential measurement error and validation sampling," Journal of Multivariate Analysis, Elsevier, vol. 141(C), pages 67-80.
    19. Majid Mojirsheibani & Timothy Reese, 2017. "Kernel regression estimation for incomplete data with applications," Statistical Papers, Springer, vol. 58(1), pages 185-209, March.
    20. Wang, Qihua & Zhang, Riquan, 2009. "Statistical estimation in varying coefficient models with surrogate data and validation sampling," Journal of Multivariate Analysis, Elsevier, vol. 100(10), pages 2389-2405, November.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bpj:ijbist:v:13:y:2017:i:1:p:20:n:6. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Peter Golla (email available below). General contact details of provider: https://www.degruyter.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.