IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v82y2015icp74-88.html
   My bibliography  Save this article

An efficient and robust variable selection method for longitudinal generalized linear models

Author

Listed:
  • Lv, Jing
  • Yang, Hu
  • Guo, Chaohui

Abstract

This paper presents a new efficient and robust smooth-threshold generalized estimating equations for generalized linear models (GLMs) with longitudinal data. The proposed method is based on a bounded exponential score function and leverage-based weights to achieve robustness against outliers both in the response and the covariate domain. Our motivation for the new variable selection procedure is that it enables us to achieve better robustness and efficiency by introducing an additional tuning parameter γ which can be automatically selected using the observed data. Moreover, its performance is near optimal and superior to some recently developed variable selection methods. Under some regularity conditions, the resulting estimator possesses the consistency in variable selection and the oracle property in estimation. Finally, simulation studies and a detailed real data analysis are carried out to assess and illustrate the finite sample performance, which show that the proposed method works better than other existing methods, in particular, when many outliers are included.

Suggested Citation

  • Lv, Jing & Yang, Hu & Guo, Chaohui, 2015. "An efficient and robust variable selection method for longitudinal generalized linear models," Computational Statistics & Data Analysis, Elsevier, vol. 82(C), pages 74-88.
  • Handle: RePEc:eee:csdana:v:82:y:2015:i:c:p:74-88
    DOI: 10.1016/j.csda.2014.08.006
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167947314002382
    Download Restriction: Full text for ScienceDirect subscribers only.

    File URL: https://libkey.io/10.1016/j.csda.2014.08.006?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Qin, Guoyou & Zhu, Zhongyi, 2007. "Robust estimation in generalized semiparametric mixed models for longitudinal data," Journal of Multivariate Analysis, Elsevier, vol. 98(8), pages 1658-1683, September.
    2. Naisyin Wang & Raymond J. Carroll & Xihong Lin, 2005. "Efficient Semiparametric Marginal Estimation for Longitudinal/Clustered Data," Journal of the American Statistical Association, American Statistical Association, vol. 100, pages 147-157, March.
    3. Lan Wang & Jianhui Zhou & Annie Qu, 2012. "Penalized Generalized Estimating Equations for High-Dimensional Longitudinal Data Analysis," Biometrics, The International Biometric Society, vol. 68(2), pages 353-360, June.
    4. Weixin Yao & Bruce Lindsay & Runze Li, 2012. "Local modal regression," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 24(3), pages 647-663.
    5. Koenker,Roger, 2005. "Quantile Regression," Cambridge Books, Cambridge University Press, number 9780521845731.
    6. He, Xuming & Fung, Wing K. & Zhu, Zhongyi, 2005. "Robust Estimation in Generalized Partial Linear Models for Clustered Data," Journal of the American Statistical Association, American Statistical Association, vol. 100, pages 1176-1184, December.
    7. Xueying Zheng & Wing Fung & Zhongyi Zhu, 2013. "Robust estimation in joint mean–covariance regression model for longitudinal data," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 65(4), pages 617-638, August.
    8. Fan J. & Li R., 2001. "Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 1348-1360, December.
    9. You-Gan Wang & Xu Lin & Min Zhu, 2005. "Robust Estimating Functions and Bias Correction for Longitudinal Data Analysis," Biometrics, The International Biometric Society, vol. 61(3), pages 684-691, September.
    10. Inbal Yahav & Galit Shmueli, 2012. "On generating multivariate Poisson data in management science applications," Applied Stochastic Models in Business and Industry, John Wiley & Sons, vol. 28(1), pages 91-102, January.
    11. Li, Gaorong & Lian, Heng & Feng, Sanying & Zhu, Lixing, 2013. "Automatic variable selection for longitudinal generalized linear models," Computational Statistics & Data Analysis, Elsevier, vol. 61(C), pages 174-186.
    12. Liu, Jicai & Zhang, Riquan & Zhao, Weihua & Lv, Yazhao, 2013. "A robust and efficient estimation method for single index models," Journal of Multivariate Analysis, Elsevier, vol. 122(C), pages 226-238.
    13. Xu, Peirong & Zhu, Lixing, 2012. "Estimation for a marginal generalized single-index longitudinal model," Journal of Multivariate Analysis, Elsevier, vol. 105(1), pages 285-299.
    14. Masao Ueki, 2009. "A note on automatic variable selection using smooth-threshold estimating equations," Biometrika, Biometrika Trust, vol. 96(4), pages 1005-1011.
    15. Guo, Jie & Tang, Manlai & Tian, Maozai & Zhu, Kai, 2013. "Variable selection in high-dimensional partially linear additive models for composite quantile regression," Computational Statistics & Data Analysis, Elsevier, vol. 65(C), pages 56-67.
    16. Riquan Zhang & Weihua Zhao & Jicai Liu, 2013. "Robust estimation and variable selection for semiparametric partially linear varying coefficient model based on modal regression," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 25(2), pages 523-544, June.
    17. Xueqin Wang & Yunlu Jiang & Mian Huang & Heping Zhang, 2013. "Robust Variable Selection With Exponential Squared Loss," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 108(502), pages 632-643, June.
    18. Yao, Weixin & Wang, Qin, 2013. "Robust variable selection through MAVE," Computational Statistics & Data Analysis, Elsevier, vol. 63(C), pages 42-49.
    19. Fan, Yali & Qin, Guoyou & Zhu, Zhongyi, 2012. "Variable selection in robust regression models for longitudinal data," Journal of Multivariate Analysis, Elsevier, vol. 109(C), pages 156-167.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Wang, Kangning & Li, Shaomin & Sun, Xiaofei & Lin, Lu, 2019. "Modal regression statistical inference for longitudinal data semivarying coefficient models: Generalized estimating equations, empirical likelihood and variable selection," Computational Statistics & Data Analysis, Elsevier, vol. 133(C), pages 257-276.
    2. Zimu Chen & Zhanfeng Wang & Yuan‐chin Ivan Chang, 2020. "Sequential adaptive variables and subject selection for GEE methods," Biometrics, The International Biometric Society, vol. 76(2), pages 496-507, June.
    3. Guney, Yesim & Arslan, Olcay & Yavuz, Fulya Gokalp, 2022. "Robust estimation in multivariate heteroscedastic regression models with autoregressive covariance structures using EM algorithm," Journal of Multivariate Analysis, Elsevier, vol. 191(C).
    4. Kangning Wang & Mengjie Hao & Xiaofei Sun, 2021. "Robust and efficient estimating equations for longitudinal data partial linear models and its applications," Statistical Papers, Springer, vol. 62(5), pages 2147-2168, October.
    5. You‐Gan Wang & Xu Lin & Min Zhu, 2020. "Rejoinder to “Comment on ‘Wang et al. (2005), Robust estimating functions and bias correction for longitudinal data analysis’ by Nicola Lunardon and Giovanna Menardi”," Biometrics, The International Biometric Society, vol. 76(3), pages 1043-1044, September.
    6. Liya Fu & Zhuoran Yang & Fengjing Cai & You-Gan Wang, 2021. "Efficient and doubly-robust methods for variable selection and parameter estimation in longitudinal data analysis," Computational Statistics, Springer, vol. 36(2), pages 781-804, June.
    7. Kangning Wang & Lu Lin, 2019. "Robust and efficient estimator for simultaneous model structure identification and variable selection in generalized partial linear varying coefficient models with longitudinal data," Statistical Papers, Springer, vol. 60(5), pages 1649-1676, October.
    8. Kangning Wang & Wen Shan, 2021. "Copula and composite quantile regression-based estimating equations for longitudinal data," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 73(3), pages 441-455, June.
    9. Zhang, Yuexia & Qin, Guoyou & Zhu, Zhongyi & Xu, Wanghong, 2019. "A novel robust approach for analysis of longitudinal data," Computational Statistics & Data Analysis, Elsevier, vol. 138(C), pages 83-95.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Wang, Kangning & Li, Shaomin & Sun, Xiaofei & Lin, Lu, 2019. "Modal regression statistical inference for longitudinal data semivarying coefficient models: Generalized estimating equations, empirical likelihood and variable selection," Computational Statistics & Data Analysis, Elsevier, vol. 133(C), pages 257-276.
    2. Kangning Wang & Lu Lin, 2019. "Robust and efficient estimator for simultaneous model structure identification and variable selection in generalized partial linear varying coefficient models with longitudinal data," Statistical Papers, Springer, vol. 60(5), pages 1649-1676, October.
    3. Kangning Wang & Wen Shan, 2021. "Copula and composite quantile regression-based estimating equations for longitudinal data," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 73(3), pages 441-455, June.
    4. Yang, Hu & Yang, Jing, 2014. "A robust and efficient estimation and variable selection method for partially linear single-index models," Journal of Multivariate Analysis, Elsevier, vol. 129(C), pages 227-242.
    5. Kangning Wang & Mengjie Hao & Xiaofei Sun, 2021. "Robust and efficient estimating equations for longitudinal data partial linear models and its applications," Statistical Papers, Springer, vol. 62(5), pages 2147-2168, October.
    6. Wang, Kangning & Li, Shaomin, 2021. "Robust distributed modal regression for massive data," Computational Statistics & Data Analysis, Elsevier, vol. 160(C).
    7. Geronimi, J. & Saporta, G., 2017. "Variable selection for multiply-imputed data with penalized generalized estimating equations," Computational Statistics & Data Analysis, Elsevier, vol. 110(C), pages 103-114.
    8. Lv, Zhike & Zhu, Huiming & Yu, Keming, 2014. "Robust variable selection for nonlinear models with diverging number of parameters," Statistics & Probability Letters, Elsevier, vol. 91(C), pages 90-97.
    9. Yunlu Jiang & Guo-Liang Tian & Yu Fei, 2019. "A robust and efficient estimation method for partially nonlinear models via a new MM algorithm," Statistical Papers, Springer, vol. 60(6), pages 2063-2085, December.
    10. Mozhgan Taavoni & Mohammad Arashi & Samuel Manda, 2023. "Multicollinearity and Linear Predictor Link Function Problems in Regression Modelling of Longitudinal Data," Mathematics, MDPI, vol. 11(3), pages 1-9, January.
    11. Tian, Ruiqin & Xue, Liugen & Xu, Dengke, 2016. "Automatic variable selection for varying coefficient models with longitudinal data," Statistics & Probability Letters, Elsevier, vol. 119(C), pages 84-90.
    12. Xueying Zheng & Wing Fung & Zhongyi Zhu, 2013. "Robust estimation in joint mean–covariance regression model for longitudinal data," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 65(4), pages 617-638, August.
    13. Jing Lv & Chaohui Guo, 2017. "Efficient parameter estimation via modified Cholesky decomposition for quantile regression with longitudinal data," Computational Statistics, Springer, vol. 32(3), pages 947-975, September.
    14. Kangning Wang & Lu Lin, 2017. "Robust and efficient direction identification for groupwise additive multiple-index models and its applications," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 26(1), pages 22-45, March.
    15. Li, Gaorong & Lian, Heng & Feng, Sanying & Zhu, Lixing, 2013. "Automatic variable selection for longitudinal generalized linear models," Computational Statistics & Data Analysis, Elsevier, vol. 61(C), pages 174-186.
    16. Yang, Jing & Tian, Guoliang & Lu, Fang & Lu, Xuewen, 2020. "Single-index modal regression via outer product gradients," Computational Statistics & Data Analysis, Elsevier, vol. 144(C).
    17. Hu Yang & Ning Li & Jing Yang, 2020. "A robust and efficient estimation and variable selection method for partially linear models with large-dimensional covariates," Statistical Papers, Springer, vol. 61(5), pages 1911-1937, October.
    18. Ullah, Aman & Wang, Tao & Yao, Weixin, 2023. "Semiparametric partially linear varying coefficient modal regression," Journal of Econometrics, Elsevier, vol. 235(2), pages 1001-1026.
    19. Liya Fu & Zhuoran Yang & Fengjing Cai & You-Gan Wang, 2021. "Efficient and doubly-robust methods for variable selection and parameter estimation in longitudinal data analysis," Computational Statistics, Springer, vol. 36(2), pages 781-804, June.
    20. Guo You Qin & Zhong Yi Zhu, 2009. "Robustified Maximum Likelihood Estimation in Generalized Partial Linear Mixed Model for Longitudinal Data," Biometrics, The International Biometric Society, vol. 65(1), pages 52-59, March.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:82:y:2015:i:c:p:74-88. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.