IDEAS home Printed from https://ideas.repec.org/a/bla/scjsta/v36y2009i4p620-635.html

Variable Selection for Panel Count Data via Non‐Concave Penalized Estimating Function

Author

Listed:
  • Xingwei Tong
  • Xin He
  • Liuquan Sun
  • Jianguo Sun

Abstract

Variable selection is an important issue in all regression analyses, and in this paper we discuss this in the context of regression analysis of panel count data. Panel count data often occur in long‐term studies that concern occurrence rate of a recurrent event, and their analysis has recently attracted a great deal of attention. However, there does not seem to exist any established approach for variable selection with respect to panel count data. For the problem, we adopt the idea behind the non‐concave penalized likelihood approach and develop a non‐concave penalized estimating function approach. The proposed methodology selects variables and estimates regression coefficients simultaneously, and an algorithm is presented for this process. We show that the proposed procedure performs as well as the oracle procedure in that it yields the estimates as if the correct submodel were known. Simulation studies are conducted for assessing the performance of the proposed approach and suggest that it works well for practical situations. An illustrative example from a cancer study is provided.
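
The abstract does not say which non-concave penalty is used. As an illustration only, the sketch below assumes the SCAD penalty of Fan and Li (2001), which appears in the reference list, and evaluates its derivative. The function name `scad_derivative` and the default a = 3.7 (the value suggested by Fan and Li) are choices made for this example, not details taken from the paper.

```python
def scad_derivative(theta, lam, a=3.7):
    """Derivative of the SCAD penalty at |theta| (Fan and Li, 2001).

    Assumed form, for |theta| = t:
        p'(t) = lam                        if t <= lam
        p'(t) = max(a*lam - t, 0) / (a-1)  if t >  lam
    """
    if lam <= 0:
        raise ValueError("lam must be positive")
    if a <= 2:
        raise ValueError("a must be greater than 2")
    t = abs(theta)
    if t <= lam:
        # Small coefficients are penalized at the constant rate lam.
        return lam
    # The penalty rate decays linearly and is zero once t >= a * lam.
    return max(a * lam - t, 0.0) / (a - 1.0)


if __name__ == "__main__":
    # Hypothetical coefficient magnitudes, chosen only for illustration.
    for coef in (0.2, 1.0, 2.0, 5.0):
        print(coef, scad_derivative(coef, lam=1.0))
```

Under this assumed penalty, coefficients no larger than lam in magnitude are penalized at the constant rate lam, while the penalty rate vanishes once the magnitude reaches a*lam, so large coefficients are estimated with essentially no shrinkage.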

Suggested Citation

  • Xingwei Tong & Xin He & Liuquan Sun & Jianguo Sun, 2009. "Variable Selection for Panel Count Data via Non‐Concave Penalized Estimating Function," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 36(4), pages 620-635, December.
  • Handle: RePEc:bla:scjsta:v:36:y:2009:i:4:p:620-635
    DOI: 10.1111/j.1467-9469.2009.00658.x

    Download full text from publisher

    File URL: https://doi.org/10.1111/j.1467-9469.2009.00658.x
    Download Restriction: no

    References listed on IDEAS

    1. Chiung-Yu Huang & Mei-Cheng Wang & Ying Zhang, 2006. "Analysing panel count data with informative observation times," Biometrika, Biometrika Trust, vol. 93(4), pages 763-775, December.
    2. Zou, Hui, 2006. "The Adaptive Lasso and Its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 1418-1429, December.
    3. Jianguo Sun, 2003. "A nonparametric test for panel count data," Biometrika, Biometrika Trust, vol. 90(1), pages 199-208, March.
    4. Jianguo Sun & Xingwei Tong & Xin He, 2007. "Regression Analysis of Panel Count Data with Dependent Observation Times," Biometrics, The International Biometric Society, vol. 63(4), pages 1053-1059, December.
    5. Johnson, Brent A. & Lin, D.Y. & Zeng, Donglin, 2008. "Penalized Estimating Functions and Variable Selection in Semiparametric Regression Models," Journal of the American Statistical Association, American Statistical Association, vol. 103, pages 672-680, June.
    6. Ying Zhang, 2002. "A semiparametric pseudolikelihood estimation method for panel count data," Biometrika, Biometrika Trust, vol. 89(1), pages 39-48, March.
    7. Brent A. Johnson, 2008. "Variable selection in semiparametric linear regression with censored data," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 70(2), pages 351-370, April.
    8. Jianwen Cai & Jianqing Fan & Runze Li & Haibo Zhou, 2005. "Variable selection for multivariate failure time data," Biometrika, Biometrika Trust, vol. 92(2), pages 303-316, June.
    9. Jianqing Fan & Runze Li, 2004. "New Estimation and Model Selection Procedures for Semiparametric Modeling in Longitudinal Data Analysis," Journal of the American Statistical Association, American Statistical Association, vol. 99, pages 710-723, January.
    10. Wang, Hansheng & Leng, Chenlei, 2007. "Unified LASSO Estimation by Least Squares Approximation," Journal of the American Statistical Association, American Statistical Association, vol. 102, pages 1039-1048, September.
    11. J. Sun & L. J. Wei, 2000. "Regression analysis of panel count data with covariate‐dependent observation and censoring times," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 62(2), pages 293-302.
    12. Fan J. & Li R., 2001. "Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 1348-1360, December.
    13. Hansheng Wang & Runze Li & Chih-Ling Tsai, 2007. "Tuning parameter selectors for the smoothly clipped absolute deviation method," Biometrika, Biometrika Trust, vol. 94(3), pages 553-568.
    14. Wenjiang J. Fu, 2003. "Penalized Estimating Equations," Biometrics, The International Biometric Society, vol. 59(1), pages 126-132, March.
    15. Annie Qu & Runze Li, 2006. "Quadratic Inference Functions for Varying-Coefficient Models with Longitudinal Data," Biometrics, The International Biometric Society, vol. 62(2), pages 379-391, June.
    16. Sijian Wang & Bin Nan & Ji Zhu & David G. Beer, 2008. "Doubly Penalized Buckley–James Method for Survival Data with High-Dimensional Covariates," Biometrics, The International Biometric Society, vol. 64(1), pages 132-140, March.

    Citations

    Cited by:

    1. Wang, Weiwei & Wu, Xianyi & Zhao, Xiaobing & Zhou, Xian, 2018. "Robust variable selection of joint frailty model for panel count data," Journal of Multivariate Analysis, Elsevier, vol. 167(C), pages 60-78.
    2. Haiying Wang & Yang Li & Jianguo Sun, 2015. "Focused and Model Average Estimation for Regression Analysis of Panel Count Data," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 42(3), pages 732-745, September.
    3. Yang Li & Xin He & Haiying Wang & Jianguo Sun, 2016. "Regression analysis of longitudinal data with correlated censoring and observation times," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 22(3), pages 343-362, July.
    4. Zhao, Hui & Sun, Dayu & Li, Gang & Sun, Jianguo, 2019. "Simultaneous estimation and variable selection for incomplete event history studies," Journal of Multivariate Analysis, Elsevier, vol. 171(C), pages 350-361.
    5. Jie Zhou & Haixiang Zhang & Liuquan Sun & Jianguo Sun, 2017. "Joint analysis of panel count data with an informative observation process and a dependent terminal event," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 23(4), pages 560-584, October.
    6. Xin He, 2011. "Comments on: Nonparametric inference based on panel count data," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 20(1), pages 46-47, May.
    7. Jie Zhou & Haixiang Zhang & Liuquan Sun & Jianguo Sun, 0. "Joint analysis of panel count data with an informative observation process and a dependent terminal event," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 0, pages 1-25.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Joseph G. Ibrahim & Hongtu Zhu & Ramon I. Garcia & Ruixin Guo, 2011. "Fixed and Random Effects Selection in Mixed Effects Models," Biometrics, The International Biometric Society, vol. 67(2), pages 495-503, June.
    2. Jie Zhou & Haixiang Zhang & Liuquan Sun & Jianguo Sun, 0. "Joint analysis of panel count data with an informative observation process and a dependent terminal event," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 0, pages 1-25.
    3. Xin He & Xuenan Feng & Xingwei Tong & Xingqiu Zhao, 0. "Semiparametric partially linear varying coefficient models with panel count data," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 0, pages 1-28.
    4. Zhangong Zhou & Rong Jiang & Weimin Qian, 2013. "LAD variable selection for linear models with randomly censored data," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 76(2), pages 287-300, February.
    5. Jie Zhou & Haixiang Zhang & Liuquan Sun & Jianguo Sun, 2017. "Joint analysis of panel count data with an informative observation process and a dependent terminal event," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 23(4), pages 560-584, October.
    6. Xin He & Xuenan Feng & Xingwei Tong & Xingqiu Zhao, 2017. "Semiparametric partially linear varying coefficient models with panel count data," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 23(3), pages 439-466, July.
    7. Pötscher, Benedikt M., 2007. "Confidence Sets Based on Sparse Estimators Are Necessarily Large," MPRA Paper 5677, University Library of Munich, Germany.
    8. Peng, Heng & Lu, Ying, 2012. "Model selection in linear mixed effect models," Journal of Multivariate Analysis, Elsevier, vol. 109(C), pages 109-129.
    9. Matsui, Hidetoshi & Konishi, Sadanori, 2011. "Variable selection for functional regression models via the L1 regularization," Computational Statistics & Data Analysis, Elsevier, vol. 55(12), pages 3304-3310, December.
    10. Li-Ping Zhu & Lin-Yi Qian & Jin-Guan Lin, 2011. "Variable selection in a class of single-index models," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 63(6), pages 1277-1293, December.
    11. Yongjin Li & Qingzhao Zhang & Qihua Wang, 2017. "Penalized estimation equation for an extended single-index model," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 69(1), pages 169-187, February.
    12. Wang, Li & Wang, Suojin & Wang, Guannan, 2014. "Variable selection and estimation for longitudinal survey data," Journal of Multivariate Analysis, Elsevier, vol. 130(C), pages 409-424.
    13. Zhao, Xingqiu & Tong, Xingwei, 2011. "Semiparametric regression analysis of panel count data with informative observation times," Computational Statistics & Data Analysis, Elsevier, vol. 55(1), pages 291-300, January.
    14. Zhao, Xingqiu & Tong, Xingwei & Sun, Jianguo, 2013. "Robust estimation for panel count data with informative observation times," Computational Statistics & Data Analysis, Elsevier, vol. 57(1), pages 33-40.
    15. Guang Cheng & Hao Zhang & Zuofeng Shang, 2015. "Sparse and efficient estimation for partial spline models with increasing dimension," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 67(1), pages 93-127, February.
    16. Lee, Eun Ryung & Park, Byeong U., 2012. "Sparse estimation in functional linear regression," Journal of Multivariate Analysis, Elsevier, vol. 105(1), pages 1-17.
    17. Feng, Sanying & He, Wenqi & Li, Feng, 2020. "Model detection and estimation for varying coefficient panel data models with fixed effects," Computational Statistics & Data Analysis, Elsevier, vol. 152(C).
    18. Gang Cheng & Ying Zhang & Liqiang Lu, 2011. "Efficient algorithms for computing the non and semi-parametric maximum likelihood estimates with panel count data," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 23(2), pages 567-579.
    19. Ramon I. Garcia & Joseph G. Ibrahim & Hongtu Zhu, 2010. "Variable Selection in the Cox Regression Model with Covariates Missing at Random," Biometrics, The International Biometric Society, vol. 66(1), pages 97-104, March.
    20. Ma, Shujie & Liang, Hua & Tsai, Chih-Ling, 2014. "Partially linear single index models for repeated measurements," Journal of Multivariate Analysis, Elsevier, vol. 130(C), pages 354-375.
