IDEAS home Printed from https://ideas.repec.org/a/spr/testjl/v25y2016i4d10.1007_s11749-016-0480-8.html
   My bibliography  Save this article

Statistical inference for generalized additive models: simultaneous confidence corridors and variable selection

Author

Listed:
  • Shuzhuan Zheng

    (Soochow University
    Columbia University)

  • Rong Liu

    (University of Toledo)

  • Lijian Yang

    (Tsinghua University)

  • Wolfgang K. Härdle

    (Humboldt-Universität zu Berlin
    Singapore Management University)

Abstract

In spite of widespread use of generalized additive models (GAMs) to remedy the “curse of dimensionality”, there is no well-grounded methodology developed for simultaneous inference and variable selection for GAM in existing literature. However, both are essential in enhancing the capability of statistical models. To this end, we establish simultaneous confidence corridors (SCCs) and a type of Bayesian information criterion (BIC) through the spline-backfitted kernel smoothing techniques proposed in recent articles. To characterize the global features of each non-parametric components, SCCs are constructed for testing their overall trends and entire shapes. By extending the BIC in additive models with identity/trivial link, an asymptotically consistent BIC approach for variable selection is built up in GAM to improve the parsimony of model without loss of prediction accuracy. Simulations and a real example corroborate the above findings.

Suggested Citation

  • Shuzhuan Zheng & Rong Liu & Lijian Yang & Wolfgang K. Härdle, 2016. "Statistical inference for generalized additive models: simultaneous confidence corridors and variable selection," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 25(4), pages 607-626, December.
  • Handle: RePEc:spr:testjl:v:25:y:2016:i:4:d:10.1007_s11749-016-0480-8
    DOI: 10.1007/s11749-016-0480-8
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11749-016-0480-8
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11749-016-0480-8?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Manuel Wiesenfarth & Tatyana Krivobokova & Stephan Klasen & Stefan Sperlich, 2012. "Direct Simultaneous Inference in Additive Models and Its Application to Model Undernutrition," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 107(500), pages 1286-1296, December.
    2. Rong Liu & Lijian Yang & Wolfgang K. Härdle, 2013. "Oracally Efficient Two-Step Estimation of Generalized Additive Model," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 108(502), pages 619-631, June.
    3. Wang, Lifeng & Li, Hongzhe & Huang, Jianhua Z., 2008. "Variable Selection in Nonparametric Varying-Coefficient Models for Analysis of Repeated Measurements," Journal of the American Statistical Association, American Statistical Association, vol. 103(484), pages 1556-1569.
    4. Jianqing Fan & Wenyang Zhang, 2000. "Simultaneous Confidence Bands and Hypothesis Testing in Varying‐coefficient Models," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 27(4), pages 715-731, December.
    5. He, Xuming & Fung, Wing K. & Zhu, Zhongyi, 2005. "Robust Estimation in Generalized Partial Linear Models for Clustered Data," Journal of the American Statistical Association, American Statistical Association, vol. 100, pages 1176-1184, December.
    6. Daniel Berg, 2007. "Bankruptcy prediction by generalized additive models," Applied Stochastic Models in Business and Industry, John Wiley & Sons, vol. 23(2), pages 129-143, March.
    7. Marc Ryser & Stefan Denzler, 2009. "Selecting credit rating models: a cross-validation-based comparison of discriminatory power," Financial Markets and Portfolio Management, Springer;Swiss Society for Financial Market Research, vol. 23(2), pages 187-203, June.
    8. Li Cai & Lijian Yang, 2015. "A smooth simultaneous confidence band for conditional variance function," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 24(3), pages 632-655, September.
    9. Lijie Gu & Li Wang & Wolfgang Härdle & Lijian Yang, 2014. "A simultaneous confidence corridor for varying coefficient regression with sparse functional data," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 23(4), pages 806-843, December.
    10. Härdle, Wolfgang, 1989. "Asymptotic maximal deviation of M-smoothers," Journal of Multivariate Analysis, Elsevier, vol. 29(2), pages 163-179, May.
    11. Shuzhuan Zheng & Lijian Yang & Wolfgang K. Härdle, 2014. "A Smooth Simultaneous Confidence Corridor for the Mean of Sparse Functional Data," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 109(506), pages 661-673, June.
    12. Wei Biao Wu & Zhibiao Zhao, 2007. "Inference of trends in time series," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 69(3), pages 391-410, June.
    13. Shin, Yoon S. & Moore, William T., 2003. "Explaining credit rating differences between Japanese and U.S. agencies," Review of Financial Economics, Elsevier, vol. 12(4), pages 327-344.
    14. Liu, Rong & Yang, Lijian, 2010. "Spline-Backfitted Kernel Smoothing Of Additive Coefficient Model," Econometric Theory, Cambridge University Press, vol. 26(1), pages 29-59, February.
    15. Xuming He, 2002. "Estimation in a semiparametric model for longitudinal data with unspecified dependence structure," Biometrika, Biometrika Trust, vol. 89(3), pages 579-590, August.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Jie Li & Jiangyan Wang & Lijian Yang, 2022. "Kolmogorov–Smirnov simultaneous confidence bands for time series distribution function," Computational Statistics, Springer, vol. 37(3), pages 1015-1039, July.
    2. Li Cai & Lisha Li & Simin Huang & Liang Ma & Lijian Yang, 2020. "Oracally efficient estimation for dense functional data with holiday effects," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 29(1), pages 282-306, March.
    3. Gu, Lijie & Wang, Suojin & Yang, Lijian, 2021. "Smooth simultaneous confidence band for the error distribution function in nonparametric regression," Computational Statistics & Data Analysis, Elsevier, vol. 155(C).
    4. Wang, Jiangyan & Gu, Lijie & Yang, Lijian, 2022. "Oracle-efficient estimation for functional data error distribution with simultaneous confidence band," Computational Statistics & Data Analysis, Elsevier, vol. 167(C).
    5. Li Cai & Lijie Gu & Qihua Wang & Suojin Wang, 2021. "Simultaneous confidence bands for nonparametric regression with missing covariate data," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 73(6), pages 1249-1279, December.
    6. Li, Xinyi & Wang, Li & Nettleton, Dan, 2019. "Sparse model identification and learning for ultra-high-dimensional additive partially linear models," Journal of Multivariate Analysis, Elsevier, vol. 173(C), pages 204-228.
    7. Yuanyuan Zhang & Lijian Yang, 2018. "A smooth simultaneous confidence band for correlation curve," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 27(2), pages 247-269, June.
    8. Rong Liu & Yichuan Zhao, 2021. "Empirical likelihood inference for generalized additive partially linear models," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 30(3), pages 569-585, September.
    9. Chen Zhong & Lijian Yang, 2021. "Simultaneous confidence bands for comparing variance functions of two samples based on deterministic designs," Computational Statistics, Springer, vol. 36(2), pages 1197-1218, June.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Shuzhuan Zheng & Rong Liu & Lijian Yang & Wolfgang Karl Härdle, 2014. "Simultaneous Confidence Corridors and Variable Selection for Generalized Additive Models," SFB 649 Discussion Papers SFB649DP2014-008, Sonderforschungsbereich 649, Humboldt University, Berlin, Germany.
    2. Li Cai & Lisha Li & Simin Huang & Liang Ma & Lijian Yang, 2020. "Oracally efficient estimation for dense functional data with holiday effects," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 29(1), pages 282-306, March.
    3. Yuanyuan Zhang & Lijian Yang, 2018. "A smooth simultaneous confidence band for correlation curve," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 27(2), pages 247-269, June.
    4. Li Cai & Lijie Gu & Qihua Wang & Suojin Wang, 2021. "Simultaneous confidence bands for nonparametric regression with missing covariate data," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 73(6), pages 1249-1279, December.
    5. Lijie Gu & Suojin Wang & Lijian Yang, 2019. "Simultaneous confidence bands for the distribution function of a finite population in stratified sampling," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 71(4), pages 983-1005, August.
    6. Li Cai & Lijian Yang, 2015. "A smooth simultaneous confidence band for conditional variance function," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 24(3), pages 632-655, September.
    7. Jie Li & Jiangyan Wang & Lijian Yang, 2022. "Kolmogorov–Smirnov simultaneous confidence bands for time series distribution function," Computational Statistics, Springer, vol. 37(3), pages 1015-1039, July.
    8. Jiangyan Wang & Suojin Wang & Lijian Yang, 2016. "Simultaneous confidence bands for the distribution function of a finite population and of its superpopulation," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 25(4), pages 692-709, December.
    9. Gu, Lijie & Wang, Suojin & Yang, Lijian, 2021. "Smooth simultaneous confidence band for the error distribution function in nonparametric regression," Computational Statistics & Data Analysis, Elsevier, vol. 155(C).
    10. Zhang, Ting, 2015. "Semiparametric model building for regression models with time-varying parameters," Journal of Econometrics, Elsevier, vol. 187(1), pages 189-200.
    11. Lijie Gu & Li Wang & Wolfgang Härdle & Lijian Yang, 2014. "A simultaneous confidence corridor for varying coefficient regression with sparse functional data," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 23(4), pages 806-843, December.
    12. Rui Li & Chenlei Leng & Jinhong You, 2017. "A Semiparametric Regression Model for Longitudinal Data with Non-stationary Errors," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 44(4), pages 932-950, December.
    13. Tian, Ruiqin & Xue, Liugen & Liu, Chunling, 2014. "Penalized quadratic inference functions for semiparametric varying coefficient partially linear models with longitudinal data," Journal of Multivariate Analysis, Elsevier, vol. 132(C), pages 94-110.
    14. Wang, Jiangyan & Gu, Lijie & Yang, Lijian, 2022. "Oracle-efficient estimation for functional data error distribution with simultaneous confidence band," Computational Statistics & Data Analysis, Elsevier, vol. 167(C).
    15. Chen Zhong & Lijian Yang, 2021. "Simultaneous confidence bands for comparing variance functions of two samples based on deterministic designs," Computational Statistics, Springer, vol. 36(2), pages 1197-1218, June.
    16. Hu, Lixia & Huang, Tao & You, Jinhong, 2019. "Two-step estimation of time-varying additive model for locally stationary time series," Computational Statistics & Data Analysis, Elsevier, vol. 130(C), pages 94-110.
    17. Kim, Kun Ho & Chao, Shih-Kang & Härdle, Wolfgang Karl, 2020. "Simultaneous Inference of the Partially Linear Model with a Multivariate Unknown Function," IRTG 1792 Discussion Papers 2020-008, Humboldt University of Berlin, International Research Training Group 1792 "High Dimensional Nonstationary Time Series".
    18. Hassan Sharghi Ghale-Joogh & S. Mohammad E. Hosseini-Nasab, 2021. "On mean derivative estimation of longitudinal and functional data: from sparse to dense," Statistical Papers, Springer, vol. 62(4), pages 2047-2066, August.
    19. K. De Brabanter & Y. Liu & C. Hua, 2016. "Convergence rates for uniform confidence intervals based on local polynomial regression estimators," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 28(1), pages 31-48, March.
    20. Ngai Hang Chan & Linhao Gao & Wilfredo Palma, 2022. "Simultaneous variable selection and structural identification for time‐varying coefficient models," Journal of Time Series Analysis, Wiley Blackwell, vol. 43(4), pages 511-531, July.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:testjl:v:25:y:2016:i:4:d:10.1007_s11749-016-0480-8. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.