Author
Listed:
- Hepp Tobias
(Institut für medizinische Biometrie, Informatik und Epidemiologie, Medizinische Fakultät, Rheinische Friedrich-Wilhelms-Universität Bonn, Bonn, Germany)
- Schmid Matthias
(Institut für medizinische Biometrie, Informatik und Epidemiologie, Medizinische Fakultät, Rheinische Friedrich-Wilhelms-Universität Bonn, Bonn, Germany)
- Mayr Andreas
(Institut für medizinische Biometrie, Informatik und Epidemiologie, Medizinische Fakultät, Rheinische Friedrich-Wilhelms-Universität Bonn, Bonn, Germany)
Abstract
Generalized additive models for location scale and shape (GAMLSS) offer very flexible solutions to a wide range of statistical analysis problems, but can be challenging in terms of proper model specification. This complex task can be simplified using regularization techniques such as gradient boosting algorithms, but the estimates derived from such models are shrunken towards zero and it is consequently not straightforward to calculate proper confidence intervals or test statistics. In this article, we propose two strategies to obtain p-values for linear effect estimates for Gaussian location and scale models based on permutation tests and a parametric bootstrap approach. These procedures can provide a solution for one of the remaining problems in the application of gradient boosting algorithms for distributional regression in biostatistical data analyses. Results from extensive simulations indicate that in low-dimensional data both suggested approaches are able to hold the type-I error threshold and provide reasonable test power comparable to the Wald-type test for maximum likelihood inference. In high-dimensional data, when gradient boosting is the only feasible inference for this model class, the power decreases but the type-I error is still under control. In addition, we demonstrate the application of both tests in an epidemiological study to analyse the impact of physical exercise on both average and the stability of the lung function of elderly people in Germany.
Suggested Citation
Hepp Tobias & Schmid Matthias & Mayr Andreas, 2019.
"Significance Tests for Boosted Location and Scale Models with Linear Base-Learners,"
The International Journal of Biostatistics, De Gruyter, vol. 15(1), pages 1-13, May.
Handle:
RePEc:bpj:ijbist:v:15:y:2019:i:1:p:13:n:7
DOI: 10.1515/ijb-2018-0110
Download full text from publisher
As the access to this document is restricted, you may want to search for a different version of it.
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bpj:ijbist:v:15:y:2019:i:1:p:13:n:7. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Peter Golla (email available below). General contact details of provider: https://www.degruyter.com .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.