IDEAS home Printed from https://ideas.repec.org/a/spr/compst/v40y2025i8d10.1007_s00180-025-01649-8.html

A balanced statistical boosting approach for GAMLSS via new step lengths

Author

Listed:
  • Alexandra Daub

    (University of Goettingen)

  • Andreas Mayr

    (University of Marburg)

  • Boyao Zhang

    (University of Goettingen)

  • Elisabeth Bergherr

    (University of Goettingen)

Abstract

Component-wise gradient boosting algorithms are popular for their intrinsic variable selection and implicit regularization, which can be especially beneficial for very flexible model classes. When estimating generalized additive models for location, scale and shape (GAMLSS) via a component-wise gradient boosting algorithm, an important part of the estimation procedure is to determine the relative complexity of the different submodels. Shrunk optimal step lengths have been suggested to replace small fixed step lengths for a non-cyclical boosting algorithm limited to a Gaussian response variable in order to achieve a similar degree of regularization in the submodels. In this article, we propose a new adaptive step length approach that accounts for the relative size of the fitted base-learners to ensure a natural balance between the different submodels. The new balanced boosting approach thus represents a computationally efficient and easily generalizable alternative to shrunk optimal step lengths. We implemented the balanced non-cyclical boosting algorithm for a Gaussian, a negative binomial as well as a Weibull distributed response variable and investigate the performance of the new approach in a simulation study, for count data of doctor’s visits as well as for survival data in an oncological trial. Both the simulation results and the applications show that the new approach yields similar results to shrunk optimal step lengths, especially with respect to the balance in the overall model. An improvement in the computational efficiency compared to numerically obtained shrunk optimal step lengths is especially evident for the Gaussian and negative binomial setting.

Suggested Citation

  • Alexandra Daub & Andreas Mayr & Boyao Zhang & Elisabeth Bergherr, 2025. "A balanced statistical boosting approach for GAMLSS via new step lengths," Computational Statistics, Springer, vol. 40(8), pages 4741-4773, November.
  • Handle: RePEc:spr:compst:v:40:y:2025:i:8:d:10.1007_s00180-025-01649-8
    DOI: 10.1007/s00180-025-01649-8
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s00180-025-01649-8
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s00180-025-01649-8?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. Nicole Ellenbach & Anne-Laure Boulesteix & Bernd Bischl & Kristian Unger & Roman Hornung, 2021. "Improved Outcome Prediction Across Data Sources Through Robust Parameter Tuning," Journal of Classification, Springer;The Classification Society, vol. 38(2), pages 212-231, July.
    2. Hofner, Benjamin & Mayr, Andreas & Schmid, Matthias, 2016. "gamboostLSS: An R Package for Model Building and Variable Selection in the GAMLSS Framework," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 74(i01).
    3. Karlis, Dimitris & Ntzoufras, Ioannis, 2005. "Bivariate Poisson and Diagonal Inflated Bivariate Poisson Regression Models in R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 14(i10).
    4. Boyao Zhang & Tobias Hepp & Sonja Greven & Elisabeth Bergherr, 2022. "Adaptive step-length selection in gradient boosting for Gaussian location and scale models," Computational Statistics, Springer, vol. 37(5), pages 2295-2332, November.
    5. A. C. Cameron & P. K. Trivedi & Frank Milne & J. Piggott, 1988. "A Microeconometric Model of the Demand for Health Care and Health Insurance in Australia," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 55(1), pages 85-106.
    6. R. A. Rigby & D. M. Stasinopoulos, 2005. "Generalized additive models for location, scale and shape," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 54(3), pages 507-554, June.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Chengguo Wu & Chengjie Ren & Juliang Jin & Yuliang Zhou & Boyu Nie & Xia Bai & Yi Cui & Fang Tong & Libing Zhang, 2026. "C-Vine Copulas Function and Conditional Quantile Regression Coupling Model for Agricultural Drought Prediction Analysis," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 40(2), pages 1-22, January.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Riccardo De Bin & Vegard Grødem Stikbakke, 2023. "A boosting first-hitting-time model for survival analysis in high-dimensional settings," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 29(2), pages 420-440, April.
    2. Clara Bertinelli Salucci & Azzeddine Bakdi & Ingrid Kristine Glad & Bo Henry Lindqvist & Erik Vanem & Riccardo De Bin, 2025. "Lifetime analysis with monotonic degradation: a boosted first hitting time model based on a homogeneous gamma process," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 31(2), pages 300-339, April.
    3. Michael Balzer & Elisabeth Bergherr & Swen Hutter & Tobias Hepp, 2026. "Gradient boosting for Dirichlet regression models," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 110(1), pages 149-189, March.
    4. Gilbert, Ciaran & Browell, Jethro & McMillan, David, 2021. "Probabilistic access forecasting for improved offshore operations," International Journal of Forecasting, Elsevier, vol. 37(1), pages 134-150.
    5. Boyao Zhang & Tobias Hepp & Sonja Greven & Elisabeth Bergherr, 2022. "Adaptive step-length selection in gradient boosting for Gaussian location and scale models," Computational Statistics, Springer, vol. 37(5), pages 2295-2332, November.
    6. Lisa Schlosser & Torsten Hothorn & Reto Stauffer & Achim Zeileis, 2018. "Distributional regression forests for probabilistic precipitation forecasting in complex terrain," Working Papers 2018-08, Faculty of Economics and Statistics, Universität Innsbruck, revised Nov 2018.
    7. Adam, Timo & Mayr, Andreas & Kneib, Thomas, 2022. "Gradient boosting in Markov-switching generalized additive models for location, scale, and shape," Econometrics and Statistics, Elsevier, vol. 22(C), pages 3-16.
    8. Nicolai Hans & Nadja Klein & Florian Faschingbauer & Michael Schneider & Andreas Mayr, 2023. "Boosting distributional copula regression," Biometrics, The International Biometric Society, vol. 79(3), pages 2298-2310, September.
    9. Jing Dai & Stefan Sperlich & Walter Zucchini, 2011. "Estimating and predicting the distribution of the number of visits to the medical doctor," MAGKS Papers on Economics 201148, Philipps-Universität Marburg, Faculty of Business Administration and Economics, Department of Economics (Volkswirtschaftliche Abteilung).
    10. Maike Hohberg & Peter Pütz & Thomas Kneib, 2020. "Treatment effects beyond the mean using distributional regression: Methods and guidance," PLOS ONE, Public Library of Science, vol. 15(2), pages 1-29, February.
    11. Giampiero Marra & Rosalba Radice & David M. Zimmer, 2020. "Estimating the binary endogenous effect of insurance on doctor visits by copula‐based regression additive models," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 69(4), pages 953-971, August.
    12. Amon, Julian & Hornik, Kurt, 2022. "Is it all bafflegab? – Linguistic and meta characteristics of research articles in prestigious economics journals," Journal of Informetrics, Elsevier, vol. 16(2).
    13. Yixuan Wang & Jianzhu Li & Ping Feng & Rong Hu, 2015. "A Time-Dependent Drought Index for Non-Stationary Precipitation Series," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 29(15), pages 5631-5647, December.
    14. Tansel, Aysit & Keskin, Halil Ibrahim, 2017. "Education Effects on Days Hospitalized and Days out of Work by Gender: Evidence from Turkey," IZA Discussion Papers 11210, IZA Network @ LISER.
    15. Auteri, Monica & Cremaschini, Alessandro, 2024. "Ownership or procurement, which matters? exploring asymmetries in local public transportation in Italy through a semi-parametric approach," The Journal of Economic Asymmetries, Elsevier, vol. 30(C).
    16. Nathaniel Geiger & Bryan McLaughlin & John Velez, 2021. "Not all boomers: temporal orientation explains inter- and intra-cultural variability in the link between age and climate engagement," Climatic Change, Springer, vol. 166(1), pages 1-20, May.
    17. Trottmann, Maria & Zweifel, Peter & Beck, Konstantin, 2012. "Supply-side and demand-side cost sharing in deregulated social health insurance: Which is more effective?," Journal of Health Economics, Elsevier, vol. 31(1), pages 231-242.
    18. Gauss Cordeiro & Josemar Rodrigues & Mário Castro, 2012. "The exponential COM-Poisson distribution," Statistical Papers, Springer, vol. 53(3), pages 653-664, August.
    19. Wanting He & Wenyuan Li & Yunran Wei, 2025. "Periodic evaluation of defined-contribution pension fund: A dynamic risk measure approach," Papers 2508.05241, arXiv.org.
    20. Becker, Gary S. & Rubinstein, Yona, 2011. "Fear and the response to terrorism: an economic analysis," LSE Research Online Documents on Economics 121740, London School of Economics and Political Science, LSE Library.

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:compst:v:40:y:2025:i:8:d:10.1007_s00180-025-01649-8. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.