IDEAS home Printed from https://ideas.repec.org/p/inn/wpaper/2018-16.html
   My bibliography  Save this paper

LASSO-Type Penalization in the Framework of Generalized Additive Models for Location, Scale and Shape

Author

Listed:
  • Andreas Groll
  • Julien Hambuckers
  • Thomas Kneib
  • Nikolaus Umlauf

Abstract

For numerous applications it is of interest to provide full probabilistic forecasts, which are able to assign probabilities to each predicted outcome. Therefore, attention is shifting constantly from conditional mean models to probabilistic distributional models capturing location, scale, shape (and other aspects) of the response distribution. One of the most established models for distributional regression is the generalized additive model for location, scale and shape (GAMLSS). In high dimensional data set-ups classical fitting procedures for the GAMLSS often become rather unstable and methods for variable selection are desirable. Therefore, we propose a regularization approach for high dimensional data set-ups in the framework for GAMLSS. It is designed for linear covariate effects and is based on L1 -type penalties. The following three penalization options are provided: the conventional least absolute shrinkage and selection operator (LASSO) for metric covariates, and both group and fused LASSO for categorical predictors. The methods are investigated both for simulated data and for two real data examples, namely Munich rent data and data on extreme operational losses from the Italian bank UniCredit.

Suggested Citation

  • Andreas Groll & Julien Hambuckers & Thomas Kneib & Nikolaus Umlauf, 2018. "LASSO-Type Penalization in the Framework of Generalized Additive Models for Location, Scale and Shape," Working Papers 2018-16, Faculty of Economics and Statistics, Universität Innsbruck.
  • Handle: RePEc:inn:wpaper:2018-16
    as

    Download full text from publisher

    File URL: https://www2.uibk.ac.at/downloads/c4041030/wpaper/2018-16.pdf
    Download Restriction: no
    ---><---

    Other versions of this item:

    References listed on IDEAS

    as
    1. Julien Hambuckers & Andreas Groll & Thomas Kneib, 2018. "Understanding the economic determinants of the severity of operational losses: A regularized generalized Pareto regression approach," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 33(6), pages 898-935, September.
    2. Thomas Kneib & Susanne Konrath & Ludwig Fahrmeir, 2011. "High dimensional structured additive regression models: Bayesian regularization, smoothing and predictive performance," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 60(1), pages 51-70, January.
    3. Stasinopoulos, D. Mikis & Rigby, Robert A., 2007. "Generalized Additive Models for Location Scale and Shape (GAMLSS) in R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 23(i07).
    4. Lukas Meier & Sara Van De Geer & Peter Bühlmann, 2008. "The group lasso for logistic regression," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 70(1), pages 53-71, February.
    5. Nikolaus Umlauf & Nadja Klein & Achim Zeileis, 2017. "BAMLSS: Bayesian Additive Models for Location, Scale and Shape (and Beyond)," Working Papers 2017-05, Faculty of Economics and Statistics, Universität Innsbruck.
    6. Chapelle, Ariane & Crama, Yves & Hübner, Georges & Peters, Jean-Philippe, 2008. "Practical methods for measuring and managing operational risk in the financial sector: A clinical study," Journal of Banking & Finance, Elsevier, vol. 32(6), pages 1049-1061, June.
    7. Andreas Mayr & Nora Fenske & Benjamin Hofner & Thomas Kneib & Matthias Schmid, 2012. "Generalized additive models for location, scale and shape for high dimensional data—a flexible approach based on boosting," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 61(3), pages 403-427, May.
    8. Howard D. Bondell & Brian J. Reich, 2009. "Simultaneous Factor Selection and Collapsing Levels in ANOVA," Biometrics, The International Biometric Society, vol. 65(1), pages 169-177, March.
    9. Valérie Chavez-Demoulin & Paul Embrechts & Marius Hofert, 2016. "An Extreme Value Approach for Modeling Operational Risk Losses Depending on Covariates," Journal of Risk & Insurance, The American Risk and Insurance Association, vol. 83(3), pages 735-776, September.
    10. R. A. Rigby & D. M. Stasinopoulos, 2005. "Generalized additive models for location, scale and shape," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 54(3), pages 507-554, June.
    11. Ming Yuan & Yi Lin, 2006. "Model selection and estimation in regression with grouped variables," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 68(1), pages 49-67, February.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Hendrik van der Wurp & Andreas Groll, 2023. "Introducing LASSO-type penalisation to generalised joint regression modelling for count data," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 107(1), pages 127-151, March.
    2. Kneib, Thomas & Silbersdorff, Alexander & Säfken, Benjamin, 2023. "Rage Against the Mean – A Review of Distributional Regression Approaches," Econometrics and Statistics, Elsevier, vol. 26(C), pages 99-123.
    3. Marco Bee & Julien Hambuckers & Flavio Santi & Luca Trapin, 2021. "Testing a parameter restriction on the boundary for the g-and-h distribution: a simulated approach," Computational Statistics, Springer, vol. 36(3), pages 2177-2200, September.
    4. M. Carvalho & S. Pereira & P. Pereira & P. Zea Bermudez, 2022. "An Extreme Value Bayesian Lasso for the Conditional Left and Right Tails," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 27(2), pages 222-239, June.
    5. Linda Mhalla & Julien Hambuckers & Marie Lambert, 2022. "Extremal connectedness of hedge funds," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 37(5), pages 988-1009, August.
    6. Amon, Julian & Hornik, Kurt, 2022. "Is it all bafflegab? – Linguistic and meta characteristics of research articles in prestigious economics journals," Journal of Informetrics, Elsevier, vol. 16(2).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Bernardi, Mauro & Bottone, Marco & Petrella, Lea, 2018. "Bayesian quantile regression using the skew exponential power distribution," Computational Statistics & Data Analysis, Elsevier, vol. 126(C), pages 92-111.
    2. Matteo Malavasi & Gareth W. Peters & Pavel V. Shevchenko & Stefan Truck & Jiwook Jang & Georgy Sofronov, 2021. "Cyber Risk Frequency, Severity and Insurance Viability," Papers 2111.03366, arXiv.org, revised Mar 2022.
    3. Gerhard Tutz & Jan Gertheiss, 2014. "Rating Scales as Predictors—The Old Question of Scale Level and Some Answers," Psychometrika, Springer;The Psychometric Society, vol. 79(3), pages 357-376, July.
    4. Zhao, Weihua & Lian, Heng & Song, Xinyuan, 2017. "Composite quantile regression for correlated data," Computational Statistics & Data Analysis, Elsevier, vol. 109(C), pages 15-33.
    5. Fabian Scheipl & Thomas Kneib & Ludwig Fahrmeir, 2013. "Penalized likelihood and Bayesian function selection in regression models," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 97(4), pages 349-385, October.
    6. Gilbert, Ciaran & Browell, Jethro & McMillan, David, 2021. "Probabilistic access forecasting for improved offshore operations," International Journal of Forecasting, Elsevier, vol. 37(1), pages 134-150.
    7. Simon N. Wood, 2020. "Inference and computation with generalized additive models and their extensions," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 29(2), pages 307-339, June.
    8. Henning Schaak & Oliver Musshoff, 2022. "The distribution of the rent–price relationship of agricultural land in Germany [An analysis of growth of U.S. farmland prices, 1963–82]," European Review of Agricultural Economics, Oxford University Press and the European Agricultural and Applied Economics Publications Foundation, vol. 49(3), pages 696-718.
    9. Maike Hohberg & Peter Pütz & Thomas Kneib, 2020. "Treatment effects beyond the mean using distributional regression: Methods and guidance," PLOS ONE, Public Library of Science, vol. 15(2), pages 1-29, February.
    10. Julien Hambuckers & Marie Kratz & Antoine Usseglio-Carleve, 2023. "Efficient Estimation In Extreme Value Regression Models Of Hedge Fund Tail Risks," Working Papers hal-04090916, HAL.
    11. Kneib, Thomas & Silbersdorff, Alexander & Säfken, Benjamin, 2023. "Rage Against the Mean – A Review of Distributional Regression Approaches," Econometrics and Statistics, Elsevier, vol. 26(C), pages 99-123.
    12. Hendrik van der Wurp & Andreas Groll, 2023. "Introducing LASSO-type penalisation to generalised joint regression modelling for count data," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 107(1), pages 127-151, March.
    13. Boyao Zhang & Tobias Hepp & Sonja Greven & Elisabeth Bergherr, 2022. "Adaptive step-length selection in gradient boosting for Gaussian location and scale models," Computational Statistics, Springer, vol. 37(5), pages 2295-2332, November.
    14. Malavasi, Matteo & Peters, Gareth W. & Shevchenko, Pavel V. & Trück, Stefan & Jang, Jiwook & Sofronov, Georgy, 2022. "Cyber risk frequency, severity and insurance viability," Insurance: Mathematics and Economics, Elsevier, vol. 106(C), pages 90-114.
    15. Julien Hambuckers & Marie Kratz & Antoine Usseglio-Carleve, 2023. "Efficient Estimation in Extreme Value Regression Models of Hedge Fund Tail Risks," Papers 2304.06950, arXiv.org.
    16. Tutz, Gerhard & Pößnecker, Wolfgang & Uhlmann, Lorenz, 2015. "Variable selection in general multinomial logit models," Computational Statistics & Data Analysis, Elsevier, vol. 82(C), pages 207-222.
    17. Yixuan Wang & Jianzhu Li & Ping Feng & Rong Hu, 2015. "A Time-Dependent Drought Index for Non-Stationary Precipitation Series," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 29(15), pages 5631-5647, December.
    18. Panayi, Efstathios & Peters, Gareth W. & Danielsson, Jon & Zigrand, Jean-Pierre, 2018. "Designating market maker behaviour in limit order book markets," Econometrics and Statistics, Elsevier, vol. 5(C), pages 20-44.
    19. Gauss Cordeiro & Josemar Rodrigues & Mário Castro, 2012. "The exponential COM-Poisson distribution," Statistical Papers, Springer, vol. 53(3), pages 653-664, August.
    20. Christian Kleiber & Achim Zeileis, 2016. "Visualizing Count Data Regressions Using Rootograms," The American Statistician, Taylor & Francis Journals, vol. 70(3), pages 296-303, July.

    More about this item

    Keywords

    GAMLSS; distributional regression; model selection; LASSO; fused LASSO;
    All these keywords.

    JEL classification:

    • C13 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Estimation: General
    • C15 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Statistical Simulation Methods: General
    • C18 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Methodolical Issues: General

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:inn:wpaper:2018-16. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Janette Walde (email available below). General contact details of provider: https://edirc.repec.org/data/fuibkat.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.