IDEAS home Printed from https://ideas.repec.org/p/hal/journl/hal-05539060.html

Closed-form estimators for multivariate regressions models -a single categorical variable approach

Author

Listed:
  • Antoine Burg

    (CEREMADE - CEntre de REcherches en MAthématiques de la DEcision - Université Paris Dauphine-PSL - PSL - Université Paris Sciences et Lettres - CNRS - Centre National de la Recherche Scientifique, SCOR SE [Paris])

  • Christophe Dutang

    (ASAR - Applied Statistics And Reliability - ASAR - LJK - Laboratoire Jean Kuntzmann - Inria - Institut National de Recherche en Informatique et en Automatique - CNRS - Centre National de la Recherche Scientifique - UGA - Université Grenoble Alpes - Grenoble INP - Institut polytechnique de Grenoble - Grenoble Institute of Technology - UGA - Université Grenoble Alpes)

Abstract

The maximum likelihood estimator (MLE) remains the most frequently used method to estimate the parameters of generalized linear models. But even for distributions within the exponential family, MLEs are not always tractable and need to be computed with time consuming numerical methods like the Iterative Weighted Least Square algorithm. In order to improve the computation time, closed-form estimators have been found in case of categorical explanatory variables for univariate random variables of one-parameter exponential type. In the context of multivariate generalized linear models (MGLM), we propose a new way to look at the score in case of single categorical variables for any distribution in the exponential family. Firstly, we derive closed-form MLE for MGLM assuming multinomial and negative multinomial distributions. Secondly, we deduce similar results for the multivariate normal distributions. For the Dirichlet distribution, we propose a closed-form estimator, yet not MLE, for which we prove the consistency. We illustrate the computation time gains on simulated datasets: closed-form estimators are about 1000 times faster, especially for high dimension. Closed-form estimators are computed in constant times.. Finally, we show the relevancy of the proposed estimator on real-world datasets by modeling cause-of-death mortality in US. We are able to catch the first-order effects of covid between 2019 and 2021.

Suggested Citation

  • Antoine Burg & Christophe Dutang, 2026. "Closed-form estimators for multivariate regressions models -a single categorical variable approach," Post-Print hal-05539060, HAL.
  • Handle: RePEc:hal:journl:hal-05539060
    DOI: 10.1007/s00180-025-01679-2
    Note: View the original document on HAL open archive server: https://hal.science/hal-05539060v1
    as

    Download full text from publisher

    File URL: https://hal.science/hal-05539060v1/document
    Download Restriction: no

    File URL: https://libkey.io/10.1007/s00180-025-01679-2?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Zinoviy Landsman & Emiliano A. Valdez, 2016. "The Tail Stein's Identity with Applications to Risk Measures," North American Actuarial Journal, Taylor & Francis Journals, vol. 20(4), pages 313-326, October.
    2. Silvia Ferrari & Francisco Cribari-Neto, 2004. "Beta Regression for Modelling Rates and Proportions," Journal of Applied Statistics, Taylor & Francis Journals, vol. 31(7), pages 799-815.
    3. Prem C. Consul & Felix Famoye, 2006. "Lagrangian Probability Distributions," Springer Books, Springer, number 978-0-8176-4477-2, December.
    4. Abdulaziz Alenazi, 2023. "A review of compositional data analysis and recent advances," Communications in Statistics - Theory and Methods, Taylor & Francis Journals, vol. 52(16), pages 5535-5567, August.
    5. de Jong,Piet & Heller,Gillian Z., 2008. "Generalized Linear Models for Insurance Data," Cambridge Books, Cambridge University Press, number 9780521879149, Enero-Abr.
    6. Grün, Bettina & Kosmidis, Ioannis & Zeileis, Achim, 2012. "Extended Beta Regression in R: Shaken, Stirred, Mixed, and Partitioned," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 48(i11).
    7. Monique Graf, 2020. "Regression for compositions based on a generalization of the Dirichlet distribution," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 29(4), pages 913-936, December.
    8. Alexandre Brouste & Christophe Dutang & Tom Rohmer, 2020. "Closed-form maximum likelihood estimator for generalized linear models in the case of categorical explanatory variables: application to insurance loss modeling," Computational Statistics, Springer, vol. 35(2), pages 689-724, June.
    9. José Jairo Santana-e-Silva & Francisco Cribari-Neto & Klaus L P Vasconcellos, 2022. "Beta distribution misspecification tests with application to Covid-19 mortality rates in the United States," PLOS ONE, Public Library of Science, vol. 17(9), pages 1-30, September.
    10. Jenni Niku & Wesley Brooks & Riki Herliansyah & Francis K C Hui & Sara Taskinen & David I Warton, 2019. "Efficient estimation of generalized linear latent variable models," PLOS ONE, Public Library of Science, vol. 14(5), pages 1-20, May.
    11. Gueorguieva, Ralitza & Rosenheck, Robert & Zelterman, Daniel, 2008. "Dirichlet component regression and its applications to psychiatric data," Computational Statistics & Data Analysis, Elsevier, vol. 52(12), pages 5344-5355, August.
    12. Kumar Kattumannil, Sudheesh, 2009. "On Stein's identity and its applications," Statistics & Probability Letters, Elsevier, vol. 79(12), pages 1444-1449, June.
    13. Jun Zhao & Yun-beom Lee & Hyoung-Moon Kim, 2025. "New and fast closed-form efficient estimators for the negative multinomial distribution," Communications in Statistics - Theory and Methods, Taylor & Francis Journals, vol. 54(20), pages 6684-6699, October.
    14. Patricia Espinheira & Silvia Ferrari & Francisco Cribari-Neto, 2008. "On beta regression residuals," Journal of Applied Statistics, Taylor & Francis Journals, vol. 35(4), pages 407-419.
    15. Ospina, Raydonal & Cribari-Neto, Francisco & Vasconcellos, Klaus L.P., 2006. "Improved point and interval estimation for a beta regression model," Computational Statistics & Data Analysis, Elsevier, vol. 51(2), pages 960-981, November.
    16. Tatiane F. N. Melo & Tiago M. Vargas & Artur J. Lemonte & Germán Moreno–Arenas, 2020. "On improved estimation in multivariate Dirichlet regressions," Communications in Statistics - Theory and Methods, Taylor & Francis Journals, vol. 49(23), pages 5765-5777, December.
    17. Simas, Alexandre B. & Barreto-Souza, Wagner & Rocha, Andréa V., 2010. "Improved estimators for a general class of beta regression models," Computational Statistics & Data Analysis, Elsevier, vol. 54(2), pages 348-366, February.
    18. Tsagris, Michail, 2015. "Regression analysis with compositional data containing zero values," MPRA Paper 67868, University Library of Munich, Germany.
    19. Ongaro, A. & Migliorati, S., 2013. "A generalization of the Dirichlet distribution," Journal of Multivariate Analysis, Elsevier, vol. 114(C), pages 412-426.
    20. Tsagris, Michail, 2015. "A novel, divergence based, regression for compositional data," MPRA Paper 72769, University Library of Munich, Germany.
    21. Wagner Hugo Bonat & Bent Jørgensen, 2016. "Multivariate covariance generalized linear models," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 65(5), pages 649-675, November.
    22. Zhi-Sheng Ye & Nan Chen, 2017. "Closed-Form Estimators for the Gamma Distribution Derived From Likelihood Equations," The American Statistician, Taylor & Francis Journals, vol. 71(2), pages 177-181, April.
    23. Mukhopadhyay, S. & Khuri, A.I., 2008. "Optimization in a multivariate generalized linear model situation," Computational Statistics & Data Analysis, Elsevier, vol. 52(10), pages 4625-4634, June.
    24. Alexandre Brouste & Christophe Dutang & Tom Rohmer, 2022. "A Closed-form Alternative Estimator for GLM with Categorical Explanatory Variables," Post-Print hal-03689206, HAL.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Antoine Burg & Christophe Dutang, 2026. "Closed-form estimators for multivariate regressions models: a single categorical variable approach," Computational Statistics, Springer, vol. 41(3), pages 1-23, April.
    2. Cristine Rauber & Francisco Cribari-Neto & Fábio M. Bayer, 2020. "Improved testing inferences for beta regressions with parametric mean link function," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 104(4), pages 687-717, December.
    3. Wagner Hugo Bonat & Paulo Justiniano Ribeiro & Walmes Marques Zeviani, 2015. "Likelihood analysis for a class of beta mixed models," Journal of Applied Statistics, Taylor & Francis Journals, vol. 42(2), pages 252-266, February.
    4. Weihua Zhao & Riquan Zhang & Yazhao Lv & Jicai Liu, 2014. "Variable selection for varying dispersion beta regression model," Journal of Applied Statistics, Taylor & Francis Journals, vol. 41(1), pages 95-108, January.
    5. Grün, Bettina & Kosmidis, Ioannis & Zeileis, Achim, 2012. "Extended Beta Regression in R: Shaken, Stirred, Mixed, and Partitioned," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 48(i11).
    6. Jay Verkuilen & Michael Smithson, 2012. "Mixed and Mixture Regression Models for Continuous Bounded Responses Using the Beta Distribution," Journal of Educational and Behavioral Statistics, , vol. 37(1), pages 82-113, February.
    7. Diego Ramos Canterle & Fábio Mariano Bayer, 2019. "Variable dispersion beta regressions with parametric link functions," Statistical Papers, Springer, vol. 60(5), pages 1541-1567, October.
    8. Francisco Cribari-Neto & Sadraque E.F. Lucena, 2015. "Nonnested hypothesis testing in the class of varying dispersion beta regressions," Journal of Applied Statistics, Taylor & Francis Journals, vol. 42(5), pages 967-985, May.
    9. Pablo Mitnik & Sunyoung Baek, 2013. "The Kumaraswamy distribution: median-dispersion re-parameterizations for regression modeling and simulation-based estimation," Statistical Papers, Springer, vol. 54(1), pages 177-192, February.
    10. Emilio Gómez-Déniz & Jorge V Pérez-Rodríguez & José Boza-Chirino, 2020. "Modelling tourist expenditure at origin and destination," Tourism Economics, , vol. 26(3), pages 437-460, May.
    11. Tariq Maqsood & Mark Edwards & Ioanna Ioannou & Ioannis Kosmidis & Tiziana Rossetto & Neil Corby, 2016. "Seismic vulnerability functions for Australian buildings by using GEM empirical vulnerability assessment guidelines," Natural Hazards: Journal of the International Society for the Prevention and Mitigation of Natural Hazards, Springer;International Society for the Prevention and Mitigation of Natural Hazards, vol. 80(3), pages 1625-1650, February.
    12. Ospina, Raydonal & Ferrari, Silvia L.P., 2012. "A general class of zero-or-one inflated beta regression models," Computational Statistics & Data Analysis, Elsevier, vol. 56(6), pages 1609-1623.
    13. Alisson de Oliveira Silva & Jonas Weverson de Ararújo Silva & Patrícia L Espinheira, 2022. "Bootstrap-based inferential improvements to the simplex nonlinear regression model," PLOS ONE, Public Library of Science, vol. 17(8), pages 1-27, August.
    14. Chen, Kee Kuo & Chiu, Rong-Her & Chang, Ching-Ter, 2017. "Using beta regression to explore the relationship between service attributes and likelihood of customer retention for the container shipping industry," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 104(C), pages 1-16.
    15. Yiyun Shou & Michael Smithson, 2015. "Evaluating Predictors of Dispersion: A Comparison of Dominance Analysis and Bayesian Model Averaging," Psychometrika, Springer;The Psychometric Society, vol. 80(1), pages 236-256, March.
    16. Tariq Maqsood & Mark Edwards & Ioanna Ioannou & Ioannis Kosmidis & Tiziana Rossetto & Neil Corby, 2016. "Seismic vulnerability functions for Australian buildings by using GEM empirical vulnerability assessment guidelines," Natural Hazards: Journal of the International Society for the Prevention and Mitigation of Natural Hazards, Springer;International Society for the Prevention and Mitigation of Natural Hazards, vol. 80(3), pages 1625-1650, February.
    17. Oscar Melo & Carlos Melo & Jorge Mateu, 2015. "Distance-based beta regression for prediction of mutual funds," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 99(1), pages 83-106, January.
    18. repec:jss:jstsof:34:i02 is not listed on IDEAS
    19. Yuri S. Maluf & Silvia L. P. Ferrari & Francisco F. Queiroz, 2025. "Robust beta regression through the logit transformation," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 88(1), pages 61-81, January.
    20. Edilberto Cepeda-Cuervo & Vicente Núñez-Antón, 2013. "Spatial Double Generalized Beta Regression Models," Journal of Educational and Behavioral Statistics, , vol. 38(6), pages 604-628, December.
    21. Cepeda-Cuervo Edilberto & Garrido Liliana, 2015. "Bayesian beta regression models with joint mean and dispersion modeling," Monte Carlo Methods and Applications, De Gruyter, vol. 21(1), pages 49-58, March.

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:hal:journl:hal-05539060. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: CCSD (email available below). General contact details of provider: https://hal.archives-ouvertes.fr/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.