IDEAS home Printed from https://ideas.repec.org/a/spr/compst/v40y2025i4d10.1007_s00180-025-01604-7.html

Lasso multinomial performance indicators for in-play basketball data

Author

Listed:
  • Argyro Damoulaki

    (Athens University of Economics and Business
    AUEB Sports Analytics Group, Computational and Bayesian Statistics Lab
    Institute of Statistical Research Analysis and Documentation (ISTAER))

  • Ioannis Ntzoufras

    (Athens University of Economics and Business
    AUEB Sports Analytics Group, Computational and Bayesian Statistics Lab
    Institute of Statistical Research Analysis and Documentation (ISTAER))

  • Konstantinos Pelechrinis

    (University of Pittsburgh)

Abstract

A typical approach to quantify the contribution of each player in basketball uses the plus–minus method. The ratings obtained by such a method are estimated using simple regression models and their regularized variants, with response variable being either the points scored or the point differences. To capture more precisely the effect of each player, detailed possession-based play-by-play data may be used. This is the direction we take in this article, in which we investigate the performance of regularized adjusted plus–minus (RAPM) indicators estimated by different regularized models having as a response the number of points scored in each possession. Therefore, we use possession play-by-play data from all NBA games for the season 2021–2022 (322,852 possessions). We initially present simple regression model-based indices starting from the implementation of ridge regression which is the standard technique in the relevant literature. We proceed with the lasso approach which has specific advantages and better performance than ridge regression when compared with selected objective validation criteria. Then, we implement regularized binary and multinomial logistic regression models to obtain more accurate performance indicators since the response is a discrete variable taking values mainly from zero to three. Our final proposal is an improved RAPM measure which is based on the expected points of a multinomial logistic regression model where each player’s contribution is weighted by his participation in the team’s possessions. The proposed indicator, called weighted expected points (wEPTS), outperforms all other RAPM measures we investigate in this study.

Suggested Citation

  • Argyro Damoulaki & Ioannis Ntzoufras & Konstantinos Pelechrinis, 2025. "Lasso multinomial performance indicators for in-play basketball data," Computational Statistics, Springer, vol. 40(4), pages 2157-2181, April.
  • Handle: RePEc:spr:compst:v:40:y:2025:i:4:d:10.1007_s00180-025-01604-7
    DOI: 10.1007/s00180-025-01604-7
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s00180-025-01604-7
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s00180-025-01604-7?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. Carlos M. Carvalho & Nicholas G. Polson & James G. Scott, 2010. "The horseshoe estimator for sparse signals," Biometrika, Biometrika Trust, vol. 97(2), pages 465-480.
    2. Shankar Ghimire & Justin A Ehrlich & Shane D Sanders, 2020. "Measuring individual worker output in a complementary team setting: Does regularized adjusted plus minus isolate individual NBA player contributions?," PLOS ONE, Public Library of Science, vol. 15(8), pages 1-11, August.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Hauzenberger, Niko & Huber, Florian & Klieber, Karin & Marcellino, Massimiliano, 2025. "Bayesian neural networks for macroeconomic analysis," Journal of Econometrics, Elsevier, vol. 249(PC).
    2. Lucas D. Konrad & Lukas Vashold & Jesus Crespo Cuaresma, 2026. "Bayesian Indicator-Saturated Regression for Climate Policy Evaluation," Papers 2603.04997, arXiv.org.
    3. Tamal Ghosh & Malay Ghosh & Jerry J. Maples & Xueying Tang, 2022. "Multivariate Global-Local Priors for Small Area Estimation," Stats, MDPI, vol. 5(3), pages 1-16, July.
    4. Martin Feldkircher & Florian Huber & Gary Koop & Michael Pfarrhofer, 2022. "APPROXIMATE BAYESIAN INFERENCE AND FORECASTING IN HUGE‐DIMENSIONAL MULTICOUNTRY VARs," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 63(4), pages 1625-1658, November.
    5. Andrey Polbin & Andrei Shumilov, 2025. "Nowcasting and forecasting Russian GDP and its components using quantile models," Applied Econometrics, Russian Presidential Academy of National Economy and Public Administration (RANEPA), vol. 79, pages 5-26.
    6. Phella, Anthoulla & Gabriel, Vasco J. & Martins, Luis F., 2024. "Predicting tail risks and the evolution of temperatures," Energy Economics, Elsevier, vol. 131(C).
    7. Ley, Eduardo & Steel, Mark F.J., 2012. "Mixtures of g-priors for Bayesian model averaging with economic applications," Journal of Econometrics, Elsevier, vol. 171(2), pages 251-266.
    8. Martin Guth, 2022. "Predicting Default Probabilities for Stress Tests: A Comparison of Models," Papers 2202.03110, arXiv.org.
    9. Hauzenberger, Niko, 2021. "Flexible Mixture Priors for Large Time-varying Parameter Models," Econometrics and Statistics, Elsevier, vol. 20(C), pages 87-108.
    10. Todd E. Clark & Florian Huber & Gary Koop & Massimiliano Marcellino & Michael Pfarrhofer, 2023. "Tail Forecasting With Multivariate Bayesian Additive Regression Trees," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 64(3), pages 979-1022, August.
    11. Chan, Joshua C.C., 2021. "Minnesota-type adaptive hierarchical priors for large Bayesian VARs," International Journal of Forecasting, Elsevier, vol. 37(3), pages 1212-1226.
    12. Ander Wilson & Brian J. Reich, 2014. "Confounder selection via penalized credible regions," Biometrics, The International Biometric Society, vol. 70(4), pages 852-861, December.
    13. Barbaglia, Luca & Frattarolo, Lorenzo & Hauzenberger, Niko & Hirschbühl, Dominik & Huber, Florian & Onorante, Luca & Pfarrhofer, Michael & Pezzoli, Luca Tiozzo, 2026. "Nowcasting economic activity in European regions using a mixed-frequency dynamic factor model," International Journal of Forecasting, Elsevier, vol. 42(2), pages 657-672.
    14. Lukas Berend & Jan Pruser, 2024. "The Transmission of Monetary Policy via Common Cycles in the Euro Area," Papers 2410.05741, arXiv.org, revised Nov 2024.
    15. Anindya Bhadra, 2022. "Discussion to: Bayesian graphical models for modern biological applications by Y. Ni, V. Baladandayuthapani, M. Vannucci and F.C. Stingo," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 31(2), pages 235-239, June.
    16. Harrison Katz & Liz Medina & Robert E. Weiss, 2025. "Sensitivity Analysis of Priors in the Bayesian Dirichlet Auto-Regressive Moving Average Model," Forecasting, MDPI, vol. 7(3), pages 1-19, June.
    17. Debamita Kundu & Riten Mitra & Jeremy T. Gaskins, 2021. "Bayesian variable selection for multioutcome models through shared shrinkage," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 48(1), pages 295-320, March.
    18. Yi Nengjun & Ma Shuangge, 2012. "Hierarchical Shrinkage Priors and Model Fitting for High-dimensional Generalized Linear Models," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 11(6), pages 1-25, November.
    19. You Wu & Jeremy Gaskins & Maiying Kong & Susmita Datta, 2018. "Profiling the effects of short time†course cold ischemia on tumor protein phosphorylation using a Bayesian approach," Biometrics, The International Biometric Society, vol. 74(1), pages 331-341, March.
    20. Ryan Martin & Bo Ning, 2020. "Empirical Priors and Coverage of Posterior Credible Sets in a Sparse Normal Mean Model," Sankhya A: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 82(2), pages 477-498, August.

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:compst:v:40:y:2025:i:4:d:10.1007_s00180-025-01604-7. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.