IDEAS home Printed from https://ideas.repec.org/a/spr/annopr/v325y2023i1d10.1007_s10479-022-04784-3.html
   My bibliography  Save this article

Spatial performance analysis in basketball with CART, random forest and extremely randomized trees

Author

Listed:
  • Paola Zuccolotto

    (University of Brescia)

  • Marco Sandri

    (University of Brescia)

  • Marica Manisera

    (University of Brescia)

Abstract

This paper proposes tools for spatial performance analysis in basketball. In detail, we aim at representing maps of the court visualizing areas with different levels of scoring probability of the analysed player or team. To do that, we propose the adoption of algorithmic modeling techniques. Firstly, following previous studies, we examine CART, highlighting strengths and weaknesses. With respect to what done in the past, here we propose the use of polar coordinates, which are more consistent with the basketball court geometry. In order to overcome CART’s drawbacks while maintaining its points of force, we propose to resort to CART-based ensemble learning algorithms, namely to Random Forest and Extremely Randomized Trees, which are shown to be able to give excellent results in terms of interpretation and robustness. Finally, an index is defined in order to measure the map’s graphical goodness, which can be used—jointly with measures of the out-of-sample error—to tune the algorithm’s parameters. The functioning of the proposed approaches is shown by the analysis of real data of the NBA regular season 2020/2021.

Suggested Citation

  • Paola Zuccolotto & Marco Sandri & Marica Manisera, 2023. "Spatial performance analysis in basketball with CART, random forest and extremely randomized trees," Annals of Operations Research, Springer, vol. 325(1), pages 495-519, June.
  • Handle: RePEc:spr:annopr:v:325:y:2023:i:1:d:10.1007_s10479-022-04784-3
    DOI: 10.1007/s10479-022-04784-3
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10479-022-04784-3
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10479-022-04784-3?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Kubatko Justin & Oliver Dean & Pelton Kevin & Rosenbaum Dan T, 2007. "A Starting Point for Analyzing Basketball Statistics," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 3(3), pages 1-24, July.
    2. Koh Koon Teck & C.K.J Wang & C.J. Mallett, 2012. "Discriminating Factors between Successful and Unsuccessful Elite Youth Olympic Female Basketball Teams," International Journal of Performance Analysis in Sport, Taylor & Francis Journals, vol. 12(1), pages 119-131, April.
    3. Steven Wu & Luke Bornn, 2018. "Modeling Offensive Player Movement in Professional Basketball," The American Statistician, Taylor & Francis Journals, vol. 72(1), pages 72-79, January.
    4. Koh Koon Teck & John Wang & Mallett Clifford, 2011. "Discriminating Factors between Successful and Unsuccessful Teams: A Case Study in Elite Youth Olympic Basketball Games," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 7(3), pages 1-15, July.
    5. Fearnhead Paul & Taylor Benjamin Matthew, 2011. "On Estimating the Ability of NBA Players," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 7(3), pages 1-18, July.
    6. Biau, Gérard & Devroye, Luc, 2010. "On the layered nearest neighbour estimate, the bagged nearest neighbour estimate and the random forest method in regression and classification," Journal of Multivariate Analysis, Elsevier, vol. 101(10), pages 2499-2518, November.
    7. Daniel Cervone & Alex D’Amour & Luke Bornn & Kirk Goldsberry, 2016. "A Multiresolution Stochastic Process Model for Predicting Basketball Possession Outcomes," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(514), pages 585-599, April.
    8. Metulini Rodolfo & Manisera Marica & Zuccolotto Paola, 2018. "Modelling the dynamic pattern of surface area in basketball and its effects on team performance," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 14(3), pages 117-130, September.
    9. G. V. Kass, 1980. "An Exploratory Technique for Investigating Large Quantities of Categorical Data," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 29(2), pages 119-127, June.
    10. Yuan Lo-Hua & Liu Anthony & Yeh Alec & Franks Alex & Wang Sherrie & Illushin Dmitri & Bornn Luke & Kaufman Aaron & Reece Andrew & Bull Peter, 2015. "A mixture-of-modelers approach to forecasting NCAA tournament outcomes," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 11(1), pages 13-27, March.
    11. L. Lamas & D. De Rose Junior & F. Santana & E. Rostaiser & L. Negretti & C. Ugrinowitsch, 2011. "Space creation dynamics in basketball offence: validation and evaluation of elite teams," International Journal of Performance Analysis in Sport, Taylor & Francis Journals, vol. 11(1), pages 71-84, April.
    12. Gabel Alan & Redner Sidney, 2012. "Random Walk Picture of Basketball Scoring," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 8(1), pages 1-20, March.
    13. Marco Sandri & Paola Zuccolotto & Marica Manisera, 2020. "Markov switching modelling of shooting performance variability and teammate interactions in basketball," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 69(5), pages 1337-1356, November.
    14. Paola Zuccolotto & Marco Sandri & Marica Manisera, 2021. "Spatial Performance Indicators and Graphs in Basketball," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 156(2), pages 725-738, August.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Paola Zuccolotto & Marco Sandri & Marica Manisera, 2021. "Spatial Performance Indicators and Graphs in Basketball," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 156(2), pages 725-738, August.
    2. Alessandro Chessa & Pierpaolo D’Urso & Livia Giovanni & Vincenzina Vitale & Alfonso Gebbia, 2023. "Complex networks for community detection of basketball players," Annals of Operations Research, Springer, vol. 325(1), pages 363-389, June.
    3. Pierpalo D’Urso & Livia Giovanni & Vincenzina Vitale, 2023. "A Bayesian network to analyse basketball players’ performances: a multivariate copula-based approach," Annals of Operations Research, Springer, vol. 325(1), pages 419-440, June.
    4. Rodolfo Metulini & Giorgio Gnecco, 2023. "Measuring players’ importance in basketball using the generalized Shapley value," Annals of Operations Research, Springer, vol. 325(1), pages 441-465, June.
    5. Kęstutis Matulaitis & Tomas Bietkis, 2021. "Prediction of Offensive Possession Ends in Elite Basketball Teams," IJERPH, MDPI, vol. 18(3), pages 1-11, January.
    6. Manlio Migliorati & Marica Manisera & Paola Zuccolotto, 2023. "Integration of model-based recursive partitioning with bias reduction estimation: a case study assessing the impact of Oliver’s four factors on the probability of winning a basketball game," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 107(1), pages 271-293, March.
    7. Nikolaos Stavropoulos & Alexandra Papadopoulou & Pavlos Kolias, 2021. "Evaluating the Efficiency of Off-Ball Screens in Elite Basketball Teams via Second-Order Markov Modelling," Mathematics, MDPI, vol. 9(16), pages 1-13, August.
    8. Sabin R. Paul, 2021. "Estimating player value in American football using plus–minus models," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 17(4), pages 313-364, December.
    9. Jun Woo Kim & Mar Magnusen & Seunghoon Jeong, 2023. "March Madness prediction: Different machine learning approaches with non‐box score statistics," Managerial and Decision Economics, John Wiley & Sons, Ltd., vol. 44(4), pages 2223-2236, June.
    10. Marco Sandri & Paola Zuccolotto & Marica Manisera, 2020. "Markov switching modelling of shooting performance variability and teammate interactions in basketball," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 69(5), pages 1337-1356, November.
    11. Strobl, Carolin & Boulesteix, Anne-Laure & Augustin, Thomas, 2007. "Unbiased split selection for classification trees based on the Gini Index," Computational Statistics & Data Analysis, Elsevier, vol. 52(1), pages 483-501, September.
    12. Joseph Price & Justin Wolfers, 2010. "Racial Discrimination Among NBA Referees," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 125(4), pages 1859-1887.
    13. I. Albarrán & P. Alonso-González & J. M. Marin, 2017. "Some criticism to a general model in Solvency II: an explanation from a clustering point of view," Empirical Economics, Springer, vol. 52(4), pages 1289-1308, June.
    14. Ludden Ian G. & Jacobson Sheldon H. & Khatibi Arash & King Douglas M., 2020. "Models for generating NCAA men’s basketball tournament bracket pools," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 16(1), pages 1-15, March.
    15. Yousaf Muhammad & Dey Sandeep Kumar, 2022. "Best proxy to determine firm performance using financial ratios: A CHAID approach," Review of Economic Perspectives, Sciendo, vol. 22(3), pages 219-239, September.
    16. Lorenzo Gasperi & Daniele Conte & Anthony Leicht & Miguel-Ángel Gómez-Ruano, 2020. "Game Related Statistics Discriminate National and Foreign Players According to Playing Position and Team Ability in the Women’s Basketball EuroLeague," IJERPH, MDPI, vol. 17(15), pages 1-10, July.
    17. Archana R. Panhalkar & Dharmpal D. Doye, 2020. "An approach of improving decision tree classifier using condensed informative data," DECISION: Official Journal of the Indian Institute of Management Calcutta, Springer;Indian Institute of Management Calcutta, vol. 47(4), pages 431-445, December.
    18. Bas Donkers & Richard Paap & Jedid‐Jah Jonker & Philip Hans Franses, 2006. "Deriving target selection rules from endogenously selected samples," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 21(5), pages 549-562, July.
    19. Lea Piscitelli & Annalisa De Boni & Rocco Roma & Giovanni Ottomano Palmisano, 2023. "Carbon Farming: How to Support Farmers in Choosing the Best Management Strategies for Low-Impact Food Production," Land, MDPI, vol. 13(1), pages 1-16, December.
    20. Luu, Tung Duy & Fadili, Jalal & Chesneau, Christophe, 2019. "PAC-Bayesian risk bounds for group-analysis sparse regression by exponential weighting," Journal of Multivariate Analysis, Elsevier, vol. 171(C), pages 209-233.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:annopr:v:325:y:2023:i:1:d:10.1007_s10479-022-04784-3. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.