IDEAS home Printed from https://ideas.repec.org/a/spr/annopr/v325y2023i1d10.1007_s10479-022-04784-3.html
   My bibliography  Save this article

Spatial performance analysis in basketball with CART, random forest and extremely randomized trees

Author

Listed:
  • Paola Zuccolotto

    (University of Brescia)

  • Marco Sandri

    (University of Brescia)

  • Marica Manisera

    (University of Brescia)

Abstract

This paper proposes tools for spatial performance analysis in basketball. In detail, we aim at representing maps of the court visualizing areas with different levels of scoring probability of the analysed player or team. To do that, we propose the adoption of algorithmic modeling techniques. Firstly, following previous studies, we examine CART, highlighting strengths and weaknesses. With respect to what done in the past, here we propose the use of polar coordinates, which are more consistent with the basketball court geometry. In order to overcome CART’s drawbacks while maintaining its points of force, we propose to resort to CART-based ensemble learning algorithms, namely to Random Forest and Extremely Randomized Trees, which are shown to be able to give excellent results in terms of interpretation and robustness. Finally, an index is defined in order to measure the map’s graphical goodness, which can be used—jointly with measures of the out-of-sample error—to tune the algorithm’s parameters. The functioning of the proposed approaches is shown by the analysis of real data of the NBA regular season 2020/2021.

Suggested Citation

  • Paola Zuccolotto & Marco Sandri & Marica Manisera, 2023. "Spatial performance analysis in basketball with CART, random forest and extremely randomized trees," Annals of Operations Research, Springer, vol. 325(1), pages 495-519, June.
  • Handle: RePEc:spr:annopr:v:325:y:2023:i:1:d:10.1007_s10479-022-04784-3
    DOI: 10.1007/s10479-022-04784-3
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10479-022-04784-3
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10479-022-04784-3?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Biau, Gérard & Devroye, Luc, 2010. "On the layered nearest neighbour estimate, the bagged nearest neighbour estimate and the random forest method in regression and classification," Journal of Multivariate Analysis, Elsevier, vol. 101(10), pages 2499-2518, November.
    2. Daniel Cervone & Alex D’Amour & Luke Bornn & Kirk Goldsberry, 2016. "A Multiresolution Stochastic Process Model for Predicting Basketball Possession Outcomes," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(514), pages 585-599, April.
    3. Kubatko Justin & Oliver Dean & Pelton Kevin & Rosenbaum Dan T, 2007. "A Starting Point for Analyzing Basketball Statistics," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 3(3), pages 1-24, July.
    4. Metulini Rodolfo & Manisera Marica & Zuccolotto Paola, 2018. "Modelling the dynamic pattern of surface area in basketball and its effects on team performance," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 14(3), pages 117-130, September.
    5. Koh Koon Teck & C.K.J Wang & C.J. Mallett, 2012. "Discriminating Factors between Successful and Unsuccessful Elite Youth Olympic Female Basketball Teams," International Journal of Performance Analysis in Sport, Taylor & Francis Journals, vol. 12(1), pages 119-131, April.
    6. Steven Wu & Luke Bornn, 2018. "Modeling Offensive Player Movement in Professional Basketball," The American Statistician, Taylor & Francis Journals, vol. 72(1), pages 72-79, January.
    7. G. V. Kass, 1980. "An Exploratory Technique for Investigating Large Quantities of Categorical Data," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 29(2), pages 119-127, June.
    8. Yuan Lo-Hua & Liu Anthony & Yeh Alec & Franks Alex & Wang Sherrie & Illushin Dmitri & Bornn Luke & Kaufman Aaron & Reece Andrew & Bull Peter, 2015. "A mixture-of-modelers approach to forecasting NCAA tournament outcomes," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 11(1), pages 13-27, March.
    9. Koh Koon Teck & John Wang & Mallett Clifford, 2011. "Discriminating Factors between Successful and Unsuccessful Teams: A Case Study in Elite Youth Olympic Basketball Games," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 7(3), pages 1-15, July.
    10. L. Lamas & D. De Rose Junior & F. Santana & E. Rostaiser & L. Negretti & C. Ugrinowitsch, 2011. "Space creation dynamics in basketball offence: validation and evaluation of elite teams," International Journal of Performance Analysis in Sport, Taylor & Francis Journals, vol. 11(1), pages 71-84, April.
    11. Gabel Alan & Redner Sidney, 2012. "Random Walk Picture of Basketball Scoring," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 8(1), pages 1-20, March.
    12. Marco Sandri & Paola Zuccolotto & Marica Manisera, 2020. "Markov switching modelling of shooting performance variability and teammate interactions in basketball," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 69(5), pages 1337-1356, November.
    13. Paola Zuccolotto & Marco Sandri & Marica Manisera, 2021. "Spatial Performance Indicators and Graphs in Basketball," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 156(2), pages 725-738, August.
    14. Fearnhead Paul & Taylor Benjamin Matthew, 2011. "On Estimating the Ability of NBA Players," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 7(3), pages 1-18, July.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Paola Zuccolotto & Marco Sandri & Marica Manisera, 2021. "Spatial Performance Indicators and Graphs in Basketball," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 156(2), pages 725-738, August.
    2. Pierpalo D’Urso & Livia Giovanni & Vincenzina Vitale, 2023. "A Bayesian network to analyse basketball players’ performances: a multivariate copula-based approach," Annals of Operations Research, Springer, vol. 325(1), pages 419-440, June.
    3. Alessandro Chessa & Pierpaolo D’Urso & Livia Giovanni & Vincenzina Vitale & Alfonso Gebbia, 2023. "Complex networks for community detection of basketball players," Annals of Operations Research, Springer, vol. 325(1), pages 363-389, June.
    4. Rodolfo Metulini & Giorgio Gnecco, 2023. "Measuring players’ importance in basketball using the generalized Shapley value," Annals of Operations Research, Springer, vol. 325(1), pages 441-465, June.
    5. Kęstutis Matulaitis & Tomas Bietkis, 2021. "Prediction of Offensive Possession Ends in Elite Basketball Teams," IJERPH, MDPI, vol. 18(3), pages 1-11, January.
    6. Manlio Migliorati & Marica Manisera & Paola Zuccolotto, 2023. "Integration of model-based recursive partitioning with bias reduction estimation: a case study assessing the impact of Oliver’s four factors on the probability of winning a basketball game," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 107(1), pages 271-293, March.
    7. Sabin R. Paul, 2021. "Estimating player value in American football using plus–minus models," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 17(4), pages 313-364, December.
    8. Nikolaos Stavropoulos & Alexandra Papadopoulou & Pavlos Kolias, 2021. "Evaluating the Efficiency of Off-Ball Screens in Elite Basketball Teams via Second-Order Markov Modelling," Mathematics, MDPI, vol. 9(16), pages 1-13, August.
    9. Jun Woo Kim & Mar Magnusen & Seunghoon Jeong, 2023. "March Madness prediction: Different machine learning approaches with non‐box score statistics," Managerial and Decision Economics, John Wiley & Sons, Ltd., vol. 44(4), pages 2223-2236, June.
    10. Marco Sandri & Paola Zuccolotto & Marica Manisera, 2020. "Markov switching modelling of shooting performance variability and teammate interactions in basketball," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 69(5), pages 1337-1356, November.
    11. Strobl, Carolin & Boulesteix, Anne-Laure & Augustin, Thomas, 2007. "Unbiased split selection for classification trees based on the Gini Index," Computational Statistics & Data Analysis, Elsevier, vol. 52(1), pages 483-501, September.
    12. Hache, Emmanuel & Leboullenger, Déborah & Mignon, Valérie, 2017. "Beyond average energy consumption in the French residential housing market: A household classification approach," Energy Policy, Elsevier, vol. 107(C), pages 82-95.
    13. Joseph Price & Justin Wolfers, 2010. "Racial Discrimination Among NBA Referees," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 125(4), pages 1859-1887.
    14. Ghosh, Atish R. & Qureshi, Mahvash S. & Kim, Jun Il & Zalduendo, Juan, 2014. "Surges," Journal of International Economics, Elsevier, vol. 92(2), pages 266-285.
      • Mahvash S Qureshi & Mr. Atish R. Ghosh & Mr. Juan Zalduendo & Mr. Jun I Kim, 2012. "Surges," IMF Working Papers 2012/022, International Monetary Fund.
    15. Tomàs Aluja-Banet & Eduard Nafria, 2003. "Stability and scalability in decision trees," Computational Statistics, Springer, vol. 18(3), pages 505-520, September.
    16. I. Albarrán & P. Alonso-González & J. M. Marin, 2017. "Some criticism to a general model in Solvency II: an explanation from a clustering point of view," Empirical Economics, Springer, vol. 52(4), pages 1289-1308, June.
    17. Schwartz, Ira M. & York, Peter & Nowakowski-Sims, Eva & Ramos-Hernandez, Ana, 2017. "Predictive and prescriptive analytics, machine learning and child welfare risk assessment: The Broward County experience," Children and Youth Services Review, Elsevier, vol. 81(C), pages 309-320.
    18. Ludden Ian G. & Jacobson Sheldon H. & Khatibi Arash & King Douglas M., 2020. "Models for generating NCAA men’s basketball tournament bracket pools," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 16(1), pages 1-15, March.
    19. Yousaf Muhammad & Dey Sandeep Kumar, 2022. "Best proxy to determine firm performance using financial ratios: A CHAID approach," Review of Economic Perspectives, Sciendo, vol. 22(3), pages 219-239, September.
    20. Ralf Elsner & Manfred Krafft & Arnd Huchzermeier, 2003. "Optimizing Rhenania's Mail-Order Business Through Dynamic Multilevel Modeling (DMLM)," Interfaces, INFORMS, vol. 33(1), pages 50-66, February.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:annopr:v:325:y:2023:i:1:d:10.1007_s10479-022-04784-3. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.