Bayesian statistics meets sports: a comprehensive review

Author

Listed:
  • Santos-Fernandez Edgar

    (Queensland University of Technology, Faculty of Science and Engineering, School of Mathematical Sciences, Y Block, Floor 8, Gardens Point Campus Queensland University of Technology, GPO Box 2434, Brisbane, Queensland, Australia, e-mail: santosfe@qut.edu.au)

  • Wu Paul
  • Mengersen Kerrie L.

    (Queensland University of Technology, Faculty of Science and Engineering, School of Mathematical Sciences, Brisbane, Queensland, Australia)

Abstract

Bayesian methods are becoming increasingly popular in sports analytics. Identified advantages of the Bayesian approach include the ability to model complex problems, obtain probabilistic estimates and predictions that account for uncertainty, combine information sources and update learning as new data become available. The volume and variety of data produced in sports activities over recent years and the availability of software packages for Bayesian computation have contributed significantly to this growth. This comprehensive survey reviews and characterizes the latest advances in Bayesian statistics in sports, including methods and applications. We found that a large proportion of these articles focus on modeling or predicting the outcome of sports games and on developing statistics that provide a better picture of athletes’ performance. We describe some of the advances in basketball, football and baseball. We also summarise the sources of data used for the analysis and the most commonly used software for Bayesian computation. We found that the number of publications between 2013 and 2018 is similar to the number published in the three previous decades, which indicates the growing adoption of Bayesian methods in sports.

Suggested Citation

  • Santos-Fernandez Edgar & Wu Paul & Mengersen Kerrie L., 2019. "Bayesian statistics meets sports: a comprehensive review," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 15(4), pages 289-312, December.
  • Handle: RePEc:bpj:jqsprt:v:15:y:2019:i:4:p:289-312:n:5
    DOI: 10.1515/jqas-2018-0106

    Download full text from publisher

    File URL: https://doi.org/10.1515/jqas-2018-0106
    Download Restriction: For access to full text, subscription to the journal or payment for the individual article is required.

    References listed on IDEAS

    1. Siem Jan Koopman & Rutger Lit, 2015. "A dynamic bivariate Poisson model for analysing and forecasting match results in the English Premier League," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 178(1), pages 167-186, January.
    2. Sturtz, Sibylle & Ligges, Uwe & Gelman, Andrew, 2005. "R2WinBUGS: A Package for Running WinBUGS from R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 12(i03).
    3. McShane Blakeley B. & Braunstein Alexander & Piette James & Jensen Shane T., 2011. "A Hierarchical Bayesian Variable Selection Approach to Major League Baseball Hitting Metrics," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 7(4), pages 1-26, October.
    4. Daniel Cervone & Alex D’Amour & Luke Bornn & Kirk Goldsberry, 2016. "A Multiresolution Stochastic Process Model for Predicting Basketball Possession Outcomes," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(514), pages 585-599, April.
    5. Koulis Theodoro & Muthukumarana Saman & Briercliffe Creagh Dyson, 2014. "A Bayesian stochastic model for batting performance evaluation in one-day cricket," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 10(1), pages 1-13, January.
    6. Miskin Michelle A & Fellingham Gilbert W & Florence Lindsay W, 2010. "Skill Importance in Women's Volleyball," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 6(2), pages 1-14, April.
    7. Silva Rajitha M. & Swartz Tim B., 2016. "Analysis of substitution times in soccer," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 12(3), pages 113-122, September.
    8. Stephenson Alec G. & Tawn Jonathan A., 2013. "Determining the Best Track Performances of All Time Using a Conceptual Population Model for Athletics Records," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 9(1), pages 67-76, March.
    9. Mark E. Glickman, 1999. "Parameter Estimation in Large Dynamic Paired Comparison Experiments," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 48(3), pages 377-394.
    10. Visser, Ingmar & Speekenbrink, Maarten, 2010. "depmixS4: An R Package for Hidden Markov Models," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 36(i07).
    11. A K Suzuki & L E B Salasar & J G Leite & F Louzada-Neto, 2010. "A Bayesian approach for predicting match outcomes: The 2006 (Association) Football World Cup," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 61(10), pages 1530-1539, October.
    12. Stevenson Oliver George & Brewer Brendon J., 2017. "Bayesian survival analysis of batsmen in Test cricket," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 13(1), pages 25-36, March.
    13. Tae Young Yang, 2004. "Bayesian binary segmentation procedure for detecting streakiness in sports," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 167(4), pages 627-637, November.
    14. Neal Dan & Tan James & Hao Feng & Wu Samuel S, 2010. "Simply Better: Using Regression Models to Estimate Major League Batting Averages," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 6(3), pages 1-14, July.
    15. Kovalchik Stephanie A. & Albert Jim, 2017. "A multilevel Bayesian approach for modeling the time-to-serve in professional tennis," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 13(2), pages 49-62, June.
    16. Shortridge Ashton & Goldsberry Kirk & Adams Matthew, 2014. "Creating space to shoot: quantifying spatial relative field goal efficiency in basketball," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 10(3), pages 1-11, September.
    17. Gramacy Robert B. & Taddy Matt & Jensen Shane T., 2013. "Estimating player contribution in hockey with regularized logistic regression," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 9(1), pages 97-111, March.
    18. Albert Jim, 2008. "Streaky Hitting in Baseball," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 4(1), pages 1-34, January.
    19. Thomas Andrew C, 2006. "The Impact of Puck Possession and Location on Ice Hockey Strategy," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 2(1), pages 1-19, January.
    20. Gianluca Baio & Marta Blangiardo, 2010. "Bayesian hierarchical model for the prediction of football results," Journal of Applied Statistics, Taylor & Francis Journals, vol. 37(2), pages 253-264.
    21. Albert Jim, 2016. "Improved component predictions of batting and pitching measures," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 12(2), pages 73-85, June.
    22. Leonardo Lamas & Felipe Santana & Matthew Heiner & Carlos Ugrinowitsch & Gilbert Fellingham, 2015. "Modeling the Offensive-Defensive Interaction and Resulting Outcomes in Basketball," PLOS ONE, Public Library of Science, vol. 10(12), pages 1-14, December.
    23. Mark Glickman, 2001. "Dynamic paired comparison models with stochastic variances," Journal of Applied Statistics, Taylor & Francis Journals, vol. 28(6), pages 673-689.
    24. Glickman Mark E. & Hennessy Jonathan, 2015. "A stochastic rank ordered logit model for rating multi-competitor games and sports," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 11(3), pages 131-144, September.
    25. Ruiz Francisco J. R. & Perez-Cruz Fernando, 2015. "A generative model for predicting outcomes in college basketball," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 11(1), pages 39-52, March.
    26. Martin, Andrew D. & Quinn, Kevin M. & Park, Jong Hee, 2011. "MCMCpack: Markov Chain Monte Carlo in R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 42(i09).
    27. Matt Taddy, 2013. "Multinomial Inverse Regression for Text Analysis," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 108(503), pages 755-770, September.
    28. Baker, Rose D. & McHale, Ian G., 2017. "An empirical Bayes model for time-varying paired comparisons ratings: Who is the greatest women’s tennis player?," European Journal of Operational Research, Elsevier, vol. 258(1), pages 328-333.
    29. Wimmer Valentin & Fenske Nora & Pyrka Patricia & Fahrmeir Ludwig, 2011. "Exploring Competition Performance in Decathlon Using Semi-Parametric Latent Variable Models," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 7(4), pages 1-21, October.
    30. Deshpande Sameer K. & Jensen Shane T., 2016. "Estimating an NBA player’s impact on his team’s chances of winning," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 12(2), pages 51-72, June.
    31. Murray Thomas A., 2017. "Ranking ultimate teams using a Bayesian score-augmented win-loss model," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 13(2), pages 63-78, June.
    32. Alessandro Liberati & Douglas G Altman & Jennifer Tetzlaff & Cynthia Mulrow & Peter C Gøtzsche & John P A Ioannidis & Mike Clarke & P J Devereaux & Jos Kleijnen & David Moher, 2009. "The PRISMA Statement for Reporting Systematic Reviews and Meta-Analyses of Studies That Evaluate Health Care Interventions: Explanation and Elaboration," PLOS Medicine, Public Library of Science, vol. 6(7), pages 1-28, July.
    33. Rose D. Baker & Ian G. McHale, 2015. "Deterministic Evolution of Strength in Multiple Comparisons Models: Who is the Greatest Golfer?," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 42(1), pages 180-196, March.
    34. Golnaz Shahtahmassebi & Rana Moyeed, 2016. "An application of the generalized Poisson difference distribution to the Bayesian modelling of football scores," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 70(3), pages 260-273, August.
    35. Cafarelli Ryan & Rigdon Christopher J. & Rigdon Steven E., 2012. "Models for Third Down Conversion in the National Football League," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 8(3), pages 1-26, October.

    Citations

    Cited by:

    1. Griffin Jim E. & Hinoveanu Laurenţiu C. & Hopker James G., 2022. "Bayesian modelling of elite sporting performance with large databases," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 18(4), pages 253-268, December.
    2. Sabin R. Paul, 2021. "Estimating player value in American football using plus–minus models," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 17(4), pages 313-364, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Angelini, Giovanni & Candila, Vincenzo & De Angelis, Luca, 2022. "Weighted Elo rating for tennis match predictions," European Journal of Operational Research, Elsevier, vol. 297(1), pages 120-132.
    2. Blaž Krese & Erik Štrumbelj, 2021. "A Bayesian approach to time-varying latent strengths in pairwise comparisons," PLOS ONE, Public Library of Science, vol. 16(5), pages 1-17, May.
    3. Yurko Ronald & Ventura Samuel & Horowitz Maksim, 2019. "nflWAR: a reproducible method for offensive player evaluation in football," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 15(3), pages 163-183, September.
    4. Sabin R. Paul, 2021. "Estimating player value in American football using plus–minus models," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 17(4), pages 313-364, December.
    5. Song, Kai & Shi, Jian, 2020. "A gamma process based in-play prediction model for National Basketball Association games," European Journal of Operational Research, Elsevier, vol. 283(2), pages 706-713.
    6. Luke S. Benz & Michael J. Lopez, 2023. "Estimating the change in soccer’s home advantage during the Covid-19 pandemic using bivariate Poisson regression," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 107(1), pages 205-232, March.
    7. Jacek Osiewalski & Jerzy Marzec, 2019. "Joint modelling of two count variables when one of them can be degenerate," Computational Statistics, Springer, vol. 34(1), pages 153-171, March.
    8. Alexandra Grand & Regina Dittrich & Brian Francis, 2015. "Markov models of dependence in longitudinal paired comparisons: an application to course design," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 99(2), pages 237-257, April.
    9. Kharrat, Tarak & McHale, Ian G. & Peña, Javier López, 2020. "Plus–minus player ratings for soccer," European Journal of Operational Research, Elsevier, vol. 283(2), pages 726-736.
    10. Szczecinski Leszek, 2022. "G-Elo: generalization of the Elo algorithm by modeling the discretized margin of victory," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 18(1), pages 1-14, March.
    11. Albert Jim, 2013. "Looking at spacings to assess streakiness," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 9(2), pages 151-163, June.
    12. Albert Jim, 2016. "Improved component predictions of batting and pitching measures," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 12(2), pages 73-85, June.
    13. Lasek, Jan & Gagolewski, Marek, 2021. "Interpretable sports team rating models based on the gradient descent algorithm," International Journal of Forecasting, Elsevier, vol. 37(3), pages 1061-1071.
    14. Devlin Stephen & Treloar Thomas & Creagar Molly & Cassels Samuel, 2021. "An iterative Markov rating method," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 17(2), pages 117-127, June.
    15. Abdolnasser Sadeghkhani & Seyed Ejaz Ahmed, 2019. "A Bayesian Approach to Predict the Number of Goals in Hockey," Stats, MDPI, vol. 2(2), pages 1-11, April.
    16. Corona, Francisco & Forrest, David & Tena, J.D. & Wiper, Michael, 2019. "Bayesian forecasting of UEFA Champions League under alternative seeding regimes," International Journal of Forecasting, Elsevier, vol. 35(2), pages 722-732.
    17. Gerber Eric A. E. & Craig Bruce A., 2021. "A mixed effects multinomial logistic-normal model for forecasting baseball performance," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 17(3), pages 221-239, September.
    18. Seong W. Kim & Sabina Shahin & Hon Keung Tony Ng & Jinheum Kim, 2021. "Binary segmentation procedures using the bivariate binomial distribution for detecting streakiness in sports data," Computational Statistics, Springer, vol. 36(3), pages 1821-1843, September.
    19. Kovalchik, Stephanie, 2020. "Extension of the Elo rating system to margin of victory," International Journal of Forecasting, Elsevier, vol. 36(4), pages 1329-1341.
    20. Federico ANDREIS & Pier Alda FERRARI, 2015. "Customer Satisfaction Evaluation Using Multidimensional Item Response Theory Models," Departmental Working Papers 2015-25, Department of Economics, Management and Quantitative Methods at Università degli Studi di Milano.
