IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0282295.html
   My bibliography  Save this article

Expected goals in football: Improving model performance and demonstrating value

Author

Listed:
  • James Mead
  • Anthony O’Hare
  • Paul McMenemy

Abstract

Recently, football has seen the creation of various novel, ubiquitous metrics used throughout clubs’ analytics departments. These can influence many of their day-to-day operations ranging from financial decisions on player transfers, to evaluation of team performance. At the forefront of this scientific movement is the metric expected goals, a measure which allows analysts to quantify how likely a given shot is to result in a goal however, xG models have not until this point considered using important features, e.g., player/team ability and psychological effects, and is not widely trusted by everyone in the wider football community. This study aims to solve both these issues through the implementation of machine learning techniques by, modelling expected goals values using previously untested features and comparing the predictive ability of traditional statistics against this newly developed metric. Error values from the expected goals models built in this work were shown to be competitive with optimal values from other papers, and some of the features added in this study were revealed to have a significant impact on expected goals model outputs. Secondly, not only was expected goals found to be a superior predictor of a football team’s future success when compared to traditional statistics, but also our results outperformed those collected from an industry leader in the same area.

Suggested Citation

  • James Mead & Anthony O’Hare & Paul McMenemy, 2023. "Expected goals in football: Improving model performance and demonstrating value," PLOS ONE, Public Library of Science, vol. 18(4), pages 1-29, April.
  • Handle: RePEc:plo:pone00:0282295
    DOI: 10.1371/journal.pone.0282295
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0282295
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0282295&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0282295?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Jean-Marc Falter & Christophe Perignon, 2000. "Demand for football and intramatch winning probability: an essay on the glorious uncertainty of sports," Applied Economics, Taylor & Francis Journals, vol. 32(13), pages 1757-1765.
    2. Manuel Stein & Halldór Janetzko & Daniel Seebacher & Alexander Jäger & Manuel Nagel & Jürgen Hölsch & Sven Kosub & Tobias Schreck & Daniel A. Keim & Michael Grossniklaus, 2017. "How to Make Sense of Team Sport Data: From Acquisition to Data Modeling and Research Aspects," Data, MDPI, vol. 2(1), pages 1-23, January.
    3. Goddard, John, 2005. "Regression models for forecasting goals and match results in association football," International Journal of Forecasting, Elsevier, vol. 21(2), pages 331-340.
    4. Hvattum, Lars Magnus & Arntzen, Halvard, 2010. "Using ELO ratings for match result prediction in association football," International Journal of Forecasting, Elsevier, vol. 26(3), pages 460-470, July.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Ishara Bandara & Sergiy Shelyag & Sutharshan Rajasegarar & Dan Dwyer & Eun-jin Kim & Maia Angelova, 2024. "Predicting goal probabilities with improved xG models using event sequences in association football," PLOS ONE, Public Library of Science, vol. 19(10), pages 1-22, October.
    2. Pilar Malagón-Selma & Ana Debón & Josep Domenech, 2023. "Measuring the popularity of football players with Google Trends," PLOS ONE, Public Library of Science, vol. 18(8), pages 1-21, August.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Baboota, Rahul & Kaur, Harleen, 2019. "Predictive analysis and modelling football results using machine learning approach for English Premier League," International Journal of Forecasting, Elsevier, vol. 35(2), pages 741-755.
    2. da Costa, Igor Barbosa & Marinho, Leandro Balby & Pires, Carlos Eduardo Santos, 2022. "Forecasting football results and exploiting betting markets: The case of “both teams to score”," International Journal of Forecasting, Elsevier, vol. 38(3), pages 895-909.
    3. Wheatcroft, Edward, 2020. "A profitable model for predicting the over/under market in football," LSE Research Online Documents on Economics 103712, London School of Economics and Political Science, LSE Library.
    4. Szczecinski Leszek, 2022. "G-Elo: generalization of the Elo algorithm by modeling the discretized margin of victory," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 18(1), pages 1-14, March.
    5. Wheatcroft, Edward, 2020. "A profitable model for predicting the over/under market in football," International Journal of Forecasting, Elsevier, vol. 36(3), pages 916-932.
    6. Lasek, Jan & Gagolewski, Marek, 2021. "Interpretable sports team rating models based on the gradient descent algorithm," International Journal of Forecasting, Elsevier, vol. 37(3), pages 1061-1071.
    7. Peeters, Thomas, 2018. "Testing the Wisdom of Crowds in the field: Transfermarkt valuations and international soccer results," International Journal of Forecasting, Elsevier, vol. 34(1), pages 17-29.
    8. Koopman, Siem Jan & Lit, Rutger, 2019. "Forecasting football match results in national league competitions using score-driven time series models," International Journal of Forecasting, Elsevier, vol. 35(2), pages 797-809.
    9. P. Gorgi & S. J. Koopman & R. Lit, 2023. "Estimation of final standings in football competitions with a premature ending: the case of COVID-19," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 107(1), pages 233-250, March.
    10. Regina Madalozzo & Rodrigo Berber Villar, 2009. "Brazilian Football," Journal of Sports Economics, , vol. 10(6), pages 639-650, December.
    11. J. James Reade & Carl Singleton & Alasdair Brown, 2021. "Evaluating strange forecasts: The curious case of football match scorelines," Scottish Journal of Political Economy, Scottish Economic Society, vol. 68(2), pages 261-285, May.
    12. Gross, Johannes & Rebeggiani, Luca, 2018. "Chance or Ability? The Efficiency of the Football Betting Market Revisited," MPRA Paper 87230, University Library of Munich, Germany.
    13. Hvattum Lars Magnus, 2015. "Playing on artificial turf may be an advantage for Norwegian soccer teams," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 11(3), pages 183-192, September.
    14. J Reade & C Singleton & L Vaughan Williams, 2020. "Betting Markets for English Premier League Results and Scorelines: Evaluating a Simple Forecasting Model," Economic Issues Journal Articles, Economic Issues, vol. 25(1), pages 87-106, March.
    15. Andreas Heuer & Oliver Rubner, 2014. "Optimizing the Prediction Process: From Statistical Concepts to the Case Study of Soccer," PLOS ONE, Public Library of Science, vol. 9(9), pages 1-9, September.
    16. Sumit Sarkar & Sooraj Kamath, 2023. "Does luck play a role in the determination of the rank positions in football leagues? A study of Europe’s ‘big five’," Annals of Operations Research, Springer, vol. 325(1), pages 245-260, June.
    17. Marc Garnica-Caparrós & Daniel Memmert & Fabian Wunderlich, 2022. "Artificial data in sports forecasting: a simulation framework for analysing predictive models in sports," Information Systems and e-Business Management, Springer, vol. 20(3), pages 551-580, September.
    18. J. James Reade & Sachiko Akie, 2013. "Using Forecasting to Detect Corruption in International Football," Working Papers 2013-005, The George Washington University, Department of Economics, H. O. Stekler Research Program on Forecasting.
    19. Marius Ötting & Christian Deutscher & Carl Singleton & Luca De Angelis, 2023. "Gambling on Momentum in Contests," Economics Discussion Papers em-dp2023-08, Department of Economics, University of Reading.
    20. Constantinou Anthony Costa & Fenton Norman Elliott, 2013. "Determining the level of ability of football teams by dynamic ratings based on the relative discrepancies in scores between adversaries," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 9(1), pages 37-50, March.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0282295. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.