IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0265372.html
   My bibliography  Save this article

Using machine learning pipeline to predict entry into the attack zone in football

Author

Listed:
  • Leandro Stival
  • Allan Pinto
  • Felipe dos Santos Pinto de Andrade
  • Paulo Roberto Pereira Santiago
  • Henrik Biermann
  • Ricardo da Silva Torres
  • Ulisses Dias

Abstract

Sports sciences are increasingly data-intensive nowadays since computational tools can extract information from large amounts of data and derive insights from athlete performances during the competition. This paper addresses a performance prediction problem in soccer, a popular collective sport modality played by two teams competing against each other in the same field. In a soccer game, teams score points by placing the ball into the opponent’s goal and the winner is the team with the highest count of goals. Retaining possession of the ball is one key to success, but it is not enough since a team needs to score to achieve victory, which requires an offensive toward the opponent’s goal. The focus of this work is to determine if analyzing the first five seconds after the control of the ball is taken by one of the teams provides enough information to determine whether the ball will reach the final quarter of the soccer field, therefore creating a goal-scoring chance. By doing so, we can further investigate which conditions increase strategic leverage. Our approach comprises modeling players’ interactions as graph structures and extracting metrics from these structures. These metrics, when combined, form time series that we encode in two-dimensional representations of visual rhythms, allowing feature extraction through deep convolutional networks, coupled with a classifier to predict the outcome (whether the final quarter of the field is reached). The results indicate that offensive play near the adversary penalty area can be predicted by looking at the first five seconds. Finally, the explainability of our models reveals the main metrics along with its contributions for the final inference result, which corroborates other studies found in the literature for soccer match analysis.

Suggested Citation

  • Leandro Stival & Allan Pinto & Felipe dos Santos Pinto de Andrade & Paulo Roberto Pereira Santiago & Henrik Biermann & Ricardo da Silva Torres & Ulisses Dias, 2023. "Using machine learning pipeline to predict entry into the attack zone in football," PLOS ONE, Public Library of Science, vol. 18(1), pages 1-24, January.
  • Handle: RePEc:plo:pone00:0265372
    DOI: 10.1371/journal.pone.0265372
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0265372
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0265372&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0265372?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Craig Wright & Steve Atkins & Remco Polman & Bryan Jones & Lee Sargeson ., 2011. "Factors Associated with Goals and Goal Scoring Opportunities in Professional Soccer," International Journal of Performance Analysis in Sport, Taylor & Francis Journals, vol. 11(3), pages 438-449, December.
    2. Baboota, Rahul & Kaur, Harleen, 2019. "Predictive analysis and modelling football results using machine learning approach for English Premier League," International Journal of Forecasting, Elsevier, vol. 35(2), pages 741-755.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Galli, L. & Galvan, G. & Levato, T. & Liti, C. & Piccialli, V. & Sciandrone, M., 2021. "Football: Discovering elapsing-time bias in the science of success," Chaos, Solitons & Fractals, Elsevier, vol. 152(C).
    2. Carlos Lago-Peñas & Miguel Gómez-Ruano & Gai Yang, 2017. "Styles of play in professional soccer: an approach of the Chinese Soccer Super League," International Journal of Performance Analysis in Sport, Taylor & Francis Journals, vol. 17(6), pages 1073-1084, November.
    3. Wheatcroft Edward, 2021. "Evaluating probabilistic forecasts of football matches: the case against the ranked probability score," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 17(4), pages 273-287, December.
    4. Chunyang Huang & Shaoliang Zhang, 2023. "Explainable artificial intelligence model for identifying Market Value in Professional Soccer Players," Papers 2311.04599, arXiv.org, revised Nov 2023.
    5. Wheatcroft, Edward, 2021. "Evaluating probabilistic forecasts of football matches: the case against the ranked probability score," LSE Research Online Documents on Economics 111494, London School of Economics and Political Science, LSE Library.
    6. Schlembach, Christoph & Schmidt, Sascha L. & Schreyer, Dominik & Wunderlich, Linus, 2022. "Forecasting the Olympic medal distribution – A socioeconomic machine learning model," Technological Forecasting and Social Change, Elsevier, vol. 175(C).
    7. da Costa, Igor Barbosa & Marinho, Leandro Balby & Pires, Carlos Eduardo Santos, 2022. "Forecasting football results and exploiting betting markets: The case of “both teams to score”," International Journal of Forecasting, Elsevier, vol. 38(3), pages 895-909.
    8. Butler, David & Butler, Robert & Eakins, John, 2021. "Expert performance and crowd wisdom: Evidence from English Premier League predictions," European Journal of Operational Research, Elsevier, vol. 288(1), pages 170-182.
    9. Christoph Schlembach & Sascha L. Schmidt & Dominik Schreyer & Linus Wunderlich, 2020. "Forecasting the Olympic medal distribution during a pandemic: a socio-economic machine learning model," Papers 2012.04378, arXiv.org, revised Jun 2021.
    10. Marc Brechot & Raphael Flepp, 2018. "Dealing with randomness in match outcomes: how to rethink performance evaluation and decision-making in European club football," Working Papers 374, University of Zurich, Department of Business Administration (IBW).
    11. Koopman, Siem Jan & Lit, Rutger, 2019. "Forecasting football match results in national league competitions using score-driven time series models," International Journal of Forecasting, Elsevier, vol. 35(2), pages 797-809.
    12. Green, Lawrence & Sung, Ming-Chien & Ma, Tiejun & Johnson, Johnnie E. V., 2019. "To what extent can new web-based technology improve forecasts? Assessing the economic value of information derived from Virtual Globes and its rate of diffusion in a financial market," European Journal of Operational Research, Elsevier, vol. 278(1), pages 226-239.
    13. Maurizio Carpita & Enrico Ciavolino & Paola Pasca, 2021. "Players’ Role-Based Performance Composite Indicators of Soccer Teams: A Statistical Perspective," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 156(2), pages 815-830, August.
    14. Jeremy P Alexander & Karl B Jackson & Timothy Bedin & Matthew A Gloster & Sam Robertson, 2022. "Quantifying congestion with player tracking data in Australian football," PLOS ONE, Public Library of Science, vol. 17(8), pages 1-16, August.
    15. Hassanniakalager, Arman & Sermpinis, Georgios & Stasinakis, Charalampos & Verousis, Thanos, 2020. "A conditional fuzzy inference approach in forecasting," European Journal of Operational Research, Elsevier, vol. 283(1), pages 196-216.
    16. Marc Brechot & Raphael Flepp, 2020. "Dealing With Randomness in Match Outcomes: How to Rethink Performance Evaluation in European Club Football Using Expected Goals," Journal of Sports Economics, , vol. 21(4), pages 335-362, May.
    17. Christoph Buehren & Dominic Jung, 2022. "Performing without pressure? The effect of ghost games on effort- and skill-based tasks in the football Bundesliga," MAGKS Papers on Economics 202227, Philipps-Universität Marburg, Faculty of Business Administration and Economics, Department of Economics (Volkswirtschaftliche Abteilung).
    18. Federico Fioravanti & Fernando Delbianco & Fernando Tohmé, 2023. "The relative importance of ability, luck and motivation in team sports: a Bayesian model of performance in the English Rugby Premiership," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 32(3), pages 715-731, September.
    19. Sumit Sarkar & Sooraj Kamath, 2023. "Does luck play a role in the determination of the rank positions in football leagues? A study of Europe’s ‘big five’," Annals of Operations Research, Springer, vol. 325(1), pages 245-260, June.
    20. Fry, John & Serbera, Jean-Philippe & Wilson, Rob, 2021. "Managing performance expectations in association football," Journal of Business Research, Elsevier, vol. 135(C), pages 445-453.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0265372. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.