IDEAS home Printed from https://ideas.repec.org/p/tin/wpaper/20170062.html
   My bibliography  Save this paper

Forecasting Football Match Results in National League Competitions Using Score-Driven Time Series Models

Author

Listed:
  • Siem Jan (S.J.) Koopman

    (VU Amsterdam, The Netherlands; CREATES, Aarhus University, Denmark; Tinbergen Institute, The Netherlands)

  • Rutger Lit

    (VU Amsterdam, The Netherlands)

Abstract

We develop a new dynamic multivariate model for the analysis and the forecasting of football match results in national league competitions. The proposed dynamic model is based on the score of the predictive observation mass function for a high-dimensional panel of weekly match results. Our main interest is to forecast whether the match result is a win, a loss or a draw for each team. To deliver such forecasts, the dynamic model can be based on three different dependent variables: the pairwise count of the number of goals, the difference between the number of goals, or the category of the match result (win, loss, draw). The different dependent variables require different distributional assumptions. Furthermore, different dynamic model specifications can be considered for generating the forecasts. We empirically investigate which dependent variable and which dynamic model specification yield the best forecasting results. In an extensive forecasting study, we consider match results from six large European football competitions and we validate the precision of the forecasts for a period of seven years for each competition. We conclude that our preferred dynamic model for pairwise counts delivers the most precise forecasts and outperforms benchmark and other competing models.

Suggested Citation

  • Siem Jan (S.J.) Koopman & Rutger Lit, 2017. "Forecasting Football Match Results in National League Competitions Using Score-Driven Time Series Models," Tinbergen Institute Discussion Papers 17-062/III, Tinbergen Institute.
  • Handle: RePEc:tin:wpaper:20170062
    as

    Download full text from publisher

    File URL: https://papers.tinbergen.nl/17062.pdf
    Download Restriction: no
    ---><---

    Other versions of this item:

    References listed on IDEAS

    as
    1. F. Blasques & S. J. Koopman & A. Lucas, 2015. "Information-theoretic optimality of observation-driven time series models for continuous responses," Biometrika, Biometrika Trust, vol. 102(2), pages 325-343.
    2. Ioannis Asimakopoulos & John Goddard, 2004. "Forecasting football results and the efficiency of fixed-odds betting," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 23(1), pages 51-66.
    3. Siem Jan Koopman & André Lucas & Marcel Scharth, 2016. "Predicting Time-Varying Parameters with Parameter-Driven and Observation-Driven Models," The Review of Economics and Statistics, MIT Press, vol. 98(1), pages 97-110, March.
    4. Siem Jan Koopman & Rutger Lit & André Lucas, 2017. "Intraday Stochastic Volatility in Discrete Price Changes: The Dynamic Skellam Model," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(520), pages 1490-1503, October.
    5. Diebold, Francis X & Mariano, Roberto S, 2002. "Comparing Predictive Accuracy," Journal of Business & Economic Statistics, American Statistical Association, vol. 20(1), pages 134-144, January.
    6. Giovanni Angelini & Luca De Angelis, 2017. "PARX model for football match predictions," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 36(7), pages 795-807, November.
    7. Boshnakov, Georgi & Kharrat, Tarak & McHale, Ian G., 2017. "A bivariate Weibull count model for forecasting association football scores," International Journal of Forecasting, Elsevier, vol. 33(2), pages 458-466.
    8. Constantinou Anthony Costa & Fenton Norman Elliott, 2012. "Solving the Problem of Inadequate Scoring Rules for Assessing Probabilistic Football Forecast Models," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 8(1), pages 1-14, March.
    9. Gourieroux, Christian & Monfort, Alain & Trognon, Alain, 1984. "Pseudo Maximum Likelihood Methods: Applications to Poisson Models," Econometrica, Econometric Society, vol. 52(3), pages 701-720, May.
    10. M. J. Maher, 1982. "Modelling association football scores," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 36(3), pages 109-118, September.
    11. Manuela Cattelan & Cristiano Varin & David Firth, 2013. "Dynamic Bradley–Terry modelling of sports tournaments," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 62(1), pages 135-150, January.
    12. Baboota, Rahul & Kaur, Harleen, 2019. "Predictive analysis and modelling football results using machine learning approach for English Premier League," International Journal of Forecasting, Elsevier, vol. 35(2), pages 741-755.
    13. Felix Famoye, 2010. "On the bivariate negative binomial regression model," Journal of Applied Statistics, Taylor & Francis Journals, vol. 37(6), pages 969-981.
    14. Drew Creal & Siem Jan Koopman & André Lucas, 2013. "Generalized Autoregressive Score Models With Applications," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 28(5), pages 777-795, August.
    15. Goddard, John, 2005. "Regression models for forecasting goals and match results in association football," International Journal of Forecasting, Elsevier, vol. 21(2), pages 331-340.
    16. Hvattum, Lars Magnus & Arntzen, Halvard, 2010. "Using ELO ratings for match result prediction in association football," International Journal of Forecasting, Elsevier, vol. 26(3), pages 460-470, July.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Wheatcroft Edward, 2021. "Evaluating probabilistic forecasts of football matches: the case against the ranked probability score," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 17(4), pages 273-287, December.
    2. Wheatcroft, Edward, 2021. "Evaluating probabilistic forecasts of football matches: the case against the ranked probability score," LSE Research Online Documents on Economics 111494, London School of Economics and Political Science, LSE Library.
    3. Harvey, A., 2021. "Score-driven time series models," Cambridge Working Papers in Economics 2133, Faculty of Economics, University of Cambridge.
    4. Francisco Blasques & Vladim'ir Hol'y & Petra Tomanov'a, 2018. "Zero-Inflated Autoregressive Conditional Duration Model for Discrete Trade Durations with Excessive Zeros," Papers 1812.07318, arXiv.org, revised Jan 2022.
    5. Vladimír Holý & Jan Zouhar, 2022. "Modelling time‐varying rankings with autoregressive and score‐driven dynamics," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 71(5), pages 1427-1450, November.
    6. Raffaele Mattera, 2023. "Forecasting binary outcomes in soccer," Annals of Operations Research, Springer, vol. 325(1), pages 115-134, June.
    7. da Costa, Igor Barbosa & Marinho, Leandro Balby & Pires, Carlos Eduardo Santos, 2022. "Forecasting football results and exploiting betting markets: The case of “both teams to score”," International Journal of Forecasting, Elsevier, vol. 38(3), pages 895-909.
    8. Butler, David & Butler, Robert & Eakins, John, 2021. "Expert performance and crowd wisdom: Evidence from English Premier League predictions," European Journal of Operational Research, Elsevier, vol. 288(1), pages 170-182.
    9. Marc Garnica-Caparrós & Daniel Memmert & Fabian Wunderlich, 2022. "Artificial data in sports forecasting: a simulation framework for analysing predictive models in sports," Information Systems and e-Business Management, Springer, vol. 20(3), pages 551-580, September.
    10. Wunderlich, Fabian & Memmert, Daniel, 2020. "Are betting returns a useful measure of accuracy in (sports) forecasting?," International Journal of Forecasting, Elsevier, vol. 36(2), pages 713-722.
    11. Vladim'ir Hol'y, 2022. "An Intraday GARCH Model for Discrete Price Changes and Irregularly Spaced Observations," Papers 2211.12376, arXiv.org, revised Sep 2023.
    12. Lasek, Jan & Gagolewski, Marek, 2021. "Interpretable sports team rating models based on the gradient descent algorithm," International Journal of Forecasting, Elsevier, vol. 37(3), pages 1061-1071.
    13. Robert C. Smit & Francesco Ravazzolo & Luca Rossini, 2020. "Dynamic Bayesian forecasting of English Premier League match results with the Skellam distribution," BEMPS - Bozen Economics & Management Paper Series BEMPS72, Faculty of Economics and Management at the Free University of Bozen.
    14. P. Gorgi & S. J. Koopman & R. Lit, 2023. "Estimation of final standings in football competitions with a premature ending: the case of COVID-19," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 107(1), pages 233-250, March.
    15. Giovanni Angelini & Giuseppe Cavaliere & Enzo D'Innocenzo & Luca De Angelis, 2022. "Time-Varying Poisson Autoregression," Papers 2207.11003, arXiv.org.
    16. Kung, Ko-Lun & Liu, I-Chien & Wang, Chou-Wen, 2021. "Modeling and pricing longevity derivatives using Skellam distribution," Insurance: Mathematics and Economics, Elsevier, vol. 99(C), pages 341-354.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Lasek, Jan & Gagolewski, Marek, 2021. "Interpretable sports team rating models based on the gradient descent algorithm," International Journal of Forecasting, Elsevier, vol. 37(3), pages 1061-1071.
    2. da Costa, Igor Barbosa & Marinho, Leandro Balby & Pires, Carlos Eduardo Santos, 2022. "Forecasting football results and exploiting betting markets: The case of “both teams to score”," International Journal of Forecasting, Elsevier, vol. 38(3), pages 895-909.
    3. Szczecinski Leszek, 2022. "G-Elo: generalization of the Elo algorithm by modeling the discretized margin of victory," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 18(1), pages 1-14, March.
    4. Gross, Johannes & Rebeggiani, Luca, 2018. "Chance or Ability? The Efficiency of the Football Betting Market Revisited," MPRA Paper 87230, University Library of Munich, Germany.
    5. Raffaele Mattera, 2023. "Forecasting binary outcomes in soccer," Annals of Operations Research, Springer, vol. 325(1), pages 115-134, June.
    6. Baboota, Rahul & Kaur, Harleen, 2019. "Predictive analysis and modelling football results using machine learning approach for English Premier League," International Journal of Forecasting, Elsevier, vol. 35(2), pages 741-755.
    7. Siem Jan Koopman & Rutger Lit & André Lucas & Anne Opschoor, 2018. "Dynamic discrete copula models for high‐frequency stock price changes," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 33(7), pages 966-985, November.
    8. Wunderlich, Fabian & Memmert, Daniel, 2020. "Are betting returns a useful measure of accuracy in (sports) forecasting?," International Journal of Forecasting, Elsevier, vol. 36(2), pages 713-722.
    9. Wheatcroft, Edward, 2020. "A profitable model for predicting the over/under market in football," LSE Research Online Documents on Economics 103712, London School of Economics and Political Science, LSE Library.
    10. Tobias Eckernkemper & Bastian Gribisch, 2021. "Intraday conditional value at risk: A periodic mixed‐frequency generalized autoregressive score approach," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 40(5), pages 883-910, August.
    11. Wheatcroft, Edward, 2020. "A profitable model for predicting the over/under market in football," International Journal of Forecasting, Elsevier, vol. 36(3), pages 916-932.
    12. Blasques, Francisco & van Brummelen, Janneke & Koopman, Siem Jan & Lucas, André, 2022. "Maximum likelihood estimation for score-driven models," Journal of Econometrics, Elsevier, vol. 227(2), pages 325-346.
    13. J. James Reade & Carl Singleton & Alasdair Brown, 2021. "Evaluating strange forecasts: The curious case of football match scorelines," Scottish Journal of Political Economy, Scottish Economic Society, vol. 68(2), pages 261-285, May.
    14. Hassanniakalager, Arman & Sermpinis, Georgios & Stasinakis, Charalampos & Verousis, Thanos, 2020. "A conditional fuzzy inference approach in forecasting," European Journal of Operational Research, Elsevier, vol. 283(1), pages 196-216.
    15. Andreas Heuer & Oliver Rubner, 2014. "Optimizing the Prediction Process: From Statistical Concepts to the Case Study of Soccer," PLOS ONE, Public Library of Science, vol. 9(9), pages 1-9, September.
    16. Gorgi, Paolo & Koopman, Siem Jan & Li, Mengheng, 2019. "Forecasting economic time series using score-driven dynamic models with mixed-data sampling," International Journal of Forecasting, Elsevier, vol. 35(4), pages 1735-1747.
    17. Nguyen, Hoang & Javed, Farrukh, 2023. "Dynamic relationship between Stock and Bond returns: A GAS MIDAS copula approach," Journal of Empirical Finance, Elsevier, vol. 73(C), pages 272-292.
    18. Marc Garnica-Caparrós & Daniel Memmert & Fabian Wunderlich, 2022. "Artificial data in sports forecasting: a simulation framework for analysing predictive models in sports," Information Systems and e-Business Management, Springer, vol. 20(3), pages 551-580, September.
    19. Buccheri, Giuseppe & Corsi, Fulvio & Flandoli, Franco & Livieri, Giulia, 2021. "The continuous-time limit of score-driven volatility models," Journal of Econometrics, Elsevier, vol. 221(2), pages 655-675.
    20. Catania, Leopoldo & Proietti, Tommaso, 2020. "Forecasting volatility with time-varying leverage and volatility of volatility effects," International Journal of Forecasting, Elsevier, vol. 36(4), pages 1301-1317.

    More about this item

    Keywords

    Football; Forecasting; Score-driven models; Bivariate Poisson; Skellam; Ordered probit; Probabilistic loss function;
    All these keywords.

    JEL classification:

    • C32 - Mathematical and Quantitative Methods - - Multiple or Simultaneous Equation Models; Multiple Variables - - - Time-Series Models; Dynamic Quantile Regressions; Dynamic Treatment Effect Models; Diffusion Processes; State Space Models

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:tin:wpaper:20170062. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Tinbergen Office +31 (0)10-4088900 (email available below). General contact details of provider: https://edirc.repec.org/data/tinbenl.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.