IDEAS home Printed from https://ideas.repec.org/a/bpj/jqsprt/v14y2018i2p65-79n3.html
   My bibliography  Save this article

On the dependency of soccer scores – a sparse bivariate Poisson model for the UEFA European football championship 2016

Author

Listed:
  • Groll Andreas

    (Georg-August-Universität Göttingen, Department of Statistics and Econometrics, Göttingen, Niedersachsen, Germany; and Technische Universität Dortmund, Fakultät Statistik, Dortmund, Nordrhein-Westfalen, Germany)

  • Kneib Thomas

    (Georg-August-Universität Göttingen, Department of Statistics and Econometrics, Göttingen, Niedersachsen, Germany)

  • Mayr Andreas

    (Friedrich-Alexander-Universität Erlangen-Nürnberg, Institut für Medizininformatik, Biometrie und Epidemiologie, Erlangen, Bavaria, Germany; and Institut für Medizinische Biometrie, Informatik und Epidemiologie, WG Statistical Methods in Epidemiology, University Hospital Bonn, Bonn, Germany)

  • Schauberger Gunther

    (Ludwig-Maximilians-University, Department of Statistics, Munich, Bavaria, Germany; and Technical University of Munich, Department of Sport and Health Sciences, Chair of Epidemiology, Munich, Bavaria, Germany)

Abstract

When analyzing and modeling the results of soccer matches, one important aspect is to account for the correct dependence of the scores of two competing teams. Several studies have found that, marginally, these scores are moderately negatively correlated. Even though many approaches that analyze the results of soccer matches are based on two (conditionally) independent pairwise Poisson distributions, a certain amount of (mostly negative) dependence between the scores of the competing teams can simply be induced by the inclusion of covariate information of both teams in a suitably structured linear predictor. One objective of this article is to analyze if this type of modeling is appropriate or if additional explicit modeling of the dependence structure for the joint score of a soccer match needs to be taken into account. Therefore, a specific bivariate Poisson model for the two numbers of goals scored by national teams competing in UEFA European football championship matches is fitted to all matches from the three previous European championships, including covariate information of both competing teams. A boosting approach is then used to select the relevant covariates. Based on the estimates, the tournament is simulated 1,000,000 times to obtain winning probabilities for all participating national teams.

Suggested Citation

  • Groll Andreas & Kneib Thomas & Mayr Andreas & Schauberger Gunther, 2018. "On the dependency of soccer scores – a sparse bivariate Poisson model for the UEFA European football championship 2016," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 14(2), pages 65-79, June.
  • Handle: RePEc:bpj:jqsprt:v:14:y:2018:i:2:p:65-79:n:3
    DOI: 10.1515/jqas-2017-0067
    as

    Download full text from publisher

    File URL: https://doi.org/10.1515/jqas-2017-0067
    Download Restriction: For access to full text, subscription to the journal or payment for the individual article is required.

    File URL: https://libkey.io/10.1515/jqas-2017-0067?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Siem Jan Koopman & Rutger Lit, 2015. "A dynamic bivariate Poisson model for analysing and forecasting match results in the English Premier League," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 178(1), pages 167-186, January.
    2. Andreas Mayr & Nora Fenske & Benjamin Hofner & Thomas Kneib & Matthias Schmid, 2012. "Generalized additive models for location, scale and shape for high dimensional data—a flexible approach based on boosting," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 61(3), pages 403-427, May.
    3. Groll Andreas & Abedieh Jasmin, 2013. "Spain retains its title and sets a new record – generalized linear mixed models on European football championships," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 9(1), pages 51-66, March.
    4. Gneiting, Tilmann & Raftery, Adrian E., 2007. "Strictly Proper Scoring Rules, Prediction, and Estimation," Journal of the American Statistical Association, American Statistical Association, vol. 102, pages 359-378, March.
    5. Schmid, Matthias & Hothorn, Torsten, 2008. "Boosting additive models using component-wise P-Splines," Computational Statistics & Data Analysis, Elsevier, vol. 53(2), pages 298-311, December.
    6. Achim Zeileis & Christoph Leitner & Kurt Hornik, 2016. "Predictive Bookmaker Consensus Model for the UEFA Euro 2016," Working Papers 2016-15, Faculty of Economics and Statistics, Universität Innsbruck.
    7. Boshnakov, Georgi & Kharrat, Tarak & McHale, Ian G., 2017. "A bivariate Weibull count model for forecasting association football scores," International Journal of Forecasting, Elsevier, vol. 33(2), pages 458-466.
    8. Hofner, Benjamin & Mayr, Andreas & Schmid, Matthias, 2016. "gamboostLSS: An R Package for Model Building and Variable Selection in the GAMLSS Framework," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 74(i01).
    9. D Dyte & S R Clarke, 2000. "A ratings based Poisson model for World Cup soccer simulation," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 51(8), pages 993-998, August.
    10. Groll Andreas & Schauberger Gunther & Tutz Gerhard, 2015. "Prediction of major international soccer tournaments based on team-specific regularized Poisson regression: An application to the FIFA World Cup 2014," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 11(2), pages 97-115, June.
    11. Constantinou Anthony Costa & Fenton Norman Elliott, 2012. "Solving the Problem of Inadequate Scoring Rules for Assessing Probabilistic Football Forecast Models," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 8(1), pages 1-14, March.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Daniel Goller & Michael C. Knaus & Michael Lechner & Gabriel Okasa, 2021. "Predicting match outcomes in football by an Ordered Forest estimator," Chapters, in: Ruud H. Koning & Stefan Kesenne (ed.), A Modern Guide to Sports Economics, chapter 22, pages 335-355, Edward Elgar Publishing.
    2. Luke S. Benz & Michael J. Lopez, 2023. "Estimating the change in soccer’s home advantage during the Covid-19 pandemic using bivariate Poisson regression," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 107(1), pages 205-232, March.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Lasek, Jan & Gagolewski, Marek, 2021. "Interpretable sports team rating models based on the gradient descent algorithm," International Journal of Forecasting, Elsevier, vol. 37(3), pages 1061-1071.
    2. Baboota, Rahul & Kaur, Harleen, 2019. "Predictive analysis and modelling football results using machine learning approach for English Premier League," International Journal of Forecasting, Elsevier, vol. 35(2), pages 741-755.
    3. Groll Andreas & Schauberger Gunther & Tutz Gerhard, 2015. "Prediction of major international soccer tournaments based on team-specific regularized Poisson regression: An application to the FIFA World Cup 2014," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 11(2), pages 97-115, June.
    4. Hendrik van der Wurp & Andreas Groll, 2023. "Introducing LASSO-type penalisation to generalised joint regression modelling for count data," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 107(1), pages 127-151, March.
    5. Gunther Schauberger & Andreas Groll & Gerhard Tutz, 2018. "Analysis of the importance of on-field covariates in the German Bundesliga," Journal of Applied Statistics, Taylor & Francis Journals, vol. 45(9), pages 1561-1578, July.
    6. Pearson Mitchell & Jr Glen Livingston & King Robert, 2020. "An exploration of predictive football modelling," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 16(1), pages 27-39, March.
    7. Nicolai Hans & Nadja Klein & Florian Faschingbauer & Michael Schneider & Andreas Mayr, 2023. "Boosting distributional copula regression," Biometrics, The International Biometric Society, vol. 79(3), pages 2298-2310, September.
    8. Riccardo De Bin & Vegard Grødem Stikbakke, 2023. "A boosting first-hitting-time model for survival analysis in high-dimensional settings," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 29(2), pages 420-440, April.
    9. Singleton, Carl & Reade, J. James & Brown, Alasdair, 2020. "Going with your gut: The (In)accuracy of forecast revisions in a football score prediction game," Journal of Behavioral and Experimental Economics (formerly The Journal of Socio-Economics), Elsevier, vol. 89(C).
    10. Hopfensitz, Astrid & Mantilla, Cesar, 2019. "Emotional expressions by sports teams: An analysis of World Cup soccer player portraits," Journal of Economic Psychology, Elsevier, vol. 75(PB).
    11. Xin Fang & Bo Fang & Chunfang Wang & Tian Xia & Matteo Bottai & Fang Fang & Yang Cao, 2019. "Comparison of Frequentist and Bayesian Generalized Additive Models for Assessing the Association between Daily Exposure to Fine Particles and Respiratory Mortality: A Simulation Study," IJERPH, MDPI, vol. 16(5), pages 1-20, March.
    12. Hubáček, Ondřej & Šír, Gustav, 2023. "Beating the market with a bad predictive model," International Journal of Forecasting, Elsevier, vol. 39(2), pages 691-719.
    13. Wheatcroft Edward, 2021. "Evaluating probabilistic forecasts of football matches: the case against the ranked probability score," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 17(4), pages 273-287, December.
    14. Wheatcroft, Edward, 2021. "Evaluating probabilistic forecasts of football matches: the case against the ranked probability score," LSE Research Online Documents on Economics 111494, London School of Economics and Political Science, LSE Library.
    15. da Costa, Igor Barbosa & Marinho, Leandro Balby & Pires, Carlos Eduardo Santos, 2022. "Forecasting football results and exploiting betting markets: The case of “both teams to score”," International Journal of Forecasting, Elsevier, vol. 38(3), pages 895-909.
    16. Wunderlich, Fabian & Memmert, Daniel, 2020. "Are betting returns a useful measure of accuracy in (sports) forecasting?," International Journal of Forecasting, Elsevier, vol. 36(2), pages 713-722.
    17. Hans Eetvelde & Lars Magnus Hvattum & Christophe Ley, 2023. "The Probabilistic Final Standing Calculator: a fair stochastic tool to handle abruptly stopped football seasons," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 107(1), pages 251-269, March.
    18. Chater, Mario & Arrondel, Luc & Gayant, Jean-Pascal & Laslier, Jean-François, 2021. "Fixing match-fixing: Optimal schedules to promote competitiveness," European Journal of Operational Research, Elsevier, vol. 294(2), pages 673-683.
    19. Kharrat, Tarak & McHale, Ian G. & Peña, Javier López, 2020. "Plus–minus player ratings for soccer," European Journal of Operational Research, Elsevier, vol. 283(2), pages 726-736.
    20. Szczecinski Leszek, 2022. "G-Elo: generalization of the Elo algorithm by modeling the discretized margin of victory," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 18(1), pages 1-14, March.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bpj:jqsprt:v:14:y:2018:i:2:p:65-79:n:3. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Peter Golla (email available below). General contact details of provider: https://www.degruyter.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.