A new methodology for generating and combining statistical forecasting models to enhance competitive event prediction
Forecasting methods are routinely employed to predict the outcome of competitive events (CEs) and to shed light on the factors that influence participants’ winning prospects (e.g., in sports events, political elections). Combining statistical models’ forecasts, shown to be highly successful in other settings, has been neglected in CE prediction. Two particular difficulties arise when developing model-based composite forecasts of CE outcomes: the intensity of rivalry among contestants, and the strength/diversity trade-off among individual models. To overcome these challenges we propose a range of surrogate measures of event outcome to construct a heterogeneous set of base forecasts. To effectively extract the complementary information concealed within these predictions, we develop a novel pooling mechanism which accounts for competition among contestants: a stacking paradigm integrating conditional logit regression and log-likelihood-ratio-based forecast selection. Empirical results using data related to horseracing events demonstrate that: (i) base model strength and diversity are important when combining model-based predictions for CEs; (ii) average-based pooling, commonly employed elsewhere, may not be appropriate for CEs (because average-based pooling exclusively focuses on strength); and (iii) the proposed stacking ensemble provides statistically and economically accurate forecasts. These results have important implications for regulators of betting markets associated with CEs and in particular for the accurate assessment of market efficiency.
If you experience problems downloading a file, check if you have the proper application to view it first. In case of further problems read the IDEAS help page. Note that these files are not on the IDEAS site. Please be patient as the files may be large.
As the access to this document is restricted, you may want to look for a different version under "Related research" (further below) or search for a different version of it.
References listed on IDEAS
Please report citation or reference errors to , or , if you are the registered author of the cited work, log in to your RePEc Author Service profile, click on "citations" and make appropriate adjustments.:
- Marco Aiolfi & Carlos Capistrán & Allan Timmermann, 2010.
CREATES Research Papers
2010-21, School of Economics and Management, University of Aarhus.
- Thomas D. Russell & Everett E. Adam, Jr., 1987. "An Empirical Evaluation of Alternative Forecasting Combinations," Management Science, INFORMS, vol. 33(10), pages 1267-1276, October.
- Diebold, Francis X., 1989.
"Forecast combination and encompassing: Reconciling two divergent literatures,"
International Journal of Forecasting,
Elsevier, vol. 5(4), pages 589-592.
- Francis X. Diebold, 1989. "Forecast combination and encompassing: reconciling two divergent literatures," Finance and Economics Discussion Series 80, Board of Governors of the Federal Reserve System (U.S.).
- Roy Batchelor & Pami Dua, 1995. "Forecaster Diversity and the Benefits of Combining Forecasts," Management Science, INFORMS, vol. 41(1), pages 68-75, January.
- Abellán, Joaquín & Masegosa, Andrés R., 2010. "An ensemble method using credal decision trees," European Journal of Operational Research, Elsevier, vol. 205(1), pages 218-226, August.
- Johnnie E. V. Johnson & Owen Jones & Leilei Tang, 2006. "Exploring Decision Makers' Use of Price Information in a Speculative Market," Management Science, INFORMS, vol. 52(6), pages 897-908, June.
- Vaughan Williams, Leighton, 1999. "Information Efficiency in Betting Markets: A Survey," Bulletin of Economic Research, Wiley Blackwell, vol. 51(1), pages 1-30, January.
- Losey, Robert L & Talbott, John C, Jr, 1980. " Back on the Track with the Efficient Markets Hypothesis," Journal of Finance, American Finance Association, vol. 35(4), pages 1039-43, September.
- Spyros Makridakis & Robert L. Winkler, 1983. "Averages of Forecasts: Some Empirical Results," Management Science, INFORMS, vol. 29(9), pages 987-996, September.
- Crafts, Nicholas F R, 1985. "Some Evidence of Insider Knowledge in Horse Race Betting in Britain," Economica, London School of Economics and Political Science, vol. 52(27), pages 295-304, August.
- White, Edna M. & Dattero, Ronald & Flores, Benito, 1992. "Combining vector forecasts to predict thoroughbred horse race outcomes," International Journal of Forecasting, Elsevier, vol. 8(4), pages 595-611, December.
- Lessmann, Stefan & Sung, Ming-Chien & Johnson, Johnnie E.V., 2009. "Identifying winners of competitive events: A SVM-based classification model for horserace prediction," European Journal of Operational Research, Elsevier, vol. 196(2), pages 569-577, July.
- Grant, Andrew & Johnstone, David, 2010. "Finding profitable forecast combinations using probability scoring rules," International Journal of Forecasting, Elsevier, vol. 26(3), pages 498-510, July.
- van Wezel, Michiel & Potharst, Rob, 2007. "Improved customer choice predictions using ensemble methods," European Journal of Operational Research, Elsevier, vol. 181(1), pages 436-452, August.
- Figlewski, Stephen, 1979. "Subjective Information and Market Efficiency in a Betting Market," Journal of Political Economy, University of Chicago Press, vol. 87(1), pages 75-88, February.
- David Johnstone, 2007. "Economic Darwinism: Who has the Best Probabilities?," Theory and Decision, Springer, vol. 62(1), pages 47-96, February.
- Nikolaos Vlastakis & George Dotsis & Raphael N. Markellos, 2009. "How efficient is the European football betting market? Evidence from arbitrage and trading strategies," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 28(5), pages 426-444.
- Dixon, Mark J. & Pope, Peter F., 2004. "The value of statistical forecasts in the UK association football betting market," International Journal of Forecasting, Elsevier, vol. 20(4), pages 697-711.
- West, David & Mangiameli, Paul & Rampal, Rohit & West, Vivian, 2005. "Ensemble strategies for a medical diagnostic decision support system: A breast cancer diagnosis application," European Journal of Operational Research, Elsevier, vol. 162(2), pages 532-551, April.
- Granger, Clive W. J. & Jeon, Yongil, 2004. "Thick modeling," Economic Modelling, Elsevier, vol. 21(2), pages 323-343, March.
- Jose, Victor Richmond R. & Winkler, Robert L., 2008. "Simple robust averages of forecasts: Some empirical results," International Journal of Forecasting, Elsevier, vol. 24(1), pages 163-169.
- Robert L. Winkler & Roy M. Poses, 1993. "Evaluating and Combining Physicians' Probabilities of Survival in an Intensive Care Unit," Management Science, INFORMS, vol. 39(12), pages 1526-1543, December.
- David C. Schmittlein & Jinho Kim & Donald G. Morrison, 1990. "Combining Forecasts: Operational Adjustments to Theoretically Optimal Rules," Management Science, INFORMS, vol. 36(9), pages 1044-1056, September.
- Finlay, Steven, 2011. "Multiple classifier architectures and their application to credit risk assessment," European Journal of Operational Research, Elsevier, vol. 210(2), pages 368-378, April.
- Liu, Yu-Hsin, 2011. "Incorporating scatter search and threshold accepting in finding maximum likelihood estimates for the multinomial probit model," European Journal of Operational Research, Elsevier, vol. 211(1), pages 130-138, May.
- Paleologo, Giuseppe & Elisseeff, André & Antonini, Gianluca, 2010. "Subagging for credit scoring models," European Journal of Operational Research, Elsevier, vol. 201(2), pages 490-499, March.
- de Menezes, Lilian M. & W. Bunn, Derek & Taylor, James W., 2000. "Review of guidelines for the use of combined forecasts," European Journal of Operational Research, Elsevier, vol. 120(1), pages 190-204, January.
- Ming-Chien Sung & Johnnie E.V. Johnson, 2007. "Comparing the Effectiveness of One- and Two-step Conditional Logit Models for Predicting Outcomes in a Speculative Market," Journal of Prediction Markets, University of Buckingham Press, vol. 1(1), pages 43-59, February.
- Clemen, Robert T., 1989. "Combining forecasts: A review and annotated bibliography," International Journal of Forecasting, Elsevier, vol. 5(4), pages 559-583.
- Ruth N. Bolton & Randall G. Chapman, 1986. "Searching for Positive Returns at the Track: A Multinomial Logit Model for Handicapping Horse Races," Management Science, INFORMS, vol. 32(8), pages 1040-1060, August.
- Snyder, Wayne W, 1978. "Horse Racing: Testing the Efficient Markets Model," Journal of Finance, American Finance Association, vol. 33(4), pages 1109-18, September.
- Lessmann, Stefan & Sung, Ming-Chien & Johnson, Johnnie E.V., 2010. "Alternative methods of predicting competitive events: An application in horserace betting markets," International Journal of Forecasting, Elsevier, vol. 26(3), pages 518-536, July.
- D. J. Johnstone, 2011. "Economic Interpretation of Probabilities Estimated by Maximum Likelihood or Score," Management Science, INFORMS, vol. 57(2), pages 308-314, February.
- M. Sung & J. E. V. Johnson, 2010. "Revealing Weak-Form Inefficiency in a Market for State Contingent Claims: The Importance of Market Ecology, Modelling Procedures and Investment Strategies," Economica, London School of Economics and Political Science, vol. 77(305), pages 128-147, 01.
- Leitch, Gordon & Tanner, J Ernest, 1991. "Economic Forecast Evaluation: Profits versus the Conventional Error Measures," American Economic Review, American Economic Association, vol. 81(3), pages 580-90, June.
- Martin Spann & Bernd Skiera, 2009. "Sports forecasting: a comparison of the forecast accuracy of prediction markets, betting odds and tipsters," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 28(1), pages 55-72.
When requesting a correction, please mention this item's handle: RePEc:eee:ejores:v:218:y:2012:i:1:p:163-174. See general information about how to correct material in RePEc.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: (Zhang, Lei)
If references are entirely missing, you can add them using this form.