A new methodology for generating and combining statistical forecasting models to enhance competitive event prediction
Forecasting methods are routinely employed to predict the outcome of competitive events (CEs) and to shed light on the factors that influence participants’ winning prospects (e.g., in sports events, political elections). Combining statistical models’ forecasts, shown to be highly successful in other settings, has been neglected in CE prediction. Two particular difficulties arise when developing model-based composite forecasts of CE outcomes: the intensity of rivalry among contestants, and the strength/diversity trade-off among individual models. To overcome these challenges we propose a range of surrogate measures of event outcome to construct a heterogeneous set of base forecasts. To effectively extract the complementary information concealed within these predictions, we develop a novel pooling mechanism which accounts for competition among contestants: a stacking paradigm integrating conditional logit regression and log-likelihood-ratio-based forecast selection. Empirical results using data related to horseracing events demonstrate that: (i) base model strength and diversity are important when combining model-based predictions for CEs; (ii) average-based pooling, commonly employed elsewhere, may not be appropriate for CEs (because average-based pooling exclusively focuses on strength); and (iii) the proposed stacking ensemble provides statistically and economically accurate forecasts. These results have important implications for regulators of betting markets associated with CEs and in particular for the accurate assessment of market efficiency.
If you experience problems downloading a file, check if you have the proper application to view it first. In case of further problems read the IDEAS help page. Note that these files are not on the IDEAS site. Please be patient as the files may be large.
As the access to this document is restricted, you may want to look for a different version under "Related research" (further below) or search for a different version of it.
References listed on IDEAS
Please report citation or reference errors to , or , if you are the registered author of the cited work, log in to your RePEc Author Service profile, click on "citations" and make appropriate adjustments.:
- Lessmann, Stefan & Sung, Ming-Chien & Johnson, Johnnie E.V., 2010. "Alternative methods of predicting competitive events: An application in horserace betting markets," International Journal of Forecasting, Elsevier, vol. 26(3), pages 518-536, July.
- Robert L. Winkler & Roy M. Poses, 1993. "Evaluating and Combining Physicians' Probabilities of Survival in an Intensive Care Unit," Management Science, INFORMS, vol. 39(12), pages 1526-1543, December.
- Ruth N. Bolton & Randall G. Chapman, 1986. "Searching for Positive Returns at the Track: A Multinomial Logit Model for Handicapping Horse Races," Management Science, INFORMS, vol. 32(8), pages 1040-1060, August.
- Lessmann, Stefan & Sung, Ming-Chien & Johnson, Johnnie E.V., 2009. "Identifying winners of competitive events: A SVM-based classification model for horserace prediction," European Journal of Operational Research, Elsevier, vol. 196(2), pages 569-577, July.
- Francis X. Diebold, 1989.
"Forecast combination and encompassing: reconciling two divergent literatures,"
Finance and Economics Discussion Series
80, Board of Governors of the Federal Reserve System (U.S.).
- Diebold, Francis X., 1989. "Forecast combination and encompassing: Reconciling two divergent literatures," International Journal of Forecasting, Elsevier, vol. 5(4), pages 589-592.
- Marco Aiolfi & Carlos Capistrán & Allan Timmermann, 2010.
2010-04, Banco de México.
- Johnnie E. V. Johnson & Owen Jones & Leilei Tang, 2006. "Exploring Decision Makers' Use of Price Information in a Speculative Market," Management Science, INFORMS, vol. 52(6), pages 897-908, June.
- Finlay, Steven, 2011. "Multiple classifier architectures and their application to credit risk assessment," European Journal of Operational Research, Elsevier, vol. 210(2), pages 368-378, April.
- Paleologo, Giuseppe & Elisseeff, André & Antonini, Gianluca, 2010. "Subagging for credit scoring models," European Journal of Operational Research, Elsevier, vol. 201(2), pages 490-499, March.
- Martin Spann & Bernd Skiera, 2009. "Sports forecasting: a comparison of the forecast accuracy of prediction markets, betting odds and tipsters," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 28(1), pages 55-72.
- Dixon, Mark J. & Pope, Peter F., 2004. "The value of statistical forecasts in the UK association football betting market," International Journal of Forecasting, Elsevier, vol. 20(4), pages 697-711.
- Vaughan Williams, Leighton, 1999. "Information Efficiency in Betting Markets: A Survey," Bulletin of Economic Research, Wiley Blackwell, vol. 51(1), pages 1-30, January.
- M. Sung & J. E. V. Johnson, 2010. "Revealing Weak-Form Inefficiency in a Market for State Contingent Claims: The Importance of Market Ecology, Modelling Procedures and Investment Strategies," Economica, London School of Economics and Political Science, vol. 77(305), pages 128-147, 01.
- Abellán, Joaquín & Masegosa, Andrés R., 2010. "An ensemble method using credal decision trees," European Journal of Operational Research, Elsevier, vol. 205(1), pages 218-226, August.
- Clemen, Robert T., 1989. "Combining forecasts: A review and annotated bibliography," International Journal of Forecasting, Elsevier, vol. 5(4), pages 559-583.
- Spyros Makridakis & Robert L. Winkler, 1983. "Averages of Forecasts: Some Empirical Results," Management Science, INFORMS, vol. 29(9), pages 987-996, September.
- Liu, Yu-Hsin, 2011. "Incorporating scatter search and threshold accepting in finding maximum likelihood estimates for the multinomial probit model," European Journal of Operational Research, Elsevier, vol. 211(1), pages 130-138, May.
- Thomas D. Russell & Everett E. Adam, Jr., 1987. "An Empirical Evaluation of Alternative Forecasting Combinations," Management Science, INFORMS, vol. 33(10), pages 1267-1276, October.
- Roy Batchelor & Pami Dua, 1995. "Forecaster Diversity and the Benefits of Combining Forecasts," Management Science, INFORMS, vol. 41(1), pages 68-75, January.
- David C. Schmittlein & Jinho Kim & Donald G. Morrison, 1990. "Combining Forecasts: Operational Adjustments to Theoretically Optimal Rules," Management Science, INFORMS, vol. 36(9), pages 1044-1056, September.
- Jose, Victor Richmond R. & Winkler, Robert L., 2008. "Simple robust averages of forecasts: Some empirical results," International Journal of Forecasting, Elsevier, vol. 24(1), pages 163-169.
- Ming-Chien Sung & Johnnie E.V. Johnson, 2007. "Comparing the Effectiveness of One- and Two-step Conditional Logit Models for Predicting Outcomes in a Speculative Market," Journal of Prediction Markets, University of Buckingham Press, vol. 1(1), pages 43-59, February.
- de Menezes, Lilian M. & W. Bunn, Derek & Taylor, James W., 2000. "Review of guidelines for the use of combined forecasts," European Journal of Operational Research, Elsevier, vol. 120(1), pages 190-204, January.
- van Wezel, Michiel & Potharst, Rob, 2007. "Improved customer choice predictions using ensemble methods," European Journal of Operational Research, Elsevier, vol. 181(1), pages 436-452, August.
- Crafts, Nicholas F R, 1985. "Some Evidence of Insider Knowledge in Horse Race Betting in Britain," Economica, London School of Economics and Political Science, vol. 52(27), pages 295-304, August.
- Grant, Andrew & Johnstone, David, 2010. "Finding profitable forecast combinations using probability scoring rules," International Journal of Forecasting, Elsevier, vol. 26(3), pages 498-510, July.
- White, Edna M. & Dattero, Ronald & Flores, Benito, 1992. "Combining vector forecasts to predict thoroughbred horse race outcomes," International Journal of Forecasting, Elsevier, vol. 8(4), pages 595-611, December.
- Figlewski, Stephen, 1979. "Subjective Information and Market Efficiency in a Betting Market," Journal of Political Economy, University of Chicago Press, vol. 87(1), pages 75-88, February.
- David Johnstone, 2007. "Economic Darwinism: Who has the Best Probabilities?," Theory and Decision, Springer, vol. 62(1), pages 47-96, February.
- West, David & Mangiameli, Paul & Rampal, Rohit & West, Vivian, 2005. "Ensemble strategies for a medical diagnostic decision support system: A breast cancer diagnosis application," European Journal of Operational Research, Elsevier, vol. 162(2), pages 532-551, April.
- D. J. Johnstone, 2011. "Economic Interpretation of Probabilities Estimated by Maximum Likelihood or Score," Management Science, INFORMS, vol. 57(2), pages 308-314, February.
- Leitch, Gordon & Tanner, J Ernest, 1991. "Economic Forecast Evaluation: Profits versus the Conventional Error Measures," American Economic Review, American Economic Association, vol. 81(3), pages 580-90, June.
- Losey, Robert L & Talbott, John C, Jr, 1980. " Back on the Track with the Efficient Markets Hypothesis," Journal of Finance, American Finance Association, vol. 35(4), pages 1039-43, September.
- Snyder, Wayne W, 1978. "Horse Racing: Testing the Efficient Markets Model," Journal of Finance, American Finance Association, vol. 33(4), pages 1109-18, September.
- Granger, Clive W. J. & Jeon, Yongil, 2004. "Thick modeling," Economic Modelling, Elsevier, vol. 21(2), pages 323-343, March.
- Nikolaos Vlastakis & George Dotsis & Raphael N. Markellos, 2009. "How efficient is the European football betting market? Evidence from arbitrage and trading strategies," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 28(5), pages 426-444.
When requesting a correction, please mention this item's handle: RePEc:eee:ejores:v:218:y:2012:i:1:p:163-174. See general information about how to correct material in RePEc.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: (Zhang, Lei)
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
If references are entirely missing, you can add them using this form.
If the full references list an item that is present in RePEc, but the system did not link to it, you can help with this form.
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your profile, as there may be some citations waiting for confirmation.
Please note that corrections may take a couple of weeks to filter through the various RePEc services.