Can out-of-sample forecast comparisons help prevent overfitting?

  • Todd E. Clark

    (Federal Reserve Bank of Kansas City, Kansas City, USA)

This paper shows that out-of-sample forecast comparisons can help prevent data mining-induced overfitting. The basic results are drawn from simulations of a simple Monte Carlo design and a real data-based design similar to those used in some previous studies. In each simulation, a general-to-specific procedure is used to arrive at a model. If the selected specification includes any of the candidate explanatory variables, forecasts from the model are compared to forecasts from a benchmark model that is nested within the selected model. In particular, the competing forecasts are tested for equal MSE and encompassing. The simulations indicate most of the post-sample tests are roughly correctly sized. Moreover, the tests have relatively good power, although some are consistently more powerful than others. The paper concludes with an application, modelling quarterly US inflation. Copyright © 2004 John Wiley & Sons, Ltd.
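The comparison described in the abstract can be sketched in code. The abstract does not say which test statistics are used, so the sketch below assumes two statistics commonly applied to nested models: an F-type test of equal MSE (MSE-F) and an encompassing test (ENC-NEW). The function names, the expanding-window estimation scheme, and the simulated data are all illustrative assumptions, not the paper's design. The statistics have nonstandard limiting distributions, so no critical values or p-values are computed here.

```python
import numpy as np

def recursive_forecast_errors(y, X_small, X_big, first_forecast):
    """One-step-ahead forecast errors from expanding-window OLS.

    Row t of each regressor matrix must contain only information
    available before y[t] is observed (e.g. lagged predictors).
    """
    e_small, e_big = [], []
    for t in range(first_forecast, len(y)):
        for X, errs in ((X_small, e_small), (X_big, e_big)):
            # Estimate on observations 0..t-1, then forecast y[t].
            beta, *_ = np.linalg.lstsq(X[:t], y[:t], rcond=None)
            errs.append(y[t] - X[t] @ beta)
    return np.array(e_small), np.array(e_big)

def mse_f(e_small, e_big):
    """Equal-MSE statistic: P * (MSE_small - MSE_big) / MSE_big."""
    P = len(e_small)
    mse_small, mse_big = np.mean(e_small**2), np.mean(e_big**2)
    return P * (mse_small - mse_big) / mse_big

def enc_new(e_small, e_big):
    """Encompassing statistic: P * mean(e_small*(e_small - e_big)) / MSE_big."""
    P = len(e_small)
    return P * np.mean(e_small**2 - e_small * e_big) / np.mean(e_big**2)

# Hypothetical data: y depends on one (already lagged) predictor x.
rng = np.random.default_rng(0)
T = 400
x = rng.standard_normal(T)
y = 1.0 * x + rng.standard_normal(T)

X_small = np.ones((T, 1))                  # benchmark: intercept only
X_big = np.column_stack([np.ones(T), x])   # selected model nests the benchmark

e_small, e_big = recursive_forecast_errors(y, X_small, X_big, first_forecast=200)
print("MSE-F:  ", mse_f(e_small, e_big))
print("ENC-NEW:", enc_new(e_small, e_big))
```

Large positive values of either statistic favor the larger model over the nested benchmark. In an overfitting check of the kind the abstract describes, the in-sample specification search would be run first and these statistics computed only on the held-out forecasts.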

File URL: http://hdl.handle.net/10.1002/for.904
File Function: Link to full text; subscription required
Download Restriction: no

Article provided by John Wiley & Sons, Ltd. in its journal Journal of Forecasting.

Volume (Year): 23 (2004)
Issue (Month): 2
Pages: 115-139

Handle: RePEc:jof:jforec:v:23:y:2004:i:2:p:115-139
Contact details of provider: Web page: http://www3.interscience.wiley.com/cgi-bin/jhome/2966

References listed on IDEAS
  1. Amano, Robert A. & van Norden, Simon, 1995. "Terms of trade and real exchange rates: the Canadian evidence," Journal of International Money and Finance, Elsevier, vol. 14(1), pages 83-104, February.
  2. Todd E. Clark & Michael W. McCracken, 2000. "Tests of Equal Forecast Accuracy and Encompassing for Nested Models," Econometric Society World Congress 2000 Contributed Papers 0319, Econometric Society.
  3. Evans, Martin D. & Lyons, Richard K., 1999. "Order Flow and Exchange Rate Dynamics," Research Program in Finance, Working Paper Series qt0dh1c16w, Research Program in Finance, Institute for Business and Economic Research, UC Berkeley.
  4. Julia Campos & Neil R. Ericsson, 2000. "Constructive data mining: modeling consumers' expenditure in Venezuela," International Finance Discussion Papers 663, Board of Governors of the Federal Reserve System (U.S.).
  5. Kevin D. Hoover & Stephen J. Perez, 1999. "Data mining reconsidered: encompassing and the general-to-specific approach to specification search," Econometrics Journal, Royal Economic Society, vol. 2(2), pages 167-191.
  6. Krolzig, Hans-Martin & Hendry, David F., 2001. "Computer automation of general-to-specific model selection procedures," Journal of Economic Dynamics and Control, Elsevier, vol. 25(6-7), pages 831-866, June.
  7. Meese, Richard A. & Rogoff, Kenneth, 1983. "Empirical exchange rate models of the seventies : Do they fit out of sample?," Journal of International Economics, Elsevier, vol. 14(1-2), pages 3-24, February.
  8. James H. Stock & Mark W. Watson, 1998. "Diffusion Indexes," NBER Working Papers 6702, National Bureau of Economic Research, Inc.
  9. Thomas Knox & James H. Stock & Mark W. Watson, 2000. "Empirical Bayes Forecasts of One Time Series Using Many Predictors," Econometric Society World Congress 2000 Contributed Papers 1421, Econometric Society.
  10. Kenneth D. West & Michael W. McCracken, 1998. "Regression-Based Tests of Predictive Ability," NBER Technical Working Papers 0226, National Bureau of Economic Research, Inc.
  11. Diebold, Francis X & Mariano, Roberto S, 1995. "Comparing Predictive Accuracy," Journal of Business & Economic Statistics, American Statistical Association, vol. 13(3), pages 253-63, July.
  12. Chinn, Menzie D. & Meese, Richard A., 1995. "Banking on currency forecasts: How predictable is change in money?," Journal of International Economics, Elsevier, vol. 38(1-2), pages 161-178, February.
  13. McCracken, Michael W., 2007. "Asymptotics for out of sample tests of Granger causality," Journal of Econometrics, Elsevier, vol. 140(2), pages 719-752, October.
  14. David F. Hendry & Hans-Martin Krolzig, 1999. "Improving on 'Data mining reconsidered' by K.D. Hoover and S.J. Perez," Econometrics Journal, Royal Economic Society, vol. 2(2), pages 202-219.
  15. Kenneth D. West, 1994. "Asymptotic Inference About Predictive Ability," Macroeconomics 9410002, EconWPA.
  16. Lovell, Michael C, 1983. "Data Mining," The Review of Economics and Statistics, MIT Press, vol. 65(1), pages 1-12, February.
  17. Timothy Cogley, 1998. "A simple adaptive measure of core inflation," Working Papers in Applied Economic Theory 98-06, Federal Reserve Bank of San Francisco.
  18. Granger, Clive W. J. & King, Maxwell L. & White, Halbert, 1995. "Comments on testing economic theories and the use of model selection criteria," Journal of Econometrics, Elsevier, vol. 67(1), pages 173-187, May.
  19. Denton, Frank T, 1985. "Data Mining as an Industry," The Review of Economics and Statistics, MIT Press, vol. 67(1), pages 124-27, February.
  20. Norman R. Swanson, 2000. "An Out of Sample Test for Granger Causality," Econometric Society World Congress 2000 Contributed Papers 0362, Econometric Society.
  21. Ashley, R & Granger, C W J & Schmalensee, R, 1980. "Advertising and Aggregate Consumption: An Analysis of Causality," Econometrica, Econometric Society, vol. 48(5), pages 1149-67, July.
  22. Halbert White, 2000. "A Reality Check for Data Snooping," Econometrica, Econometric Society, vol. 68(5), pages 1097-1126, September.
  23. Bruce E. Hansen, 1999. "Discussion of 'Data mining reconsidered'," Econometrics Journal, Royal Economic Society, vol. 2(2), pages 192-201.
  24. Harvey, David I & Leybourne, Stephen J & Newbold, Paul, 1998. "Tests for Forecast Encompassing," Journal of Business & Economic Statistics, American Statistical Association, vol. 16(2), pages 254-59, April.

This information is provided to you by IDEAS at the Research Division of the Federal Reserve Bank of St. Louis using RePEc data.