This file is part of IDEAS, which uses RePEc data


[ Papers | Articles | Software | Books | Chapters | Authors | Institutions | JEL Classification | NEP reports | Search | New papers by email | Author registration | Rankings | Volunteers | FAQ | Blog | Help! ]

I am not, nor have I ever been a member of a data-mining discipline

Author info | Abstract | Publisher info | Download info | Related research | Statistics
Author Info
Clinton A. Greene
Abstract

This paper argues classical statistics and standard econometrics are based on a desire to meet scientific standards for accumulating reliable knowledge. Science requires two inputs, mining of existing data for inspiration and new or 'out-of-sample' data for predictive testing. Avoidance of data-mining is neither possible nor desirable. In economics out-of-sample data is relatively scarce, so the production process should intensively exploit the existing data. But the two inputs should be thought of as complements rather than substitutes. And we neglect the importance of out-of-sample testing in the production of reliable knowledge. Avoidance of data-mining is not a substitute for tests conducted in new samples. The problem is not that data-mining corrupts the process, the problem is our collective neglect of out-of-sample encompassing, stability and forecast tests. So the data-mining issue diverts us from the crucial margin.

Download Info
To download:

If you experience problems downloading a file, check if you have the proper application to view it first. Information about this may be contained in the File-Format links below. In case of further problems read the IDEAS help page. Note that these files are not on the IDEAS site. Please be patient as the files may be large.

File URL: http://taylorandfrancis.metapress.com/link.asp?target=contribution&id=EFCHNV4NDUXGCN41
File Format: text/html
File Function:
Download Restriction: Access to full text is restricted to subscribers.

As the access to this document is restricted, you may want to look for a different version under "Related research" (further below) or search for a different version of it.

Publisher Info
Article provided by Taylor and Francis Journals in its journal Journal of Economic Methodology.

Volume (Year): 7 (2000)
Issue (Month): 2 (June)
Pages: 217-230
Download reference. The following formats are available: HTML (with abstract), plain text (with abstract), BibTeX, RIS (EndNote, RefMan, ProCite), ReDIF
Handle: RePEc:taf:jecmet:v:7:y:2000:i:2:p:217-230

Contact details of provider:
Web page: http://taylorandfrancis.metapress.com/link.asp?target=journal&id=104715

Order Information:
Web: http://www.tandf.co.uk/journals/subscription.html

For technical questions regarding this item, or to correct its listing, contact: (Christopher F. Baum).

Related research
Keywords: Repeated Testing Stability Time Series Experimental Data-MINING Applied Methods Prediction;

Other versions of this item:

References listed on IDEAS
Please report citation or reference errors to , or , if you are the registered author of the cited work, log in to your RePEc Author Service profile, click on "citations" and make appropriate adjustments.:
  1. Salmon, Mark H, 1982. "Error Correction Mechanisms," Economic Journal, Royal Economic Society, vol. 92(367), pages 615-29, September. [Downloadable!] (restricted)
  2. Thomas Mayer, . "Data Mining: A Reconsideration," Department of Economics 97-15, California Davis - Department of Economics. [Downloadable!]
    Other versions:
  3. Salmon, Mark, 1982. "Error Correction Mechanisms," The Warwick Economics Research Paper Series (TWERPS) 199, University of Warwick, Department of Economics.
  4. Kevin D. Hoover, Stephen J. Perez, 2000. "Three attitudes towards data mining," Journal of Economic Methodology, Taylor and Francis Journals, vol. 7(2), pages 195-210, June. [Downloadable!] (restricted)
  5. Gregory D. Hess & Christopher S. Jones & Richard D. Porter, 1994. "The predictive failure of the Baba, Hendry and Starr model of the demand for M1 in the United States," Research Working Paper 94-06, Federal Reserve Bank of Kansas City.
    Other versions:
Full references

Statistics
Access and download statistics

Did you know? Springer Verlag was the first commercial publisher to be listed on RePEc.

This page was last updated on 2009-11-14.


This information is provided to you by IDEAS at the Department of Economics, College of Liberal Arts and Sciences, University of Connecticut using RePEc data on a server sponsored by the Society for Economic Dynamics.