This file is part of IDEAS, which uses RePEc data


[ Papers | Articles | Software | Books | Chapters | Authors | Institutions | JEL Classification | NEP reports | Search | New papers by email | Author registration | Rankings | Volunteers | FAQ | Blog | Help! ]

Three attitudes towards data mining

Author info | Abstract | Publisher info | Download info | Related research | Statistics
Author Info
Kevin D. Hoover, Stephen J. Perez

Additional information is available for the following registered author(s):

Abstract

'Data mining' refers to a broad class of activities that have in common, a search over different ways to process or package data statistically or econometrically with the purpose of making the final presentation meet certain design criteria. We characterize three attitudes toward data mining: first, that it is to be avoided and, if it is engaged in, that statistical inferences must be adjusted to account for it; second, that it is inevitable and that the only results of any interest are those that transcend the variety of alternative data mined specifications (a view associated with Leamer's extreme-bounds analysis); and third, that it is essential and that the only hope we have of using econometrics to uncover true economic relationships is to be found in the intelligent mining of data. The first approach confuses considerations of sampling distribution and considerations of epistemic warrant and, reaches an unnecessarily hostile attitude toward data mining. The second approach relies on a notion of robustness that has little relationship to truth: there is no good reason to expect a true specification to be robust alternative specifications. Robustness is not, in general, a carrier of epistemic warrant. The third approach is operationalized in the general-to-specific search methodology of the LSE school of econometrics. Its success demonstrates that intelligent data mining is an important element in empirical investigation in economics.

Download Info
To download:

If you experience problems downloading a file, check if you have the proper application to view it first. Information about this may be contained in the File-Format links below. In case of further problems read the IDEAS help page. Note that these files are not on the IDEAS site. Please be patient as the files may be large.

File URL: http://taylorandfrancis.metapress.com/link.asp?target=contribution&id=Q2FJVDYA8FYHEGR4
File Format: text/html
File Function:
Download Restriction: Access to full text is restricted to subscribers.

As the access to this document is restricted, you may want to look for a different version under "Related research" (further below) or search for a different version of it.

Publisher Info
Article provided by Taylor and Francis Journals in its journal Journal of Economic Methodology.

Volume (Year): 7 (2000)
Issue (Month): 2 (June)
Pages: 195-210
Download reference. The following formats are available: HTML (with abstract), plain text (with abstract), BibTeX, RIS (EndNote, RefMan, ProCite), ReDIF
Handle: RePEc:taf:jecmet:v:7:y:2000:i:2:p:195-210

Contact details of provider:
Web page: http://taylorandfrancis.metapress.com/link.asp?target=journal&id=104715

Order Information:
Web: http://www.tandf.co.uk/journals/subscription.html

For technical questions regarding this item, or to correct its listing, contact: (Christopher F. Baum).

Related research
Keywords: Data Mining Extreme-BOUNDS Analysis Specification Search General-TO-SPECIFIC Lse Econometrics;

Other versions of this item:

References listed on IDEAS
Please report citation or reference errors to , or , if you are the registered author of the cited work, log in to your RePEc Author Service profile, click on "citations" and make appropriate adjustments.:
  1. Hendry, David F & Starr, Ross M, 1993. "The Demand for M1 in the USA: A Reply," Economic Journal, Royal Economic Society, vol. 103(420), pages 1158-69, September. [Downloadable!] (restricted)
  2. Hendry, D.F. & Richard, J.-F., 1987. "Recent developments in the theory of encompassing," CORE Discussion Papers 1987022, Université catholique de Louvain, Center for Operations Research and Econometrics (CORE).
  3. Thomas Mayer, . "Data Mining: A Reconsideration," Department of Economics 97-15, California Davis - Department of Economics. [Downloadable!]
    Other versions:
  4. Kevin D. Hoover & Stephen J. Perez, 1999. "Data mining reconsidered: encompassing and the general-to-specific approach to specification search," Econometrics Journal, Royal Economic Society, vol. 2(2), pages 167-191.
    Other versions:
  5. Lovell, Michael C, 1983. "Data Mining," The Review of Economics and Statistics, MIT Press, vol. 65(1), pages 1-12, February. [Downloadable!] (restricted)
  6. Mizon, Grayham E & Richard, Jean-Francois, 1986. "The Encompassing Principle and Its Application to Testing Non-nested Hypotheses," Econometrica, Econometric Society, vol. 54(3), pages 657-78, May. [Downloadable!] (restricted)
  7. Leamer, Edward E & Leonard, Herman B, 1983. "Reporting the Fragility of Regression Estimates," The Review of Economics and Statistics, MIT Press, vol. 65(2), pages 306-17, May. [Downloadable!] (restricted)
  8. Phillips, P C B, 1988. "Reflections on Econometric Methodology," The Economic Record, The Economic Society of Australia, vol. 64(187), pages 344-59, December.
    Other versions:
  9. Neil R. Ericsson & Julia Campos & Hong-Anh Tran, 1991. "PC-give and David Hendry's econometric methodology," International Finance Discussion Papers 406, Board of Governors of the Federal Reserve System (U.S.). [Downloadable!]
  10. repec:cup:etheor:v:6:y:1990:i:2:p:171-261 is not listed on IDEAS
  11. Mayer, Thomas, 1980. "Economics as a Hard Science: Realistic Goal or Wishful Thinking?," Economic Inquiry, Oxford University Press, vol. 18(2), pages 165-78, April.
  12. Hess, Gregory D. & Jones, Christopher S. & Porter, Richard D., 1998. "The predictive failure of the Baba, Hendry and Starr model of M1," Journal of Economics and Business, Elsevier, vol. 50(6), pages 477-507, November. [Downloadable!] (restricted)
Full references

Cited by:
(explanations, Please report citation or reference errors to , or , if you are the registered author of the cited work, log in to your RePEc Author Service profile, click on "citations" and make appropriate adjustments.)

  1. Jim Woodward, 2006. "Some varieties of robustness," Journal of Economic Methodology, Taylor and Francis Journals, vol. 13(2), pages 219-240, June. [Downloadable!] (restricted)
  2. Mayer, Thomas, 2001. "Misinterpreting a Failure to Disconfirm as a Confirmation: A Recurrent Misreading of Significance Tests," Working Papers 01-8, University of California at Davis, Department of Economics. [Downloadable!]
    Other versions:
  3. Clinton A. Greene, 2000. "I am not, nor have I ever been a member of a data-mining discipline," Journal of Economic Methodology, Taylor and Francis Journals, vol. 7(2), pages 217-230, June. [Downloadable!] (restricted)
  4. Roger E. Backhouse, Mary S. Morgan, 2000. "Introduction: is data mining a methodological problem?," Journal of Economic Methodology, Taylor and Francis Journals, vol. 7(2), pages 171-181, June. [Downloadable!] (restricted)
  5. Kevin D. Hoover & Mark V. Siegler, 2005. "Sound and Fury: McCloskey and Significance Testing in Economics," Econometrics 0511018, EconWPA. [Downloadable!]
    Other versions:
  6. Aris Spanos, 2000. "Revisiting data mining: ‘hunting’ with or without a license," Journal of Economic Methodology, Taylor and Francis Journals, vol. 7(2), pages 231-264, June. [Downloadable!] (restricted)
  7. Alastair R. Hall, Fernanda P. M. Peixe, 2000. "Data mining and the selection of instruments," Journal of Economic Methodology, Taylor and Francis Journals, vol. 7(2), pages 265-277, June. [Downloadable!] (restricted)
  8. Hsiang-Ke Chao, 2005. "A misconception of the semantic conception of econometrics?," Journal of Economic Methodology, Taylor and Francis Journals, vol. 12(1), pages 125-135, March. [Downloadable!] (restricted)
  9. Steven Cook, 2001. "Observations on the practice of data-mining: comments on the JEM symposium," Journal of Economic Methodology, Taylor and Francis Journals, vol. 8(3), pages 415-419, November. [Downloadable!] (restricted)
Statistics
Access and download statistics

Did you know? IDEAS also indexes books.

This page was last updated on 2009-11-14.


This information is provided to you by IDEAS at the Department of Economics, College of Liberal Arts and Sciences, University of Connecticut using RePEc data on a server sponsored by the Society for Economic Dynamics.