IDEAS home Printed from https://ideas.repec.org/p/ecl/riceco/14-018.html
   My bibliography  Save this paper

To Hold Out or Not to Hold Out

Author

Listed:
  • Schorfheide, Frank

    (University of PA)

  • Wolpin, Kenneth I.

    (Rice University and University of PA)

Abstract

A recent literature has developed that combines two prominent empirical approaches to ex ante policy evaluation: randomized controlled trials (RCT) and structural estimation. The RCT provides a "gold standard" estimate of a particular treatment, but only of that treatment. Structural estimation provides the capability to extrapolate beyond the experimental treatment, but is based on untestable assumptions and is subject to structural data mining. Combining the approaches by holding out from the structural estimation exercise either the treatment or the control sample allows for external validation of the underlying behavioral model. Although intuitively appealing, this holdout methodology is not well grounded. For instance, it is easy to show that it is suboptimal from a Bayesian perspective. Using a stylized representation of a randomized controlled trial, we provide a formal rationale for the use of a holdout sample in an environment in which data mining poses an impediment to the implementation of the ideal Bayesian analysis and a numerical illustration of the potential benefits of holdout samples.

Suggested Citation

  • Schorfheide, Frank & Wolpin, Kenneth I., 2014. "To Hold Out or Not to Hold Out," Working Papers 14-018, Rice University, Department of Economics.
  • Handle: RePEc:ecl:riceco:14-018
    as

    Download full text from publisher

    File URL: http://economics.rice.edu/rise/working-papers/hold-out-or-not-hold-out
    Download Restriction: no
    ---><---

    Other versions of this item:

    References listed on IDEAS

    as
    1. Esther Duflo & Rema Hanna & Stephen P. Ryan, 2012. "Incentives Work: Getting Teachers to Come to School," American Economic Review, American Economic Association, vol. 102(4), pages 1241-1278, June.
    2. Alvaro Sandroni, 2003. "The reproducible properties of correct forecasts," International Journal of Game Theory, Springer;Game Theory Society, vol. 32(1), pages 151-159, December.
    3. Christopher Ferrall, 2012. "Explaining and Forecasting Results of the Self-sufficiency Project," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 79(4), pages 1495-1526.
    4. Lo, Andrew W & MacKinlay, A Craig, 1990. "Data-Snooping Biases in Tests of Financial Asset Pricing Models," Review of Financial Studies, Society for Financial Studies, vol. 3(3), pages 431-467.
    5. Kenneth I. Wolpin & Petra E. Todd, 2006. "Assessing the Impact of a School Subsidy Program in Mexico: Using a Social Experiment to Validate a Dynamic Behavioral Model of Child Schooling and Fertility," American Economic Review, American Economic Association, vol. 96(5), pages 1384-1417, December.
    6. Lamont, Owen A., 2002. "Macroeconomic forecasts and microeconomic forecasters," Journal of Economic Behavior & Organization, Elsevier, vol. 48(3), pages 265-280, July.
    7. Wolpin, Kenneth I., 2013. "The Limits of Inference without Theory," MIT Press Books, The MIT Press, edition 1, volume 1, number 0262019086, December.
    8. David Laster & Paul Bennett & In Sun Geoum, 1999. "Rational Bias in Macroeconomic Forecasts," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 114(1), pages 293-318.
    9. Hidehiko Ichimura & Christopher R. Taber, 2000. "Direct Estimation of Policy Impacts," NBER Technical Working Papers 0254, National Bureau of Economic Research, Inc.
    10. Petra E. Todd & Kenneth I. Wolpin, 2008. "Ex Ante Evaluation of Social Programs," Annals of Economics and Statistics, GENES, issue 91-92, pages 263-291.
    11. repec:adr:anecst:y:2008:i:91-92:p:13 is not listed on IDEAS
    12. Frank Schorfheide & Kenneth I. Wolpin, 2012. "On the Use of Holdout Samples for Model Selection," American Economic Review, American Economic Association, vol. 102(3), pages 477-481, May.
    13. Halbert White, 2000. "A Reality Check for Data Snooping," Econometrica, Econometric Society, vol. 68(5), pages 1097-1126, September.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Banghua Zhu & Sai Praneeth Karimireddy & Jiantao Jiao & Michael I. Jordan, 2023. "Online Learning in a Creator Economy," Papers 2305.11381, arXiv.org.
    2. Jia, Zhiyang & Vattø, Trine Engh, 2021. "Predicting the path of labor supply responses when state dependence matters," Labour Economics, Elsevier, vol. 71(C).
    3. John Rust, 2014. "The Limits of Inference with Theory: A Review of Wolpin (2013)," Journal of Economic Literature, American Economic Association, vol. 52(3), pages 820-850, September.
    4. Banghua Zhu & Stephen Bates & Zhuoran Yang & Yixin Wang & Jiantao Jiao & Michael I. Jordan, 2022. "The Sample Complexity of Online Contract Design," Papers 2211.05732, arXiv.org, revised May 2023.
    5. de Bresser, Jochem, 2021. "Evaluating the Accuracy of Counterfactuals The Role of Heterogeneous Expectations in Life Cycle Models," Discussion Paper 2021-034, Tilburg University, Center for Economic Research.
    6. Maibom, Jonas, 2021. "The Danish Labor Market Experiments: Methods and Findings," Nationaløkonomisk tidsskrift, Nationaløkonomisk Forening, vol. 2021(1), pages 1-21.
    7. Sebastian Galiani & Juan Pantano, 2021. "Structural Models: Inception and Frontier," NBER Working Papers 28698, National Bureau of Economic Research, Inc.
    8. de Bresser, Jochem, 2021. "Evaluating the Accuracy of Counterfactuals The Role of Heterogeneous Expectations in Life Cycle Models," Other publications TiSEM a7e2b4d8-fed0-4e86-926f-d, Tilburg University, School of Economics and Management.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Sebastian Galiani & Juan Pantano, 2021. "Structural Models: Inception and Frontier," NBER Working Papers 28698, National Bureau of Economic Research, Inc.
    2. Deaton, Angus & Cartwright, Nancy, 2018. "Understanding and misunderstanding randomized controlled trials," Social Science & Medicine, Elsevier, vol. 210(C), pages 2-21.
    3. Maibom, Jonas, 2021. "The Danish Labor Market Experiments: Methods and Findings," Nationaløkonomisk tidsskrift, Nationaløkonomisk Forening, vol. 2021(1), pages 1-21.
    4. Stéphane Bonhomme & Martin Weidner, 2020. "Minimizing Sensitivity to Model Misspecification," CeMMAP working papers CWP37/20, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    5. R. Vincent Pohl, 2018. "Medicaid And The Labor Supply Of Single Mothers: Implications For Health Care Reform," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 59(3), pages 1283-1313, August.
    6. de Bresser, Jochem, 2021. "Evaluating the Accuracy of Counterfactuals The Role of Heterogeneous Expectations in Life Cycle Models," Other publications TiSEM a7e2b4d8-fed0-4e86-926f-d, Tilburg University, School of Economics and Management.
    7. de Bresser, Jochem, 2021. "Evaluating the Accuracy of Counterfactuals The Role of Heterogeneous Expectations in Life Cycle Models," Discussion Paper 2021-034, Tilburg University, Center for Economic Research.
    8. St'ephane Bonhomme & Martin Weidner, 2018. "Minimizing Sensitivity to Model Misspecification," Papers 1807.02161, arXiv.org, revised Oct 2021.
    9. Nicholas-James Clavet & Jean-Yves Duclos & Guy Lacroix, 2012. "Fighting Poverty: Assessing the Effect of a Guaranteed Minimum Income Proposal in Québec," CIRANO Working Papers 2012s-36, CIRANO.
    10. Karthik Muralidharan & Paul Niehaus, 2017. "Experimentation at Scale," Journal of Economic Perspectives, American Economic Association, vol. 31(4), pages 103-124, Fall.
    11. Marinovic, Iván & Ottaviani, Marco & Sorensen, Peter, 2013. "Forecasters’ Objectives and Strategies," Handbook of Economic Forecasting, in: G. Elliott & C. Granger & A. Timmermann (ed.), Handbook of Economic Forecasting, edition 1, volume 2, chapter 0, pages 690-720, Elsevier.
    12. Papastamos, Dimitrios & Matysiak, George & Stevenson, Simon, 2015. "Assessing the accuracy and dispersion of real estate investment forecasts," International Review of Financial Analysis, Elsevier, vol. 42(C), pages 141-152.
    13. Thomas, R, 2011. "Using a Semiparametric Estimator to Forecast Education Outcomes in Nicaragua's Red de Proteccion Social," Health, Econometrics and Data Group (HEDG) Working Papers 11/02, HEDG, c/o Department of Economics, University of York.
    14. Ranjeeta Thomas, 2012. "Conditional Cash Transfers To Improve Education And Health: An Ex Ante Evaluation Of Red De Protección Social, Nicaragua," Health Economics, John Wiley & Sons, Ltd., vol. 21(10), pages 1136-1154, October.
    15. Orazio Attanasio & Sarah Cattan & Emla Fitzsimons & Costas Meghir & Marta Rubio-Codina, 2020. "Estimating the Production Function for Human Capital: Results from a Randomized Controlled Trial in Colombia," American Economic Review, American Economic Association, vol. 110(1), pages 48-85, January.
    16. Bajgrowicz, Pierre & Scaillet, Olivier, 2012. "Technical trading revisited: False discoveries, persistence tests, and transaction costs," Journal of Financial Economics, Elsevier, vol. 106(3), pages 473-491.
    17. Aleksejs Krecetovs & Pasquale Della Corte, 2016. "Macro uncertainty and currency premia," 2016 Meeting Papers 624, Society for Economic Dynamics.
    18. Pierre Cahuc & Stéphane Carcillo & Thomas Le Barbanchon, 2019. "The Effectiveness of Hiring Credits," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 86(2), pages 593-626.
    19. Roberto Rigobon & Brian Sack, 2008. "Noisy Macroeconomic Announcements, Monetary Policy, and Asset Prices," NBER Chapters, in: Asset Prices and Monetary Policy, pages 335-370, National Bureau of Economic Research, Inc.
    20. McCracken,M.W. & West,K.D., 2001. "Inference about predictive ability," Working papers 14, Wisconsin Madison - Social Systems.

    More about this item

    JEL classification:

    • C11 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Bayesian Analysis: General
    • C31 - Mathematical and Quantitative Methods - - Multiple or Simultaneous Equation Models; Multiple Variables - - - Cross-Sectional Models; Spatial Models; Treatment Effect Models; Quantile Regressions; Social Interaction Models
    • C52 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Model Evaluation, Validation, and Selection

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:ecl:riceco:14-018. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: the person in charge (email available below). General contact details of provider: https://edirc.repec.org/data/dericus.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.