
Good identification, meet good data

Authors

  • Dillon, Andrew
  • Karlan, Dean
  • Udry, Christopher
  • Zinman, Jonathan

Abstract

Causal inference lies at the heart of social science, and the 2019 Nobel Prize in Economics highlights the value of randomized variation for identifying causal effects and mechanisms. But causal inference cannot rely on randomized variation alone; it also requires good data. Yet the data-generating process has received less consideration from economists. We provide a simple framework to clarify how research inputs affect data quality and discuss several such inputs, including interviewer selection and training, survey design, and investments in linking across multiple data sources. More investment in research on the data quality production function would considerably improve causal inference generally, and poverty alleviation specifically.

Suggested Citation

  • Dillon, Andrew & Karlan, Dean & Udry, Christopher & Zinman, Jonathan, 2020. "Good identification, meet good data," World Development, Elsevier, vol. 127(C).
  • Handle: RePEc:eee:wdevel:v:127:y:2020:i:c:s0305750x19304450
    DOI: 10.1016/j.worlddev.2019.104796

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0305750X19304450
    Download Restriction: Full text for ScienceDirect subscribers only


    References listed on IDEAS

    1. James J. Heckman & Tomas Jagelka & Tim Kautz, 2019. "Some Contributions of Economics to the Study of Personality," Working Papers 2019-069, Human Capital and Economic Opportunity Working Group.
    2. Bruce D. Meyer & Nikolas Mittag, 2019. "Using Linked Survey and Administrative Data to Better Measure Income: Implications for Poverty, Program Effectiveness, and Holes in the Safety Net," American Economic Journal: Applied Economics, American Economic Association, vol. 11(2), pages 176-204, April.
    3. Joachim De Weerdt & John Gibson & Kathleen Beegle, 2020. "What Can We Learn from Experimenting with Survey Methods?," Annual Review of Resource Economics, Annual Reviews, vol. 12(1), pages 431-447, October.
    4. Diva Dhar & Tarun Jain & Seema Jayachandran, 2022. "Reshaping Adolescents' Gender Attitudes: Evidence from a School-Based Experiment in India," American Economic Review, American Economic Association, vol. 112(3), pages 899-927, March.
    5. Hyslop, Dean R & Imbens, Guido W, 2001. "Bias from Classical and Other Forms of Measurement Error," Journal of Business & Economic Statistics, American Statistical Association, vol. 19(4), pages 475-481, October.
    6. Lori Beaman & Niall Keleher & Jeremy Magruder, 2018. "Do Job Networks Disadvantage Women? Evidence from a Recruitment Experiment in Malawi," Journal of Labor Economics, University of Chicago Press, vol. 36(1), pages 121-157.
    7. Athey, Susan & Imbens, Guido W., 2019. "Machine Learning Methods Economists Should Know About," Research Papers 3776, Stanford University, Graduate School of Business.
    8. Bound, John & Brown, Charles & Mathiowetz, Nancy, 2001. "Measurement error in survey data," Handbook of Econometrics, in: J.J. Heckman & E.E. Leamer (ed.), Handbook of Econometrics, edition 1, volume 5, chapter 59, pages 3705-3843, Elsevier.
    9. Susan Athey & Guido W. Imbens, 2019. "Machine Learning Methods That Economists Should Know About," Annual Review of Economics, Annual Reviews, vol. 11(1), pages 685-725, August.

    Citations



    Cited by:

    1. Zezza, Alberto & Mcgee, Kevin Robert & Wollburg, Philip Randolph & Assefa, Thomas Woldu & Gourlay, Sydney, 2022. "From Necessity to Opportunity: Lessons for Integrating Phone and In-Person Data Collection for Agricultural Statistics in a Post-Pandemic World," Policy Research Working Paper Series 10168, The World Bank.
    2. Fiala, Nathan & Masselus, Lise, 2022. "Whom to ask? Testing respondent effects in household surveys," Ruhr Economic Papers 935, RWI - Leibniz-Institut für Wirtschaftsforschung, Ruhr-University Bochum, TU Dortmund University, University of Duisburg-Essen.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Lidia Ceriani & Vladimir Hlasny & Paolo Verme, 2021. "Bottom Incomes and the Measurement of Poverty: A Brief Assessment of the Literature," Working Papers 589, ECINEQ, Society for the Study of Economic Inequality.
    2. Amitabh Chandra & Courtney Coile & Corina Mommaerts, 2023. "What Can Economics Say about Alzheimer's Disease?," Journal of Economic Literature, American Economic Association, vol. 61(2), pages 428-470, June.
    3. Dang, Hai-Anh & Kilic, Talip & Hlasny, Vladimir & Abanokova, Kseniya & Carletto, Calogero, 2024. "Using Survey-to-Survey Imputation to Fill Poverty Data Gaps at a Low Cost: Evidence from a Randomized Survey Experiment," IZA Discussion Papers 16792, Institute of Labor Economics (IZA).
    4. Zhang, Han, 2021. "How Using Machine Learning Classification as a Variable in Regression Leads to Attenuation Bias and What to Do About It," SocArXiv 453jk, Center for Open Science.
    5. Carletto, Calogero & Dillon, Andrew S. & Zezza, Alberto, 2021. "Agricultural Data Collection to Minimize Measurement Error and Maximize Coverage," Policy Research Working Paper Series 9745, The World Bank.
    6. Ay, Jean-Sauveur & Le Gallo, Julie, 2021. "The Signaling Values of Nested Wine Names," Working Papers 321851, American Association of Wine Economists.
    7. Tsang, Andrew, 2021. "Uncovering Heterogeneous Regional Impacts of Chinese Monetary Policy," MPRA Paper 110703, University Library of Munich, Germany.
    8. Kyle Colangelo & Ying-Ying Lee, 2019. "Double debiased machine learning nonparametric inference with continuous treatments," CeMMAP working papers CWP54/19, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    9. Daniel Goller, 2023. "Analysing a built-in advantage in asymmetric darts contests using causal machine learning," Annals of Operations Research, Springer, vol. 325(1), pages 649-679, June.
    10. Rodríguez-Vargas, Adolfo, 2020. "Forecasting Costa Rican inflation with machine learning methods," Latin American Journal of Central Banking (previously Monetaria), Elsevier, vol. 1(1).
    11. Jesus Fernandez-Villaverde, 2020. "Simple Rules for a Complex World with Artificial Intelligence," PIER Working Paper Archive 20-010, Penn Institute for Economic Research, Department of Economics, University of Pennsylvania.
    12. Blankenship, Brian & Aklin, Michaël & Urpelainen, Johannes & Nandan, Vagisha, 2022. "Jobs for a just transition: Evidence on coal job preferences from India," Energy Policy, Elsevier, vol. 165(C).
    13. Andrei Dubovik & Adam Elbourne & Bram Hendriks & Mark Kattenberg, 2022. "Forecasting World Trade Using Big Data and Machine Learning Techniques," CPB Discussion Paper 441, CPB Netherlands Bureau for Economic Policy Analysis.
    14. Mark Kattenberg & Bas Scheer & Jurre Thiel, 2023. "Causal forests with fixed effects for treatment effect heterogeneity in difference-in-differences," CPB Discussion Paper 452, CPB Netherlands Bureau for Economic Policy Analysis.
    15. Michael C Knaus, 2022. "Double machine learning-based programme evaluation under unconfoundedness [Econometric methods for program evaluation]," The Econometrics Journal, Royal Economic Society, vol. 25(3), pages 602-627.
    16. Arthur Charpentier & Romuald Élie & Carl Remlinger, 2023. "Reinforcement Learning in Economics and Finance," Computational Economics, Springer;Society for Computational Economics, vol. 62(1), pages 425-462, June.
    17. Yigit Aydede & Jan Ditzen, 2022. "Identifying the regional drivers of influenza-like illness in Nova Scotia with dominance analysis," Papers 2212.06684, arXiv.org.
    18. Mona Aghdaee & Bonny Parkinson & Kompal Sinha & Yuanyuan Gu & Rajan Sharma & Emma Olin & Henry Cutler, 2022. "An examination of machine learning to map non‐preference based patient reported outcome measures to health state utility values," Health Economics, John Wiley & Sons, Ltd., vol. 31(8), pages 1525-1557, August.
    19. Hector F. Calvo-Pardo & Tullio Mancini & Jose Olmo, 2020. "Neural Network Models for Empirical Finance," JRFM, MDPI, vol. 13(11), pages 1-22, October.
    20. Cockx, Bart & Lechner, Michael & Bollens, Joost, 2023. "Priority to unemployed immigrants? A causal machine learning evaluation of training in Belgium," Labour Economics, Elsevier, vol. 80(C).
