IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2604.07181.html

Better Measurement or Larger Samples? Data Collection for Policy Learning with Unobserved Heterogeneity

Author

Listed:
  • Giacomo Opocher

Abstract

Empirical research shows that individuals' responses to treatments vary along latent characteristics, such as innate ability or motivation. Therefore, a policymaker seeking to maximize welfare may consider designing policies based on observed characteristics and estimated latent traits. I characterize how the estimates' precision affects the worst-case performance of policies deriving rate-sharp regret bounds for assignment rules that include or exclude them, highlighting new trade-offs with the policy space complexity. I then study how a policymaker can solve such trade-offs by designing tailored data collections and derive a sufficient condition for a collection plan to be minimax optimal. In an empirical application in development economics, I show that including a proxy for entrepreneurs' business skills in targeting cash transfers increases welfare by 5%, and halves the probability of generating welfare losses. Moreover, I estimate the optimal allocation of resources between improving the precision of the proxy via repeated measurements, and increasing sample size.

Suggested Citation

  • Giacomo Opocher, 2026. "Better Measurement or Larger Samples? Data Collection for Policy Learning with Unobserved Heterogeneity," Papers 2604.07181, arXiv.org, revised May 2026.
  • Handle: RePEc:arx:papers:2604.07181
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2604.07181
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Michael Gechter & Keisuke Hirano & Jean Lee & Mahreen Mahmud & Orville Mondal & Jonathan Morduch & Saravana Ravindran & Abu S. Shonchoy, 2024. "Selecting Experimental Sites for External Validity," Papers 2405.13241, arXiv.org.
    2. Shosei Sakaguchi, 2020. "Estimation of average treatment effects using panel data when treatment effect heterogeneity depends on unobserved fixed effects," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 35(3), pages 315-327, April.
    3. Davide Viviano & Jelena Bradic, 2024. "Fair Policy Targeting," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 119(545), pages 730-743, January.
    4. Edward Vytlacil & James J. Heckman, 2001. "Policy-Relevant Treatment Effects," American Economic Review, American Economic Association, vol. 91(2), pages 107-111, May.
    5. Toru Kitagawa & Aleksey Tetenov, 2021. "Equality-Minded Treatment Choice," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 39(2), pages 561-574, March.
    6. Eric Mbakop & Max Tabord‐Meehan, 2021. "Model Selection for Treatment Choice: Penalized Welfare Maximization," Econometrica, Econometric Society, vol. 89(2), pages 825-848, March.
    7. Alberto Abadie & Joshua Angrist & Guido Imbens, 2002. "Instrumental Variables Estimates of the Effect of Subsidized Training on the Quantiles of Trainee Earnings," Econometrica, Econometric Society, vol. 70(1), pages 91-117, January.
    8. Aristotelis Epanomeritakis & Davide Viviano, 2025. "Learning What to Learn: Experimental Design when Combining Experimental with Observational Evidence," Papers 2510.23434, arXiv.org, revised Dec 2025.
    9. Toru Kitagawa & Aleksey Tetenov, 2018. "Who Should Be Treated? Empirical Welfare Maximization Methods for Treatment Choice," Econometrica, Econometric Society, vol. 86(2), pages 591-616, March.
    10. James J. Heckman & Edward Vytlacil, 2005. "Structural Equations, Treatment Effects, and Econometric Policy Evaluation," Econometrica, Econometric Society, vol. 73(3), pages 669-738, May.
    11. Stefanie Stantcheva, 2023. "How to Run Surveys: A Guide to Creating Your Own Identifying Variation and Revealing the Invisible," Annual Review of Economics, Annual Reviews, vol. 15(1), pages 205-234, September.
    12. Jeff Dominitz & Charles F. Manski, 2017. "More Data or Better Data? A Statistical Decision Problem," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 84(4), pages 1583-1605.
    13. Emily Breza & Arun G. Chandrasekhar & Davide Viviano, 2025. "Generalizability with ignorance in mind: learning what we do (not) know for archetypes discovery," Papers 2501.13355, arXiv.org, revised Jul 2025.
    14. Gharad Bryan & Dean Karlan & Adam Osman, 2024. "Big Loans to Small Businesses: Predicting Winners and Losers in an Entrepreneurial Lending Experiment," American Economic Review, American Economic Association, vol. 114(9), pages 2825-2860, September.
    15. Jeffrey M. Wooldridge, 2005. "Fixed-Effects and Related Estimators for Correlated Random-Coefficient and Treatment-Effect Panel Data Models," The Review of Economics and Statistics, MIT Press, vol. 87(2), pages 385-390, May.
    16. Erik Brynjolfsson & Danielle Li & Lindsey Raymond, 2025. "Generative AI at Work," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 140(2), pages 889-942.
    17. J. Vernon Henderson & Adam Storeygard & David N. Weil, 2012. "Measuring Economic Growth from Outer Space," American Economic Review, American Economic Association, vol. 102(2), pages 994-1028, April.
    18. Charles F. Manski, 2004. "Statistical Treatment Rules for Heterogeneous Populations," Econometrica, Econometric Society, vol. 72(4), pages 1221-1246, July.
    19. Davide Viviano, 2019. "Policy Targeting under Network Interference," Papers 1906.10258, arXiv.org, revised Apr 2024.
    20. Reshmaan Hussam & Natalia Rigol & Benjamin N. Roth, 2022. "Targeting High Ability Entrepreneurs Using Community Information: Mechanism Design in the Field," American Economic Review, American Economic Association, vol. 112(3), pages 861-898, March.
    21. Dave Donaldson & Adam Storeygard, 2016. "The View from Above: Applications of Satellite Data in Economics," Journal of Economic Perspectives, American Economic Association, vol. 30(4), pages 171-198, Fall.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Emily Breza & Arun G. Chandrasekhar & Davide Viviano, 2025. "Generalizability with ignorance in mind: learning what we do (not) know for archetypes discovery," Papers 2501.13355, arXiv.org, revised Jul 2025.
    2. Aristotelis Epanomeritakis & Davide Viviano, 2025. "Learning What to Learn: Experimental Design when Combining Experimental with Observational Evidence," Papers 2510.23434, arXiv.org, revised Dec 2025.
    3. Toru Kitagawa & Weining Wang & Mengshan Xu, 2022. "Policy Choice in Time Series by Empirical Welfare Maximization," Papers 2205.03970, arXiv.org, revised Nov 2025.
    4. Toru Kitagawa & Jeff Rowley, 2024. "Bandit algorithms for policy learning: methods, implementation, and welfare-performance," The Japanese Economic Review, Springer, vol. 75(3), pages 407-447, July.
    5. Susan Athey & Stefan Wager, 2021. "Policy Learning With Observational Data," Econometrica, Econometric Society, vol. 89(1), pages 133-161, January.
    6. Zequn Jin & Gaoqian Xu & Xi Zheng & Yahong Zhou, 2025. "Policy Learning under Unobserved Confounding: A Robust and Efficient Approach," Papers 2507.20550, arXiv.org.
    7. Thomas M. Russell, 2020. "Policy Transforms and Learning Optimal Policies," Papers 2012.11046, arXiv.org.
    8. Toru Kitagawa & Weining Wang & Mengshan Xu, 2024. "Policy choice in time series by empirical welfare maximization," CeMMAP working papers 27/24, Institute for Fiscal Studies.
    9. Yuya Sasaki & Takuya Ura, 2020. "Welfare Analysis via Marginal Treatment Effects," Papers 2012.07624, arXiv.org.
    10. Crippa, Federico, 2025. "Regret analysis in threshold policy design," Journal of Econometrics, Elsevier, vol. 249(PB).
    11. Anders Bredahl Kock & David Preinerstorfer, 2024. "Regularizing Fairness in Optimal Policy Learning with Distributional Targets," Papers 2401.17909, arXiv.org, revised May 2025.
    12. Hugo Bodory & Federica Mascolo & Michael Lechner, 2024. "Enabling Decision-Making with the Modified Causal Forest: Policy Trees for Treatment Assignment," Papers 2406.02241, arXiv.org.
    13. Timothy Armstrong & Martin Weidner & Andrei Zeleneev, 2024. "Robust estimation and inference in panels with interactive fixed effects," IFS Working Papers WCWP28/24, Institute for Fiscal Studies.
    14. Toru Kitagawa & Sokbae Lee & Chen Qiu, 2023. "Treatment choice, mean square regret and partial identification," The Japanese Economic Review, Springer, vol. 74(4), pages 573-602, October.
    15. Martin Huber, 2019. "An introduction to flexible methods for policy evaluation," Papers 1910.00641, arXiv.org.
    16. Davide Viviano, 2019. "Policy Targeting under Network Interference," Papers 1906.10258, arXiv.org, revised Apr 2024.
    17. Yuehao Bai & Azeem M. Shaikh & Max Tabord-Meehan, 2024. "A Primer on the Analysis of Randomized Experiments and a Survey of some Recent Advances," Papers 2405.03910, arXiv.org, revised Apr 2025.
    18. Hirano, Keisuke & Porter, Jack R., 2020. "Asymptotic analysis of statistical decision rules in econometrics," Handbook of Econometrics, in: Steven N. Durlauf & Lars Peter Hansen & James J. Heckman & Rosa L. Matzkin (ed.), Handbook of Econometrics, edition 1, volume 7, chapter 0, pages 283-354, Elsevier.
    19. Chunrong Ai & Yue Fang & Haitian Xie, 2024. "Data-Driven Policy Learning for Continuous Treatments," Papers 2402.02535, arXiv.org, revised Dec 2025.
    20. Toru Kitagawa & Sokbae Lee & Chen Qiu, 2022. "Treatment Choice with Nonlinear Regret," Papers 2205.08586, arXiv.org, revised Oct 2024.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2604.07181. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.