
Adaptive Experimental Design for Policy Learning

Author

Listed:
  • Masahiro Kato
  • Kyohei Okumura
  • Takuya Ishihara
  • Toru Kitagawa

Abstract

Evidence-based targeting has been a topic of growing interest among practitioners in policy and business. Formulating a decision-maker's policy learning as a fixed-budget best arm identification (BAI) problem with contextual information, we study an optimal adaptive experimental design for policy learning with multiple treatment arms. In the sampling stage, the planner assigns treatment arms adaptively to sequentially arriving experimental units after observing their contextual information (covariates). After the experiment, the planner recommends an individualized assignment rule to the population. Taking the worst-case expected regret as the performance criterion for adaptive sampling and recommended policies, we derive its asymptotic lower bounds and propose a strategy, the Adaptive Sampling-Policy Learning strategy (PLAS), whose leading factor of the regret upper bound matches the lower bound as the number of experimental units increases.
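
The two-stage setup described in the abstract (adaptive sampling over arriving units, then an individualized recommendation) can be illustrated with a small simulation. The sketch below is not the PLAS strategy from the paper. It assumes a finite set of contexts drawn uniformly, Gaussian outcomes, and a simple stand-in heuristic that samples arms in proportion to their estimated standard deviations within each context. The names `run_experiment`, `policy_regret`, `true_means`, and `true_sds` are hypothetical and introduced only for illustration.

```python
import random
import statistics


def run_experiment(true_means, true_sds, budget, seed=0):
    """Two-stage sketch: adaptive sampling, then a per-context recommendation.

    true_means[x][a] and true_sds[x][a] are hypothetical outcome means and
    standard deviations of arm a in context x. The planner does not see them;
    they are used only to simulate outcomes.
    """
    rng = random.Random(seed)
    n_contexts = len(true_means)
    n_arms = len(true_means[0])
    # outcomes[x][a] collects the outcomes observed for arm a in context x
    outcomes = [[[] for _ in range(n_arms)] for _ in range(n_contexts)]

    for _ in range(budget):
        x = rng.randrange(n_contexts)  # a unit arrives; its covariate is observed
        cell = outcomes[x]
        # Draw every arm twice in this context before adapting
        under_sampled = [a for a in range(n_arms) if len(cell[a]) < 2]
        if under_sampled:
            a = under_sampled[0]
        else:
            # Assumed heuristic (not from the paper): assignment probability
            # proportional to the estimated standard deviation of each arm
            sds = [statistics.stdev(obs) + 1e-9 for obs in cell]
            a = rng.choices(range(n_arms), weights=sds)[0]
        cell[a].append(rng.gauss(true_means[x][a], true_sds[x][a]))

    # Recommendation stage: pick the arm with the best sample mean per context
    policy = []
    for x in range(n_contexts):
        means = [statistics.fmean(obs) if obs else float("-inf")
                 for obs in outcomes[x]]
        policy.append(means.index(max(means)))
    counts = [[len(obs) for obs in outcomes[x]] for x in range(n_contexts)]
    return policy, counts


def policy_regret(policy, true_means):
    """Average over contexts of (best arm mean - recommended arm mean)."""
    gaps = [max(m) - m[policy[x]] for x, m in enumerate(true_means)]
    return sum(gaps) / len(gaps)


if __name__ == "__main__":
    means = [[0.0, 1.0, 0.2], [1.0, 0.0, 0.3]]
    sds = [[0.5, 1.0, 0.5], [1.0, 0.5, 0.5]]
    policy, counts = run_experiment(means, sds, budget=2000, seed=0)
    print("recommended arm per context:", policy)
    print("samples per (context, arm):", counts)
    print("regret of recommended policy:", policy_regret(policy, means))
```

Here `policy_regret` measures the regret of a single recommended policy for one fixed set of arm means. The criterion in the abstract is the worst-case expected regret, which would additionally require averaging over experiment runs and taking the worst case over outcome distributions.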

Suggested Citation

  • Masahiro Kato & Kyohei Okumura & Takuya Ishihara & Toru Kitagawa, 2024. "Adaptive Experimental Design for Policy Learning," Papers 2401.03756, arXiv.org, revised Feb 2024.
  • Handle: RePEc:arx:papers:2401.03756

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2401.03756
    File Function: Latest version
    Download Restriction: no

    References listed on IDEAS

    1. Karlan, Dean & Wood, Daniel H., 2017. "The effect of effectiveness: Donor response to aid effectiveness in a direct mail fundraising experiment," Journal of Behavioral and Experimental Economics (formerly The Journal of Socio-Economics), Elsevier, vol. 66(C), pages 1-8.
    2. Ruohan Zhan & Zhimei Ren & Susan Athey & Zhengyuan Zhou, 2021. "Policy Learning with Adaptively Collected Data," Papers 2105.02344, arXiv.org, revised Nov 2022.
    3. Maximilian Kasy & Anja Sautmann, 2021. "Adaptive Treatment Assignment in Experiments for Policy Choice," Econometrica, Econometric Society, vol. 89(1), pages 113-132, January.
    4. Kaito Ariu & Masahiro Kato & Junpei Komiyama & Kenichiro McAlinn & Chao Qin, 2021. "Policy Choice and Best Arm Identification: Asymptotic Analysis of Exploration Sampling," Papers 2109.08229, arXiv.org, revised Nov 2021.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Masahiro Kato & Masaaki Imaizumi & Takuya Ishihara & Toru Kitagawa, 2023. "Asymptotically Optimal Fixed-Budget Best Arm Identification with Variance-Dependent Bounds," Papers 2302.02988, arXiv.org, revised Jul 2023.
    2. Masahiro Kato & Masaaki Imaizumi & Takuya Ishihara & Toru Kitagawa, 2022. "Best Arm Identification with Contextual Information under a Small Gap," Papers 2209.07330, arXiv.org, revised Jan 2023.
    3. Chao Qin & Daniel Russo, 2024. "Optimizing Adaptive Experiments: A Unified Approach to Regret Minimization and Best-Arm Identification," Papers 2402.10592, arXiv.org.
    4. Masahiro Kato, 2023. "Locally Optimal Fixed-Budget Best Arm Identification in Two-Armed Gaussian Bandits with Unknown Variances," Papers 2312.12741, arXiv.org, revised Mar 2024.
    5. Minguez, Ana & Javier Sese, F., 2022. "Why do you want a relationship, anyway? Consent to receive marketing communications and donors’ willingness to engage with nonprofits," Journal of Business Research, Elsevier, vol. 148(C), pages 356-367.
    6. Billur Aksoy & Silvana Krasteva, 2020. "When does less information translate into more giving to public goods?," Experimental Economics, Springer;Economic Science Association, vol. 23(4), pages 1148-1177, December.
    7. Shantanu Gupta & Zachary C. Lipton & David Childers, 2021. "Efficient Online Estimation of Causal Effects by Deciding What to Observe," Papers 2108.09265, arXiv.org, revised Oct 2021.
    8. Michael Lechner, 2023. "Causal Machine Learning and its use for public policy," Swiss Journal of Economics and Statistics, Springer;Swiss Society of Economics and Statistics, vol. 159(1), pages 1-15, December.
    9. Chang, Chia-Chi & Chen, Po-Yu, 2019. "Which maximizes donations: Charitable giving as an incentive or incentives for charitable giving?," Journal of Business Research, Elsevier, vol. 97(C), pages 65-75.
    10. Stefano Caria & Grant Gordon & Maximilian Kasy & Simon Quinn & Soha Shami & Alexander Teytelboym, 2020. "An Adaptive Targeted Field Experiment: Job Search Assistance for Refugees in Jordan," CESifo Working Paper Series 8535, CESifo.
    11. Nadine Chlaß & Lata Gangadharan & Kristy Jones, 2023. "Charitable giving and intermediation: a principal agent problem with hidden prices," Oxford Economic Papers, Oxford University Press, vol. 75(4), pages 941-961.
    12. Bahety, Girija & Bauhoff, Sebastian & Patel, Dev & Potter, James, 2021. "Texts don’t nudge: An adaptive trial to prevent the spread of COVID-19 in India," Journal of Development Economics, Elsevier, vol. 153(C).
    13. Diederich, Johannes & Epperson, Raphael & Goeschl, Timo, 2021. "How to Design the Ask? Funding Units vs. Giving Money," Working Papers 0698, University of Heidelberg, Department of Economics.
    14. Arthur Charpentier & Romuald Élie & Carl Remlinger, 2023. "Reinforcement Learning in Economics and Finance," Computational Economics, Springer;Society for Computational Economics, vol. 62(1), pages 425-462, June.
    15. Kock, Anders Bredahl & Preinerstorfer, David & Veliyev, Bezirgen, 2023. "Treatment recommendation with distributional targets," Journal of Econometrics, Elsevier, vol. 234(2), pages 624-646.
    16. Martino Banchio & Giacomo Mantegazza, 2022. "Artificial Intelligence and Spontaneous Collusion," Papers 2202.05946, arXiv.org, revised Sep 2023.
    17. Jasjit Singh & Nina Teng & Serguei Netessine, 2019. "Philanthropic Campaigns and Customer Behavior: Field Experiments on an Online Taxi Booking Platform," Management Science, INFORMS, vol. 65(2), pages 913-932, February.
    18. Portillo, Javier E. & Stinn, Joseph, 2018. "Overhead aversion: Do some types of overhead matter more than others?," Journal of Behavioral and Experimental Economics (formerly The Journal of Socio-Economics), Elsevier, vol. 72(C), pages 40-50.
    19. Xiaoxue Sherry Gao & Glenn W. Harrison & Rusty Tchernis, 2020. "Behavioral Welfare Economics and Risk Preferences: A Bayesian Approach," NBER Working Papers 27685, National Bureau of Economic Research, Inc.
    20. Metzger, Laura & Günther, Isabel, 2019. "Making an impact? The relevance of information on aid effectiveness for charitable giving. A laboratory experiment," Journal of Development Economics, Elsevier, vol. 136(C), pages 18-33.
