Adaptive Experimental Design for Policy Learning

My bibliography Save this paper

Adaptive Experimental Design for Policy Learning

Author

Listed:

Masahiro Kato
Kyohei Okumura
Takuya Ishihara
Toru Kitagawa

Registered:

Abstract

This study investigates the contextual best arm identification (BAI) problem, aiming to design an adaptive experiment to identify the best treatment arm conditioned on contextual information (covariates). We consider a decision-maker who assigns treatment arms to experimental units during an experiment and recommends the estimated best treatment arm based on the contexts at the end of the experiment. The decision-maker uses a policy for recommendations, which is a function that provides the estimated best treatment arm given the contexts. In our evaluation, we focus on the worst-case expected regret, a relative measure between the expected outcomes of an optimal policy and our proposed policy. We derive a lower bound for the expected simple regret and then propose a strategy called Adaptive Sampling-Policy Learning (PLAS). We prove that this strategy is minimax rate-optimal in the sense that its leading factor in the regret upper bound matches the lower bound as the number of experimental units increases.

Suggested Citation

Masahiro Kato & Kyohei Okumura & Takuya Ishihara & Toru Kitagawa, 2024. "Adaptive Experimental Design for Policy Learning," Papers 2401.03756, arXiv.org, revised Jun 2025.

Handle: RePEc:arx:papers:2401.03756

Download full text from publisher

References listed on IDEAS

Karlan, Dean & Wood, Daniel H., 2017. "The effect of effectiveness: Donor response to aid effectiveness in a direct mail fundraising experiment," Journal of Behavioral and Experimental Economics (formerly The Journal of Socio-Economics), Elsevier, vol. 66(C), pages 1-8.
- Dean Karlan & Daniel Wood, 2014. "The Effect of Effectiveness: Donor Response to Aid Effectiveness in a Direct Mail Fundraising Experiment," Working Papers 1038, Economic Growth Center, Yale University.
- Karlan, Dean S. & Wood, Daniel, 2014. "The effect of effectiveness: Donor response to aid effectiveness in a direct mail fundraising experiment," Center Discussion Papers 166682, Yale University, Economic Growth Center.
- Karlan, Dean & Wood, Daniel H., 2014. "The effect of effectiveness: Donor response to aid effectiveness in a direct mail fundraising experiment," CEPR Discussion Papers 9941, C.E.P.R. Discussion Papers.
- Karlan, Dean & Wood, Daniel, 2015. "The Effect of Effectiveness: Donor Response to Aid Effectiveness in a Direct Mail Fundraising Experiment," Working Papers 130, Yale University, Department of Economics.
- Dean Karlan & Daniel H. Wood, 2014. "The Effect of Effectiveness: Donor Response to Aid Effectiveness in a Direct Mail Fundraising Experiment," NBER Working Papers 20047, National Bureau of Economic Research, Inc.
Ruohan Zhan & Zhimei Ren & Susan Athey & Zhengyuan Zhou, 2024. "Policy Learning with Adaptively Collected Data," Management Science, INFORMS, vol. 70(8), pages 5270-5297, August.
- Ruohan Zhan & Zhimei Ren & Susan Athey & Zhengyuan Zhou, 2021. "Policy Learning with Adaptively Collected Data," Papers 2105.02344, arXiv.org, revised Nov 2022.
- Zhan, Ruohan & Ren, Zhimei & Athey, Susan & Zhou, Zhengyuan, 2021. "Policy Learning with Adaptively Collected Data," Research Papers 3963, Stanford University, Graduate School of Business.
Maximilian Kasy & Anja Sautmann, 2021. "Adaptive Treatment Assignment in Experiments for Policy Choice," Econometrica, Econometric Society, vol. 89(1), pages 113-132, January.
- Maximilian Kasy & Anja Sautmann, 2019. "Adaptive Treatment Assignment in Experiments for Policy Choice," CESifo Working Paper Series 7778, CESifo.
Manski, Charles F., 2000. "Identification problems and decisions under ambiguity: Empirical analysis of treatment response and normative analysis of treatment choice," Journal of Econometrics, Elsevier, vol. 95(2), pages 415-442, April.
Kaito Ariu & Masahiro Kato & Junpei Komiyama & Kenichiro McAlinn & Chao Qin, 2021. "Policy Choice and Best Arm Identification: Asymptotic Analysis of Exploration Sampling," Papers 2109.08229, arXiv.org, revised Nov 2021.

Full references (including those not matched with items on IDEAS)

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Masahiro Kato & Masaaki Imaizumi & Takuya Ishihara & Toru Kitagawa, 2023. "Asymptotically Optimal Fixed-Budget Best Arm Identification with Variance-Dependent Bounds," Papers 2302.02988, arXiv.org, revised Jul 2023.
Masahiro Kato & Masaaki Imaizumi & Takuya Ishihara & Toru Kitagawa, 2022. "Best Arm Identification with Contextual Information under a Small Gap," Papers 2209.07330, arXiv.org, revised Jan 2023.
Masahiro Kato, 2023. "Worst-Case Optimal Multi-Armed Gaussian Best Arm Identification with a Fixed Budget," Papers 2310.19788, arXiv.org, revised Mar 2024.
Chao Qin & Daniel Russo, 2024. "Optimizing Adaptive Experiments: A Unified Approach to Regret Minimization and Best-Arm Identification," Papers 2402.10592, arXiv.org, revised Jul 2024.
Cavanagh,Jack & Fliegner,Jasmin Claire & Kopper,Sarah & Sautmann,Anja, 2023. "A Metadata Schema for Data from Experiments in the Social Sciences," Policy Research Working Paper Series 10296, The World Bank.
Masahiro Kato, 2024. "Generalized Neyman Allocation for Locally Minimax Optimal Best-Arm Identification," Papers 2405.19317, arXiv.org, revised Feb 2025.
Masahiro Kato, 2023. "Locally Optimal Fixed-Budget Best Arm Identification in Two-Armed Gaussian Bandits with Unknown Variances," Papers 2312.12741, arXiv.org, revised Mar 2024.
Toru Kitagawa & Jeff Rowley, 2024. "Bandit algorithms for policy learning: methods, implementation, and welfare-performance," The Japanese Economic Review, Springer, vol. 75(3), pages 407-447, July.
Minguez, Ana & Javier Sese, F., 2022. "Why do you want a relationship, anyway? Consent to receive marketing communications and donors’ willingness to engage with nonprofits," Journal of Business Research, Elsevier, vol. 148(C), pages 356-367.
Chuah, Swee-Hoon & Feeny, Simon & Hannan, Timothy & Hoffmann, Robert & Neelim, Ananta, 2024. "Qualitative versus quantitative impact communications in humanitarian appeals: Findings from a field experiment," Economics Letters, Elsevier, vol. 243(C).
Stefan Boes, 2013. "Nonparametric analysis of treatment effects in ordered response models," Empirical Economics, Springer, vol. 44(1), pages 81-109, February.
- Stefan Boes, 2007. "Nonparametric Analysis of Treatment Effects in Ordered Response Models," SOI - Working Papers 0709, Socioeconomic Institute - University of Zurich.
Billur Aksoy & Silvana Krasteva, 2020. "When does less information translate into more giving to public goods?," Experimental Economics, Springer;Economic Science Association, vol. 23(4), pages 1148-1177, December.
Charles F. Manski, 2014. "Choosing Size of Government Under Ambiguity: Infrastructure Spending and Income Taxation," Economic Journal, Royal Economic Society, vol. 0(576), pages 359-376, May.
- Charles F. Manski, 2012. "Choosing Size of Government Under Ambiguity: Infrastructure Spending and Income Taxation," NBER Working Papers 18204, National Bureau of Economic Research, Inc.
Shantanu Gupta & Zachary C. Lipton & David Childers, 2021. "Efficient Online Estimation of Causal Effects by Deciding What to Observe," Papers 2108.09265, arXiv.org, revised Oct 2021.
Garbero, Alessandra & Sakos, Grayson & Cerulli, Giovanni, 2023. "Towards data-driven project design: Providing optimal treatment rules for development projects," Socio-Economic Planning Sciences, Elsevier, vol. 89(C).
- Garbero, Alessandra & Sakos, Grayson & Cerulli, Giovanni, 2021. "Towards Data-driven Project design: Providing Optimal Treatment Rules for Development Projects," 2021 Annual Meeting, August 1-3, Austin, Texas 314016, Agricultural and Applied Economics Association.
Michael Lechner, 2023. "Causal Machine Learning and its use for public policy," Swiss Journal of Economics and Statistics, Springer;Swiss Society of Economics and Statistics, vol. 159(1), pages 1-15, December.
Chang, Chia-Chi & Chen, Po-Yu, 2019. "Which maximizes donations: Charitable giving as an incentive or incentives for charitable giving?," Journal of Business Research, Elsevier, vol. 97(C), pages 65-75.
Singer, Marcos & Donoso, Patricio & Rodríguez-Sickert, Carlos, 2008. "A static model of cooperation for group-based incentive plans," International Journal of Production Economics, Elsevier, vol. 115(2), pages 492-501, October.
A Stefano Caria & Grant Gordon & Maximilian Kasy & Simon Quinn & Soha Osman Shami & Alexander Teytelboym, 2024. "An Adaptive Targeted Field Experiment: Job Search Assistance for Refugees in Jordan," Journal of the European Economic Association, European Economic Association, vol. 22(2), pages 781-836.
- Stefano Caria & Grant Gordon & Maximilian Kasy & Simon Quinn & Soha Shami & Alexander Teytelboym, 2020. "An Adaptive Targeted Field Experiment: Job Search Assistance for Refugees in Jordan," CESifo Working Paper Series 8535, CESifo.
- A. Stefano Caria & Grant Gordon & Maximilian Kasy & Simon Quinn & Soha Shami & Alexander Teytelboym, 2020. "An Adaptive Targeted Field Experiment: Job Search Assistance for Refugees in Jordan," CSAE Working Paper Series 2020-20, Centre for the Study of African Economies, University of Oxford.
- Caria, Stefano & Gordon, Grant & Kasy, Maximilian & Quinn, Simon & Shami, Soha & Teytelboym, Alexander, 2021. "An Adaptive Targeted Field Experiment: Job Search Assistance for Refugees in Jordan," CAGE Online Working Paper Series 547, Competitive Advantage in the Global Economy (CAGE).
- Caria, Stefano & Gordon, Grant & Kasy, Maximilian & Quinn, Simon & Shami, Soha & Teytelboym, Alexander, 2021. "An Adaptive Targeted Field Experiment : Job Search Assistance for Refugees in Jordan," The Warwick Economics Research Paper Series (TWERPS) 1335, University of Warwick, Department of Economics.
- Quinn, Simon & Caria, Stefano & Gordon, Grant & Kasy, Maximilian & Shami, Soha & Teytelboym, Alexander, 2020. "An Adaptive Targeted Field Experiment: Job Search Assistance for Refugees in Jordan," CEPR Discussion Papers 15359, C.E.P.R. Discussion Papers.
Vollebergh, Herman R.J. & Melenberg, Bertrand & Dijkgraaf, Elbert, 2009. "Identifying reduced-form relations with panel data: The case of pollution and income," Journal of Environmental Economics and Management, Elsevier, vol. 58(1), pages 27-42, July.

More about this item

NEP fields

This paper has been announced in the following NEP Reports:

NEP-DES-2024-02-05 (Economic Design)
NEP-EXP-2024-02-05 (Experimental Economics)

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2401.03756. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Adaptive Experimental Design for Policy Learning

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Most related items

More about this item

NEP fields

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data