Optimal allocation strategies in a discrete-time bandit problem

Author

Hu, Audrey
Zou, Liang

Abstract

We study a discrete-time, two-armed “breakthrough” bandit in which an agent allocates a perfectly divisible resource each period between a safe arm and a risky arm. Departing from the binary “either–or” paradigm, we consider continuous allocation strategies and a general success technology F with nonincreasing hazard rate. Using a variational, pathwise approach combined with dynamic programming, we characterize the unique optimal belief–allocation path via a time-invariant backward/forward transformation. The optimal path features interior, tapering allocations that never stop prior to a breakthrough, and it delivers a strictly higher eventual success probability and expected payoff than the optimal binary (bang-bang) benchmark. In the exponential case, the mappings become explicit, making computation immediate and revealing a Goldilocks principle: the total planned allocation to exploration is maximized at intermediate task difficulty. The framework highlights comparative dynamics (how entire optimal paths shift with primitives) while remaining robust to the functional form of F.
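
To make the belief dynamics in the abstract concrete, here is a minimal sketch of Bayesian updating in a breakthrough bandit with continuous allocations. It is an illustration under stated assumptions, not the paper's method: the risky arm is assumed to be either good or bad, a good arm is assumed to yield a breakthrough in a period with probability F(x) = 1 - exp(-lam * x) when it receives allocation x (the exponential case, whose hazard rate is constant and hence weakly nonincreasing), and a bad arm is assumed never to succeed. The function names, the parameter `lam`, the prior 0.6, and the tapering plan `allocs` are all hypothetical; in particular, `allocs` is not the optimal path characterized in the paper.

```python
import math


def success_prob(x, lam=1.0):
    """Assumed exponential success technology: F(x) = 1 - exp(-lam * x)."""
    return 1.0 - math.exp(-lam * x)


def update_belief(p, x, lam=1.0):
    """Posterior that the risky arm is good after allocating x and
    observing no breakthrough (Bayes' rule; a bad arm never succeeds)."""
    f = success_prob(x, lam)
    return p * (1.0 - f) / (1.0 - p * f)


def belief_path(p0, allocations, lam=1.0):
    """Beliefs along a planned allocation path, conditional on no breakthrough."""
    path = [p0]
    for x in allocations:
        path.append(update_belief(path[-1], x, lam))
    return path


def eventual_success_prob(p0, allocations, lam=1.0):
    """Probability that a breakthrough ever occurs under the plan.
    With exponential F, the no-success probability on a good arm is
    exp(-lam * total allocation), so only the total matters here."""
    total = sum(allocations)
    return p0 * (1.0 - math.exp(-lam * total))


# Hypothetical interior, tapering plan: shares shrink geometrically but never hit zero.
allocs = [0.5 * 0.8 ** t for t in range(20)]
path = belief_path(0.6, allocs)
print("final belief:", path[-1])
print("eventual success probability:", eventual_success_prob(0.6, allocs))
```

Under these assumptions the belief falls after every failure but stays strictly positive, which is consistent with allocations that taper without stopping. The sketch says nothing about which path is optimal.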

Suggested Citation

Hu, Audrey & Zou, Liang, 2026. "Optimal allocation strategies in a discrete-time bandit problem," Journal of Economic Dynamics and Control, Elsevier, vol. 184(C).

Handle: RePEc:eee:dyncon:v:184:y:2026:i:c:s0165188926000102
DOI: 10.1016/j.jedc.2026.105264

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Heidhues, Paul & Rady, Sven & Strack, Philipp, 2015. "Strategic experimentation with private payoffs," Journal of Economic Theory, Elsevier, vol. 159(PA), pages 531-551.
- Heidhues, Paul & Rady, Sven & Strack, Philipp, 2012. "Strategic Experimentation with Private Payoffs," Discussion Paper Series of SFB/TR 15 Governance and the Efficiency of Economic Systems 387, Free University of Berlin, Humboldt University of Berlin, University of Bonn, University of Mannheim, University of Munich.
- Rady, Sven & Heidhues, Paul & Strack, Philipp, 2015. "Strategic Experimentation with Private Payoffs," CEPR Discussion Papers 10634, Centre for Economic Policy Research.
Rosenberg, Dinah & Salomon, Antoine & Vieille, Nicolas, 2013. "On games of strategic experimentation," Games and Economic Behavior, Elsevier, vol. 82(C), pages 31-51.
- Dinah Rosenberg & Antoine Salomon & Nicolas Vieille, 2010. "On Games of Strategic Experimentation," Working Papers hal-00579613, HAL.
- Rosenberg, Dinah & Salomon, Antoine & Vieille, Nicolas, 2013. "On Games of Strategic Experimentation," HEC Research Papers Series 1008, HEC Paris.
Kaustav Das & Nicolas Klein & Katharina Schmid, 2020. "Strategic experimentation with asymmetric players," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 69(4), pages 1147-1175, June.
Renault, Jérôme & Solan, Eilon & Vieille, Nicolas, 0. "Strategic experimentation with privately observed payoffs," Theoretical Economics, Econometric Society.
Thomas, Caroline, 2019. "Experimentation with reputation concerns – Dynamic signalling with changing types," Journal of Economic Theory, Elsevier, vol. 179(C), pages 366-415.
Chen, Chia-Hui & Ishida, Junichiro, 2018. "Hierarchical experimentation," Journal of Economic Theory, Elsevier, vol. 177(C), pages 365-404.
- Chia-Hui Chen & Junichiro Ishida, 2015. "Hierarchical Experimentation," ISER Discussion Paper 0949, Institute of Social and Economic Research, The University of Osaka.
Bloch, Francis & Fabrizi, Simona & Lippert, Steffen, 2022. "Hiding and herding in market entry," Journal of Economic Theory, Elsevier, vol. 206(C).
- Francis Bloch & Simona Fabrizi & Steffen Lippert, 2022. "Hiding and herding in market entry," PSE-Ecole d'économie de Paris (Postprint) halshs-03956373, HAL.
- Francis Bloch & Simona Fabrizi & Steffen Lippert, 2022. "Hiding and herding in market entry," Post-Print halshs-03956373, HAL.
Alessandro Lizzeri & Eran Shmaya & Leeat Yariv, 2024. "Disentangling Exploration from Exploitation," NBER Working Papers 32424, National Bureau of Economic Research, Inc.
- Alessandro Lizzeri & Eran Shmaya & Leeat Yariv, 2024. "Disentangling Exploration from Exploitation," Working Papers 334, Princeton University, Department of Economics, Center for Economic Policy Studies.
- Alessandro Lizzeri & Eran Shmaya & Leeat Yariv, 2024. "Disentangling Exploration from Exploitation," Papers 2404.19116, arXiv.org.
- Lizzeri, Alessandro & Shmaya, Eran & Yariv, Leeat, 2024. "Disentangling Exploration from Exploitation," CEPR Discussion Papers 19058, Centre for Economic Policy Research.
Klein, Nicolas, 2013. "Strategic learning in teams," Games and Economic Behavior, Elsevier, vol. 82(C), pages 636-657.
- Klein, Nicolas, 2010. "Strategic Learning in Teams," Discussion Paper Series of SFB/TR 15 Governance and the Efficiency of Economic Systems 333, Free University of Berlin, Humboldt University of Berlin, University of Bonn, University of Mannheim, University of Munich.
Mira Frick & Yuhta Ishii, 2015. "Innovation Adoption by Forward-Looking Social Learners," Cowles Foundation Discussion Papers 1877, Cowles Foundation for Research in Economics, Yale University.
Keller, Godfrey & Rady, Sven, 2015. "Breakdowns," Theoretical Economics, Econometric Society, vol. 10(1), January.
- Keller, Godfrey & Rady, Sven, 2012. "Breakdowns," Discussion Paper Series of SFB/TR 15 Governance and the Efficiency of Economic Systems 396, Free University of Berlin, Humboldt University of Berlin, University of Bonn, University of Mannheim, University of Munich.
- Godfrey Keller & Sven Rady, 2013. "Breakdowns," Levine's Working Paper Archive 786969000000000635, David K. Levine.
Wagner, Peter A. & Klein, Nicolas, 2022. "Strategic investment and learning with private information," Journal of Economic Theory, Elsevier, vol. 204(C).
- Nicolas KLEIN & Peter WAGNER, 2018. "Strategic Investment and Learning with Private Information," Cahiers de recherche 13-2018, Centre interuniversitaire de recherche en économie quantitative, CIREQ.
- KLEIN, Nicolas & WAGNER, Peter, 2018. "Strategic investment and learning with private information," Cahiers de recherche 2018-10, Universite de Montreal, Departement de sciences economiques.
Simina Brânzei & Yuval Peres, 2019. "Multiplayer Bandit Learning, from Competition to Cooperation," Papers 1908.01135, arXiv.org, revised Jan 2024.
Chen, Chia-Hui & Ishida, Junichiro & Mukherjee, Arijit, 2023. "Pioneer, early follower or late entrant: Entry dynamics with learning and market competition," European Economic Review, Elsevier, vol. 152(C).
- Chia-Hui Chen & Junichiro Ishida & Arijit Mukherjee, 2021. "Pioneer, Early Follower or Late Entrant: Entry Dynamics with Learning and Market Competition," ISER Discussion Paper 1132, Institute of Social and Economic Research, The University of Osaka.
Marlats, Chantal & Ménager, Lucie, 2021. "Strategic observation with exponential bandits," Journal of Economic Theory, Elsevier, vol. 193(C).
Keller, Godfrey & Novák, Vladimír & Willems, Tim, 2019. "A note on optimal experimentation under risk aversion," Journal of Economic Theory, Elsevier, vol. 179(C), pages 476-487.
- Vladimir Novak & Tim Willems, 2018. "A Note on Optimal Experimentation under Risk Aversion," CERGE-EI Working Papers wp618, The Center for Economic Research and Graduate Education - Economics Institute, Prague.
Thomas Greve & Hans Keiding, 2023. "A model of privately funded public research," Journal of Economics, Springer, vol. 140(1), pages 63-91, September.
Nicolas Klein & Tymofiy Mylovanov, 2011. "Should the Flatterers be Avoided?," 2011 Meeting Papers 1273, Society for Economic Dynamics.
Besanko, David & Tong, Jian & Wu, Jianjun, 2016. "Subsidizing research programs with "if" and "when" uncertainty in the face of severe informational constraints," Discussion Paper Series In Economics And Econometrics 1605, Economics Division, School of Social Sciences, University of Southampton.
Sorensen, Morten, 2007. "Learning by Investing: Evidence from Venture Capital," SIFR Research Report Series 53, Institute for Financial Research.
