Robust experimentation in the continuous time bandit problem

My bibliography Save this article

Robust experimentation in the continuous time bandit problem

Author

Listed:

Farzad Pourbabaee
(University of California)

Registered:

Abstract

We study the experimentation dynamics of a decision maker (DM) in a two-armed bandit setup (Bolton and Harris in Econometrica 67(2):349–374, 1999), where the agent holds ambiguous beliefs regarding the distribution of the return process of one arm and is certain about the other one. The DM entertains Multiplier preferences à la Hansen and Sargent (Am. Econ. Rev. 91(2):60–66, 2001), thus we frame the decision making environment as a two-player differential game against nature in continuous time. We characterize the DM’s value function and her optimal experimentation strategy that turns out to follow a cut-off rule with respect to her belief process. The belief threshold for exploring the ambiguous arm is found in closed form and is shown to be increasing with respect to the ambiguity aversion index. We then study the effect of provision of an unambiguous information source about the ambiguous arm. Interestingly, we show that the exploration threshold rises unambiguously as a result of this new information source, thereby leading to more conservatism. This analysis also sheds light on the efficient time to reach for an expert opinion.

Suggested Citation

Farzad Pourbabaee, 2022. "Robust experimentation in the continuous time bandit problem," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 73(1), pages 151-181, February.

Handle: RePEc:spr:joecth:v:73:y:2022:i:1:d:10.1007_s00199-020-01328-3
DOI: 10.1007/s00199-020-01328-3

Download full text from publisher

As the access to this document is restricted, you may want to search for a different version of it.

References listed on IDEAS

Gustavo Manso, 2011. "Motivating Innovation," Journal of Finance, American Finance Association, vol. 66(5), pages 1823-1860, October.
Lars Peter Hansen & Thomas J Sargent, 2014. "Robust Control and Model Misspecification," World Scientific Book Chapters, in: UNCERTAINTY WITHIN ECONOMIC MODELS, chapter 6, pages 155-216, World Scientific Publishing Co. Pte. Ltd..
- Hansen, Lars Peter & Sargent, Thomas J. & Turmuhambetova, Gauhar & Williams, Noah, 2006. "Robust control and model misspecification," Journal of Economic Theory, Elsevier, vol. 128(1), pages 45-90, May.
Christopher Anderson, 2012. "Ambiguity aversion in multi-armed bandit problems," Theory and Decision, Springer, vol. 72(1), pages 15-33, January.
Hansen, Lars Peter & Sargent, Thomas J., 2011. "Robustness and ambiguity in continuous time," Journal of Economic Theory, Elsevier, vol. 146(3), pages 1195-1223, May.
Bonatti, Alessandro & Hörner, Johannes, 2017. "Learning to disagree in a game of experimentation," Journal of Economic Theory, Elsevier, vol. 169(C), pages 234-269.
- Alessandro Bonatti & Johannes Horner, 2015. "Learning to Disagree in a Game of Experimentation," Cowles Foundation Discussion Papers 1991, Cowles Foundation for Research in Economics, Yale University.
- Bonatti, Alessandro & Hörner, Johannes, 2017. "Learning to Disagree in a Game of Experimentation," TSE Working Papers 17-791, Toulouse School of Economics (TSE).
Larry G. Epstein & Shaolin Ji, 2022. "Optimal Learning Under Robustness and Time-Consistency," Operations Research, INFORMS, vol. 70(3), pages 1317-1329, May.
- Larry G. Epstein & Shaolin Ji, 2017. "Optimal Learning under Robustness and Time-Consistency," Papers 1708.01890, arXiv.org, revised Mar 2019.
Heidhues, Paul & Rady, Sven & Strack, Philipp, 2015. "Strategic experimentation with private payoffs," Journal of Economic Theory, Elsevier, vol. 159(PA), pages 531-551.
- Heidhues, Paul & Rady, Sven & Strack, Philipp, 2012. "Strategic Experimentation with Private Payoffs," Discussion Paper Series of SFB/TR 15 Governance and the Efficiency of Economic Systems 387, Free University of Berlin, Humboldt University of Berlin, University of Bonn, University of Mannheim, University of Munich.
- Rady, Sven & Heidhues, Paul & Strack, Philipp, 2015. "Strategic Experimentation with Private Payoffs," CEPR Discussion Papers 10634, C.E.P.R. Discussion Papers.
Li, Jian, 2019. "The K-armed bandit problem with multiple priors," Journal of Mathematical Economics, Elsevier, vol. 80(C), pages 22-38.
Godfrey Keller & Sven Rady & Martin Cripps, 2005. "Strategic Experimentation with Exponential Bandits," Econometrica, Econometric Society, vol. 73(1), pages 39-68, January.
- Rady, Sven & Cripps, Martin William & Keller, R Godfrey, 2003. "Strategic Experimentation with Exponential Bandits," CEPR Discussion Papers 3814, C.E.P.R. Discussion Papers.
- Cripps, Martin & Keller, Godfrey & Rady, Sven, 2003. "Strategic Experimentation with Exponential Bandits," Discussion Papers in Economics 4, University of Munich, Department of Economics.
- Godfrey Keller & Martin Cripps & Olin School of Business & Washington University & Sven Rady & Department of Economics & University of Munich, 2003. "Strategic Experimentation with Exponential Bandits," Economics Series Working Papers 143, University of Oxford, Department of Economics.
Yaoyao Wu & Jinqiang Yang & Zhentao Zou, 2018. "Ambiguity sharing and the lack of relative performance evaluation," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 66(1), pages 141-157, July.
Larry G. Epstein & Martin Schneider, 2007. "Learning Under Ambiguity," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 74(4), pages 1275-1303.
- Larry Epstein & Martin Schneider, 2002. "Learning Under Ambiguity," RCER Working Papers 497, University of Rochester - Center for Economic Research (RCER), revised Mar 2005.
- Larry Epstein & Martin Schneider, 2006. "Learning Under Ambiguity," RCER Working Papers 527, University of Rochester - Center for Economic Research (RCER).
Robert J. Meyer & Yong Shi, 1995. "Sequential Choice Under Ambiguity: Intuitive Solutions to the Armed-Bandit Problem," Management Science, INFORMS, vol. 41(5), pages 817-834, May.
Jianjun Miao & Alejandro Rivera, 2016. "Robust Contracts in Continuous Time," Econometrica, Econometric Society, vol. 84(4), pages 1405-1440, July.
- Jianjun Miao & Alejandro Rivera, 2016. "Robust Contracts in Continuous Time," Econometrica, Econometric Society, vol. 84, pages 1405-1440, July.
- Jianjun Miao & Alejandro Rivera, 2013. "Robust Contracts in Continuous Time," Boston University - Department of Economics - Working Papers Series 2013-009, Boston University - Department of Economics.
Gilboa, Itzhak & Schmeidler, David, 1989. "Maxmin expected utility with non-unique prior," Journal of Mathematical Economics, Elsevier, vol. 18(2), pages 141-153, April.
- Gilboa, Itzhak & Schmeidler, David, 1986. "Maxmin Expected Utility with a Non-Unique Prior," Foerder Institute for Economic Research Working Papers 275405, Tel-Aviv University > Foerder Institute for Economic Research.
- Itzhak Gilboa & David Schmeidler, 1989. "Maxmin Expected Utility with Non-Unique Prior," Post-Print hal-00753237, HAL.
Yulei Luo, 2017. "Robustly Strategic Consumption–Portfolio Rules with Informational Frictions," Management Science, INFORMS, vol. 63(12), pages 4158-4174, December.
- Luo, Yulei, 2015. "Robustly Strategic Consumption-Portfolio Rules with Informational Frictions," MPRA Paper 64312, University Library of Munich, Germany.
Weitzman, Martin L, 1979. "Optimal Search for the Best Alternative," Econometrica, Econometric Society, vol. 47(3), pages 641-654, May.
- M. L. Weitzman, 1978. "Optimal Search for the Best Alternative," Working papers 214, Massachusetts Institute of Technology (MIT), Department of Economics.
Frank Riedel, 2009. "Optimal Stopping With Multiple Priors," Econometrica, Econometric Society, vol. 77(3), pages 857-908, May.
Godfrey Keller & Sven Rady, 1999. "Optimal Experimentation in a Changing Environment," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 66(3), pages 475-507.
- Godfrey Keller & Sven Rady, 1997. "Optimal Experimentation in a Changing Environment," STICERD - Theoretical Economics Paper Series 333, Suntory and Toyota International Centres for Economics and Related Disciplines, LSE.
- Godfrey Keller & Sven Rady, 1998. "Optimal Experimentation in a Changing Environment," Game Theory and Information 9801001, University Library of Munich, Germany.
Lars Peter Hansen & Thomas J Sargent, 2014. "Robust Control and Model Uncertainty," World Scientific Book Chapters, in: UNCERTAINTY WITHIN ECONOMIC MODELS, chapter 5, pages 145-154, World Scientific Publishing Co. Pte. Ltd..
- Thomas J. Sargent & LarsPeter Hansen, 2001. "Robust Control and Model Uncertainty," American Economic Review, American Economic Association, vol. 91(2), pages 60-66, May.
Patrick Bolton & Christopher Harris, 1999. "Strategic Experimentation," Econometrica, Econometric Society, vol. 67(2), pages 349-374, March.
Epstein, Larry G. & Schneider, Martin, 2003. "Recursive multiple-priors," Journal of Economic Theory, Elsevier, vol. 113(1), pages 1-31, November.
- Larry G. Epstein & Martin Schneider, 2001. "Recursive Multiple-Priors," RCER Working Papers 485, University of Rochester - Center for Economic Research (RCER).
Massimo Marinacci, 2002. "Learning from ambiguous urns," Statistical Papers, Springer, vol. 43(1), pages 143-151, January.

Full references (including those not matched with items on IDEAS)

Citations

Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.

Cited by:

Matt Van Essen & John Wooders, 2023. "Dual auctions for assigning winners and compensating losers," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 76(4), pages 1069-1114, November.
- John Wooders & Matt Van Essen, 2018. "Dual Auctions for Assigning Winners and Compensating Losers," Working Papers 20180013, New York University Abu Dhabi, Department of Social Science, revised Jan 2018.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Farzad Pourbabaee, 2021. "Robust Experimentation in the Continuous Time Bandit Problem," Papers 2104.00102, arXiv.org.
Li, Jian, 2019. "The K-armed bandit problem with multiple priors," Journal of Mathematical Economics, Elsevier, vol. 80(C), pages 22-38.
Paul Viefers, 2012. "Should I Stay or Should I Go?: A Laboratory Analysis of Investment Opportunities under Ambiguity," Discussion Papers of DIW Berlin 1228, DIW Berlin, German Institute for Economic Research.
Alexander Zimper, 2011. "Do Bayesians Learn Their Way Out of Ambiguity?," Decision Analysis, INFORMS, vol. 8(4), pages 269-285, December.
- Alexander Zimper, 2011. "Do Bayesians learn their way out of ambiguity?," Working Papers 240, Economic Research Southern Africa.
Cosmin L. Ilut & Martin Schneider, 2022. "Modeling Uncertainty as Ambiguity: a Review," NBER Working Papers 29915, National Bureau of Economic Research, Inc.
Jang, Bong-Gyu & Lee, Seungkyu & Lim, Byung Hwa, 2016. "Robust consumption and portfolio rules with time-varying model confidence," Finance Research Letters, Elsevier, vol. 18(C), pages 342-352.
Peter G. Hansen, 2021. "New Formulations of Ambiguous Volatility with an Application to Optimal Dynamic Contracting," Papers 2101.12306, arXiv.org.
Swagata Bhattacharjee, 2019. "Dynamic Contracting for Innovation Under Ambiguity," Working Papers 1022, Ashoka University, Department of Economics, revised Aug 2019.
Battigalli, P. & Francetich, A. & Lanzani, G. & Marinacci, M., 2019. "Learning and self-confirming long-run biases," Journal of Economic Theory, Elsevier, vol. 183(C), pages 740-785.
Daniele Pennesi, 2013. "Asset Prices in an Ambiguous Economy," Carlo Alberto Notebooks 315, Collegio Carlo Alberto.
Bhattacharjee, Swagata, 2022. "Dynamic contracting for innovation under ambiguity," Games and Economic Behavior, Elsevier, vol. 132(C), pages 534-552.
- Swagata Bhattacharjee, 2019. "Dynamic Contracting for Innovation Under Ambiguity," Working Papers 15, Ashoka University, Department of Economics, revised 02 Aug 2019.
Hansen, Peter G., 2022. "New formulations of ambiguous volatility with an application to optimal dynamic contracting," Journal of Economic Theory, Elsevier, vol. 199(C).
Moreno Othón M., 2014. "Consumption of Durable Goods under Ambiguity," Working Papers 2014-02, Banco de México.
Li, Jing, 2018. "Essays on model uncertainty in financial models," Other publications TiSEM 202cd910-7ef1-4db4-94ae-d, Tilburg University, School of Economics and Management.
Hansen, Lars Peter & Sargent, Thomas J., 2022. "Structured ambiguity and model misspecification," Journal of Economic Theory, Elsevier, vol. 199(C).
Keller, Godfrey & Novák, Vladimír & Willems, Tim, 2019. "A note on optimal experimentation under risk aversion," Journal of Economic Theory, Elsevier, vol. 179(C), pages 476-487.
- Vladimir Novak & Tim Willems, 2018. "A Note on Optimal Experimentation under Risk Aversion," CERGE-EI Working Papers wp618, The Center for Economic Research and Graduate Education - Economics Institute, Prague.
Aït-Sahalia, Yacine & Matthys, Felix, 2019. "Robust consumption and portfolio policies when asset prices can jump," Journal of Economic Theory, Elsevier, vol. 179(C), pages 1-56.
Hui Chen & Nengjiu Ju & Jianjun Miao, 2014. "Dynamic Asset Allocation with Ambiguous Return Predictability," Review of Economic Dynamics, Elsevier for the Society for Economic Dynamics, vol. 17(4), pages 799-823, October.
- Hui Chen & Nengjiu Ju & Jianjun Miao, "undated". "Dynamic Asset Allocation with Ambiguous Return Predictability," Boston University - Department of Economics - Working Papers Series wp2009-015, Boston University - Department of Economics.
- Hui Chen & Nengjiu Ju & Jianjun Miao, 2008. "Dynamic Asset Allocation with Ambiguous Return Predictability," Boston University - Department of Economics - The Institute for Economic Development Working Papers Series dp-179, Boston University - Department of Economics, revised Feb 2009.
Werner, Jan, 2022. "Speculative trade under ambiguity," Journal of Economic Theory, Elsevier, vol. 199(C).
- Jan Werner, 2016. "Speculative Trade under Ambiguity," 2016 Meeting Papers 1607, Society for Economic Dynamics.
Yacine Aït-Sahalia & Felix Matthys & Emilio Osambela & Ronnie Sircar, 2021. "When Uncertainty and Volatility Are Disconnected: Implications for Asset Pricing and Portfolio Performance," NBER Working Papers 29195, National Bureau of Economic Research, Inc.
- Yacine Aït-Sahalia & Felix Matthys & Emilio Osambela & Ronnie Sircar, 2021. "When Uncertainty and Volatility Are Disconnected: Implications for Asset Pricing and Portfolio Performance," Finance and Economics Discussion Series 2021-063, Board of Governors of the Federal Reserve System (U.S.).

More about this item

Keywords

Model uncertainty; Dynamic experimentation; Variational preferences; Information valuation; Ambiguous diffusion;
All these keywords.

JEL classification:

C44 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods: Special Topics - - - Operations Research; Statistical Decision Theory
C61 - Mathematical and Quantitative Methods - - Mathematical Methods; Programming Models; Mathematical and Simulation Modeling - - - Optimization Techniques; Programming Models; Dynamic Analysis
C73 - Mathematical and Quantitative Methods - - Game Theory and Bargaining Theory - - - Stochastic and Dynamic Games; Evolutionary Games
D81 - Microeconomics - - Information, Knowledge, and Uncertainty - - - Criteria for Decision-Making under Risk and Uncertainty
D83 - Microeconomics - - Information, Knowledge, and Uncertainty - - - Search; Learning; Information and Knowledge; Communication; Belief; Unawareness

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:joecth:v:73:y:2022:i:1:d:10.1007_s00199-020-01328-3. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Robust experimentation in the continuous time bandit problem

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Citations

Most related items

More about this item

Keywords

JEL classification:

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data