Exploration versus exploitation: A laboratory test of the single‐agent exponential bandit model

My bibliography Save this article

Exploration versus exploitation: A laboratory test of the single‐agent exponential bandit model

Author

Listed:

Stanton Hudja
Daniel Woods

Registered:

Abstract

This paper analyzes how individuals resolve an exploration versus exploitation trade‐off in a laboratory experiment. The experiment implements the single‐agent exponential bandit model. We analyze how subjects respond to changes in the prior belief, safe action, and discount factor. We find that subjects respond in the predicted direction to these changes. However, we find that subjects under‐respond to the prior belief, under‐respond to the safe action, and typically explore less than predicted. Our results suggest that neither risk aversion nor the random termination probability are driving under‐experimentation. Our results are consistent with subjects having incorrect beliefs about exploration.

Suggested Citation

Stanton Hudja & Daniel Woods, 2024. "Exploration versus exploitation: A laboratory test of the single‐agent exponential bandit model," Economic Inquiry, Western Economic Association International, vol. 62(1), pages 267-286, January.

Handle: RePEc:bla:ecinqu:v:62:y:2024:i:1:p:267-286
DOI: 10.1111/ecin.13164

Download full text from publisher

References listed on IDEAS

Herbert A. Simon, 1955. "A Behavioral Model of Rational Choice," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 69(1), pages 99-118.
James G. MacKinnon, 2002. "Bootstrap inference in econometrics," Canadian Journal of Economics, Canadian Economics Association, vol. 35(4), pages 615-645, November.
- James G. MacKinnon, 2002. "Bootstrap inference in econometrics," Canadian Journal of Economics/Revue canadienne d'économique, John Wiley & Sons, vol. 35(4), pages 615-645, November.
Godfrey Keller & Sven Rady & Martin Cripps, 2005. "Strategic Experimentation with Exponential Bandits," Econometrica, Econometric Society, vol. 73(1), pages 39-68, January.
- Rady, Sven & Cripps, Martin William & Keller, R Godfrey, 2003. "Strategic Experimentation with Exponential Bandits," CEPR Discussion Papers 3814, C.E.P.R. Discussion Papers.
- Cripps, Martin & Keller, Godfrey & Rady, Sven, 2003. "Strategic Experimentation with Exponential Bandits," Discussion Papers in Economics 4, University of Munich, Department of Economics.
- Godfrey Keller & Martin Cripps & Olin School of Business & Washington University & Sven Rady & Department of Economics & University of Munich, 2003. "Strategic Experimentation with Exponential Bandits," Economics Series Working Papers 143, University of Oxford, Department of Economics.
Ryan Oprea & Daniel Friedman & Steven T. Anderson, 2009. "Learning to Wait: A Laboratory Investigation," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 76(3), pages 1103-1124.
Evan Calford & Ryan Oprea, 2017. "Continuity, Inertia, and Strategic Uncertainty: A Test of the Theory of Continuous Time Games," Econometrica, Econometric Society, vol. 85, pages 915-935, May.
Hudja, Stanton, 2021. "Is Experimentation Invariant to Group Size? A Laboratory Analysis of Innovation Contests," Journal of Behavioral and Experimental Economics (formerly The Journal of Socio-Economics), Elsevier, vol. 91(C).
Ben Greiner, 2015. "Subject pool recruitment procedures: organizing experiments with ORSEE," Journal of the Economic Science Association, Springer;Economic Science Association, vol. 1(1), pages 114-125, July.
Jeffrey Banks & David Porter & Mark Olson, 1997. "An experimental analysis of the bandit problem," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 10(1), pages 55-77.
Patrick Bolton & Christopher Harris, 1999. "Strategic Experimentation," Econometrica, Econometric Society, vol. 67(2), pages 349-374, March.
Cox, James C & Oaxaca, Ronald L, 1992. "Direct Tests of the Reservation Wage Property," Economic Journal, Royal Economic Society, vol. 102(415), pages 1423-1432, November.
Robert J. Meyer & Yong Shi, 1995. "Sequential Choice Under Ambiguity: Intuitive Solutions to the Armed-Bandit Problem," Management Science, INFORMS, vol. 41(5), pages 817-834, May.
James Banovetz & Ryan Oprea, 2023. "Complexity and Procedural Choice," American Economic Journal: Microeconomics, American Economic Association, vol. 15(2), pages 384-413, May.
Oprea, Ryan & Henwood, Keith & Friedman, Daniel, 2011. "Separating the Hawks from the Doves: Evidence from continuous time laboratory games," Journal of Economic Theory, Elsevier, vol. 146(6), pages 2206-2225.
- Ryan Oprea & Keith Henwood & Daniel Friedman, 2010. "Separating the Hawks from the Doves: Evidence from Continuous Time Laboratory Games," CESifo Working Paper Series 3129, CESifo.
Cox, James C & Oaxaca, Ronald L, 1989. "Laboratory Experiments with a Finite-Horizon Job-Search Model," Journal of Risk and Uncertainty, Springer, vol. 2(3), pages 301-329, September.
Ryan Oprea, 2014. "Survival versus Profit Maximization in a Dynamic Stochastic Experiment," Econometrica, Econometric Society, vol. 82, pages 2225-2255, November.
Marina Halac & Navin Kartik & Qingmin Liu, 2017. "Contests for Experimentation," Journal of Political Economy, University of Chicago Press, vol. 125(5), pages 1523-1569.
Daniel Friedman & Ryan Oprea, 2012. "A Continuous Dilemma," American Economic Review, American Economic Association, vol. 102(1), pages 337-363, February.
Steven T. Anderson & Daniel Friedman & Ryan Oprea, 2010. "Preemption Games: Theory and Experiment," American Economic Review, American Economic Association, vol. 100(4), pages 1778-1803, September.
- Anderson, Steven T & Friedman, Daniel & Oprea, Ryan, 2008. "Preemption Games: Theory and Experiment," Santa Cruz Department of Economics, Working Paper Series qt0pr4g8h1, Department of Economics, UC Santa Cruz.
Keller, Godfrey & Novák, Vladimír & Willems, Tim, 2019. "A note on optimal experimentation under risk aversion," Journal of Economic Theory, Elsevier, vol. 179(C), pages 476-487.
- Vladimir Novak & Tim Willems, 2018. "A Note on Optimal Experimentation under Risk Aversion," CERGE-EI Working Papers wp618, The Center for Economic Research and Graduate Education - Economics Institute, Prague.
Bruno Strulovici, 2010. "Learning While Voting: Determinants of Collective Experimentation," Econometrica, Econometric Society, vol. 78(3), pages 933-971, May.
- Bruno Strulovici, 2008. "Learning while voting: determinants of collective experimentation," Economics Papers 2008-W08, Economics Group, Nuffield College, University of Oxford.
Marina Halac & Navin Kartik & Qingmin Liu, 2016. "Optimal Contracts for Experimentation," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 83(3), pages 1040-1091.
Othon M. Moreno & Yaroslav Rosokha, 2016. "Learning under compound risk vs. learning under ambiguity – an experiment," Journal of Risk and Uncertainty, Springer, vol. 53(2), pages 137-162, December.
Andrew M. Davis & Elena Katok & Anthony M. Kwasnica, 2011. "Do Auctioneers Pick Optimal Reserve Prices?," Management Science, INFORMS, vol. 57(1), pages 177-192, January.
Kostas Bimpikis & Shayan Ehsani & Mohamed Mostagir, 2019. "Designing Dynamic Contests," Operations Research, INFORMS, vol. 67(2), pages 339-356, March.

Full references (including those not matched with items on IDEAS)

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Johannes Hoelzemann & Nicolas Klein, 2021. "Bandits in the lab," Quantitative Economics, Econometric Society, vol. 12(3), pages 1021-1051, July.
- HOELZEMANN, Johannes & KLEIN, Nicolas, 2018. "Bandits in the Lab," Cahiers de recherche 2018-09, Universite de Montreal, Departement de sciences economiques.
- Johannes HOELZEMANN & Nicolas KLEIN, 2018. "Bandits in the Lab," Cahiers de recherche 12-2018, Centre interuniversitaire de recherche en Ã©conomie quantitative, CIREQ.
Bosch-Rosa, Ciril, 2018. "That's how we roll: An experiment on rollover risk," Journal of Economic Behavior & Organization, Elsevier, vol. 145(C), pages 495-510.
- Ciril Bosch-Rosa, 2014. "That's how we roll: an experiment on rollover risk," SFB 649 Discussion Papers SFB649DP2014-048, Sonderforschungsbereich 649, Humboldt University, Berlin, Germany.
repec:sus:susewp:0623 is not listed on IDEAS
Fudenberg, Drew & He, Kevin, 2021. "Player-compatible learning and player-compatible equilibrium," Journal of Economic Theory, Elsevier, vol. 194(C).
- Drew Fudenberg & Kevin He, 2017. "Player-Compatible Learning and Player-Compatible Equilibrium," Papers 1712.08954, arXiv.org, revised May 2020.
Zhao, Shuchen, 2021. "Taking turns in continuous time," Journal of Economic Behavior & Organization, Elsevier, vol. 191(C), pages 257-279.
Chen, Chia-Hui & Ishida, Junichiro, 2018. "Hierarchical experimentation," Journal of Economic Theory, Elsevier, vol. 177(C), pages 365-404.
- Chia-Hui Chen & Junichiro Ishida, 2015. "Hierarchical Experimentation," ISER Discussion Paper 0949, Institute of Social and Economic Research, Osaka University.
Benndorf, Volker & Martínez-Martínez, Ismael & Normann, Hans-Theo, 2021. "Games with coupled populations: An experiment in continuous time," Journal of Economic Theory, Elsevier, vol. 195(C).
Marlats, Chantal & Ménager, Lucie, 2021. "Strategic observation with exponential bandits," Journal of Economic Theory, Elsevier, vol. 193(C).
Christopher Anderson, 2012. "Ambiguity aversion in multi-armed bandit problems," Theory and Decision, Springer, vol. 72(1), pages 15-33, January.
M. Djiguemde & D. Dubois & A. Sauquet & M. Tidball, 2022. "Continuous Versus Discrete Time in Dynamic Common Pool Resource Game Experiments," Environmental & Resource Economics, Springer;European Association of Environmental and Resource Economists, vol. 82(4), pages 985-1014, August.
- Anmina Murielle Djiguemde & Dimitri Dubois & Alexandre Sauquet & Mabel Tidball, 2021. "Continuous versus Discrete Time in Dynamic Common Pool Resource Game Experiments," CEE-M Working Papers hal-03214973, CEE-M, Universtiy of Montpellier, CNRS, INRA, Montpellier SupAgro.
- Anmina Murielle Djiguemde & Dimitri Dubois & Alexandre Sauquet & Mabel Tidball, 2022. "Continuous versus Discrete Time in Dynamic Common Pool Resource Game Experiments," Post-Print hal-03664156, HAL.
- Anmina Murielle Djiguemde & Dimitri Dubois & Alexandre Sauquet & Mabel Tidball, 2022. "Continuous versus Discrete Time in Dynamic Common Pool Resource Game Experiments," Post-Print hal-03726448, HAL.
- Anmina Murielle Djiguemde & Dimitri Dubois & Alexandre Sauquet & Mabel Tidball, 2021. "Continuous versus Discrete Time in Dynamic Common Pool Resource Game Experiments," Working Papers hal-03214973, HAL.
Doruk Cetemen & Can Urgun & Leeat Yariv, 2023. "Collective Progress: Dynamics of Exit Waves," Journal of Political Economy, University of Chicago Press, vol. 131(9), pages 2402-2450.
- Yariv, Leeat & Cetemen, Doruk & Urgun, Can, 2021. "Collective Progress: Dynamics of Exit Waves," CEPR Discussion Papers 16341, C.E.P.R. Discussion Papers.
- Doruk Cetemen & Can Urgun & Leeat Yariv, 2021. "Collective Progress: Dynamics of Exit Waves," NBER Working Papers 29008, National Bureau of Economic Research, Inc.
- Doruk Cetemen & Can Urgun & Leeat Yariv, 2021. "Collective Progress: Dynamics of Exit Waves," Working Papers 2021-34, Princeton University. Economics Department..
- Doruk Cetemen & Can Urgun & Leeat Yariv, 2021. "Collective Progress: Dynamics of Exit Waves," Papers 2107.00406, arXiv.org.
Jie Ning & Volodymyr Babich, 2018. "R&D Investments in the Presence of Knowledge Spillover and Debt Financing: Can Risk Shifting Cure Free Riding?," Manufacturing & Service Operations Management, INFORMS, vol. 20(1), pages 97-112, February.
Heidhues, Paul & Rady, Sven & Strack, Philipp, 2015. "Strategic experimentation with private payoffs," Journal of Economic Theory, Elsevier, vol. 159(PA), pages 531-551.
- Heidhues, Paul & Rady, Sven & Strack, Philipp, 2012. "Strategic Experimentation with Private Payoffs," Discussion Paper Series of SFB/TR 15 Governance and the Efficiency of Economic Systems 387, Free University of Berlin, Humboldt University of Berlin, University of Bonn, University of Mannheim, University of Munich.
- Rady, Sven & Heidhues, Paul & Strack, Philipp, 2015. "Strategic Experimentation with Private Payoffs," CEPR Discussion Papers 10634, C.E.P.R. Discussion Papers.
Khalil, Fahad & Lawarree, Jacques & Rodivilov, Alexander, 2020. "Learning from failures: Optimal contracts for experimentation and production," Journal of Economic Theory, Elsevier, vol. 190(C).
- Fahad Khalil & Jacques Lawarree & Alexander Rodivilov, 2018. "Learning from Failures: Optimal Contract for Experimentation and Production," CESifo Working Paper Series 7310, CESifo.
Luhan, Wolfgang J. & Poulsen, Anders U. & Roos, Michael W.M., 2017. "Real-time tacit bargaining, payoff focality, and coordination complexity: Experimental evidence," Games and Economic Behavior, Elsevier, vol. 102(C), pages 687-699.
- Wolfgang Luhan & Anders Poulsen & Michael Roos, 2015. "Real time tacit bargaining, payoff focality, and coordination complexity: Experimental evidence," Working Paper series, University of East Anglia, Centre for Behavioural and Experimental Social Science (CBESS) 15-11, School of Economics, University of East Anglia, Norwich, UK..
Jean Paul Rabanal & Aleksei Chernulich & John Horowitz & Olga A. Rud & Manizha Sharifova, 2019. "Market timing under public and private information," Working Papers 151, Peruvian Economic Association.
Marcoul, Philippe & Weninger, Quinn, 2008. "Search and active learning with correlated information: Empirical evidence from mid-Atlantic clam fishermen," Journal of Economic Dynamics and Control, Elsevier, vol. 32(6), pages 1921-1948, June.
- Marcoul, Philippe & Weninger, Quinn, 2008. "Search and Active Learning with Correlated Information: Empirical Evidence from Mid-Atlantic Clam Fishermen," Staff General Research Papers Archive 11601, Iowa State University, Department of Economics.
- Marcoul, Philippe & Weninger, Quinn, 2008. "Search and active learning with correlated information: Empirical evidence from mid-Atlantic clam fishermen," ISU General Staff Papers 200806010700001485, Iowa State University, Department of Economics.
Xie, Yinxi & Xie, Yang, 2017. "Machiavellian experimentation," Journal of Comparative Economics, Elsevier, vol. 45(4), pages 685-711.
Kaustav Das & Nicolas Klein & Katharina Schmid, 2020. "Strategic experimentation with asymmetric players," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 69(4), pages 1147-1175, June.
Farzad Pourbabaee, 2022. "Robust experimentation in the continuous time bandit problem," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 73(1), pages 151-181, February.
Tinghua Yu, 2021. "Accountability and learning with motivated agents," BCAM Working Papers 2107, Birkbeck Centre for Applied Macroeconomics.

More about this item

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:ecinqu:v:62:y:2024:i:1:p:267-286. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: https://edirc.repec.org/data/weaaaea.html .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Exploration versus exploitation: A laboratory test of the single‐agent exponential bandit model

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Most related items

More about this item

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data