IDEAS home Printed from https://ideas.repec.org/a/bla/ecinqu/v62y2024i1p267-286.html
   My bibliography  Save this article

Exploration versus exploitation: A laboratory test of the single‐agent exponential bandit model

Author

Listed:
  • Stanton Hudja
  • Daniel Woods

Abstract

This paper analyzes how individuals resolve an exploration versus exploitation trade‐off in a laboratory experiment. The experiment implements the single‐agent exponential bandit model. We analyze how subjects respond to changes in the prior belief, safe action, and discount factor. We find that subjects respond in the predicted direction to these changes. However, we find that subjects under‐respond to the prior belief, under‐respond to the safe action, and typically explore less than predicted. Our results suggest that neither risk aversion nor the random termination probability are driving under‐experimentation. Our results are consistent with subjects having incorrect beliefs about exploration.

Suggested Citation

  • Stanton Hudja & Daniel Woods, 2024. "Exploration versus exploitation: A laboratory test of the single‐agent exponential bandit model," Economic Inquiry, Western Economic Association International, vol. 62(1), pages 267-286, January.
  • Handle: RePEc:bla:ecinqu:v:62:y:2024:i:1:p:267-286
    DOI: 10.1111/ecin.13164
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/ecin.13164
    Download Restriction: no

    File URL: https://libkey.io/10.1111/ecin.13164?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Herbert A. Simon, 1955. "A Behavioral Model of Rational Choice," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 69(1), pages 99-118.
    2. James G. MacKinnon, 2002. "Bootstrap inference in econometrics," Canadian Journal of Economics, Canadian Economics Association, vol. 35(4), pages 615-645, November.
    3. Godfrey Keller & Sven Rady & Martin Cripps, 2005. "Strategic Experimentation with Exponential Bandits," Econometrica, Econometric Society, vol. 73(1), pages 39-68, January.
    4. Ryan Oprea & Daniel Friedman & Steven T. Anderson, 2009. "Learning to Wait: A Laboratory Investigation," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 76(3), pages 1103-1124.
    5. Evan Calford & Ryan Oprea, 2017. "Continuity, Inertia, and Strategic Uncertainty: A Test of the Theory of Continuous Time Games," Econometrica, Econometric Society, vol. 85, pages 915-935, May.
    6. Hudja, Stanton, 2021. "Is Experimentation Invariant to Group Size? A Laboratory Analysis of Innovation Contests," Journal of Behavioral and Experimental Economics (formerly The Journal of Socio-Economics), Elsevier, vol. 91(C).
    7. Ben Greiner, 2015. "Subject pool recruitment procedures: organizing experiments with ORSEE," Journal of the Economic Science Association, Springer;Economic Science Association, vol. 1(1), pages 114-125, July.
    8. Jeffrey Banks & David Porter & Mark Olson, 1997. "An experimental analysis of the bandit problem," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 10(1), pages 55-77.
    9. Patrick Bolton & Christopher Harris, 1999. "Strategic Experimentation," Econometrica, Econometric Society, vol. 67(2), pages 349-374, March.
    10. Cox, James C & Oaxaca, Ronald L, 1992. "Direct Tests of the Reservation Wage Property," Economic Journal, Royal Economic Society, vol. 102(415), pages 1423-1432, November.
    11. Robert J. Meyer & Yong Shi, 1995. "Sequential Choice Under Ambiguity: Intuitive Solutions to the Armed-Bandit Problem," Management Science, INFORMS, vol. 41(5), pages 817-834, May.
    12. James Banovetz & Ryan Oprea, 2023. "Complexity and Procedural Choice," American Economic Journal: Microeconomics, American Economic Association, vol. 15(2), pages 384-413, May.
    13. Oprea, Ryan & Henwood, Keith & Friedman, Daniel, 2011. "Separating the Hawks from the Doves: Evidence from continuous time laboratory games," Journal of Economic Theory, Elsevier, vol. 146(6), pages 2206-2225.
    14. Cox, James C & Oaxaca, Ronald L, 1989. "Laboratory Experiments with a Finite-Horizon Job-Search Model," Journal of Risk and Uncertainty, Springer, vol. 2(3), pages 301-329, September.
    15. Ryan Oprea, 2014. "Survival versus Profit Maximization in a Dynamic Stochastic Experiment," Econometrica, Econometric Society, vol. 82, pages 2225-2255, November.
    16. Marina Halac & Navin Kartik & Qingmin Liu, 2017. "Contests for Experimentation," Journal of Political Economy, University of Chicago Press, vol. 125(5), pages 1523-1569.
    17. Daniel Friedman & Ryan Oprea, 2012. "A Continuous Dilemma," American Economic Review, American Economic Association, vol. 102(1), pages 337-363, February.
    18. Steven T. Anderson & Daniel Friedman & Ryan Oprea, 2010. "Preemption Games: Theory and Experiment," American Economic Review, American Economic Association, vol. 100(4), pages 1778-1803, September.
    19. Keller, Godfrey & Novák, Vladimír & Willems, Tim, 2019. "A note on optimal experimentation under risk aversion," Journal of Economic Theory, Elsevier, vol. 179(C), pages 476-487.
    20. Bruno Strulovici, 2010. "Learning While Voting: Determinants of Collective Experimentation," Econometrica, Econometric Society, vol. 78(3), pages 933-971, May.
    21. Marina Halac & Navin Kartik & Qingmin Liu, 2016. "Optimal Contracts for Experimentation," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 83(3), pages 1040-1091.
    22. Othon M. Moreno & Yaroslav Rosokha, 2016. "Learning under compound risk vs. learning under ambiguity – an experiment," Journal of Risk and Uncertainty, Springer, vol. 53(2), pages 137-162, December.
    23. Andrew M. Davis & Elena Katok & Anthony M. Kwasnica, 2011. "Do Auctioneers Pick Optimal Reserve Prices?," Management Science, INFORMS, vol. 57(1), pages 177-192, January.
    24. Kostas Bimpikis & Shayan Ehsani & Mohamed Mostagir, 2019. "Designing Dynamic Contests," Operations Research, INFORMS, vol. 67(2), pages 339-356, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Johannes Hoelzemann & Nicolas Klein, 2021. "Bandits in the lab," Quantitative Economics, Econometric Society, vol. 12(3), pages 1021-1051, July.
    2. Bosch-Rosa, Ciril, 2018. "That's how we roll: An experiment on rollover risk," Journal of Economic Behavior & Organization, Elsevier, vol. 145(C), pages 495-510.
    3. repec:sus:susewp:0623 is not listed on IDEAS
    4. Fudenberg, Drew & He, Kevin, 2021. "Player-compatible learning and player-compatible equilibrium," Journal of Economic Theory, Elsevier, vol. 194(C).
    5. Zhao, Shuchen, 2021. "Taking turns in continuous time," Journal of Economic Behavior & Organization, Elsevier, vol. 191(C), pages 257-279.
    6. Chen, Chia-Hui & Ishida, Junichiro, 2018. "Hierarchical experimentation," Journal of Economic Theory, Elsevier, vol. 177(C), pages 365-404.
    7. Benndorf, Volker & Martínez-Martínez, Ismael & Normann, Hans-Theo, 2021. "Games with coupled populations: An experiment in continuous time," Journal of Economic Theory, Elsevier, vol. 195(C).
    8. Marlats, Chantal & Ménager, Lucie, 2021. "Strategic observation with exponential bandits," Journal of Economic Theory, Elsevier, vol. 193(C).
    9. Christopher Anderson, 2012. "Ambiguity aversion in multi-armed bandit problems," Theory and Decision, Springer, vol. 72(1), pages 15-33, January.
    10. M. Djiguemde & D. Dubois & A. Sauquet & M. Tidball, 2022. "Continuous Versus Discrete Time in Dynamic Common Pool Resource Game Experiments," Environmental & Resource Economics, Springer;European Association of Environmental and Resource Economists, vol. 82(4), pages 985-1014, August.
    11. Doruk Cetemen & Can Urgun & Leeat Yariv, 2023. "Collective Progress: Dynamics of Exit Waves," Journal of Political Economy, University of Chicago Press, vol. 131(9), pages 2402-2450.
    12. Jie Ning & Volodymyr Babich, 2018. "R&D Investments in the Presence of Knowledge Spillover and Debt Financing: Can Risk Shifting Cure Free Riding?," Manufacturing & Service Operations Management, INFORMS, vol. 20(1), pages 97-112, February.
    13. Heidhues, Paul & Rady, Sven & Strack, Philipp, 2015. "Strategic experimentation with private payoffs," Journal of Economic Theory, Elsevier, vol. 159(PA), pages 531-551.
    14. Khalil, Fahad & Lawarree, Jacques & Rodivilov, Alexander, 2020. "Learning from failures: Optimal contracts for experimentation and production," Journal of Economic Theory, Elsevier, vol. 190(C).
    15. Luhan, Wolfgang J. & Poulsen, Anders U. & Roos, Michael W.M., 2017. "Real-time tacit bargaining, payoff focality, and coordination complexity: Experimental evidence," Games and Economic Behavior, Elsevier, vol. 102(C), pages 687-699.
    16. Jean Paul Rabanal & Aleksei Chernulich & John Horowitz & Olga A. Rud & Manizha Sharifova, 2019. "Market timing under public and private information," Working Papers 151, Peruvian Economic Association.
    17. Marcoul, Philippe & Weninger, Quinn, 2008. "Search and active learning with correlated information: Empirical evidence from mid-Atlantic clam fishermen," Journal of Economic Dynamics and Control, Elsevier, vol. 32(6), pages 1921-1948, June.
    18. Xie, Yinxi & Xie, Yang, 2017. "Machiavellian experimentation," Journal of Comparative Economics, Elsevier, vol. 45(4), pages 685-711.
    19. Kaustav Das & Nicolas Klein & Katharina Schmid, 2020. "Strategic experimentation with asymmetric players," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 69(4), pages 1147-1175, June.
    20. Farzad Pourbabaee, 2022. "Robust experimentation in the continuous time bandit problem," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 73(1), pages 151-181, February.
    21. Tinghua Yu, 2021. "Accountability and learning with motivated agents," BCAM Working Papers 2107, Birkbeck Centre for Applied Macroeconomics.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:ecinqu:v:62:y:2024:i:1:p:267-286. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: https://edirc.repec.org/data/weaaaea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.