IDEAS home Printed from https://ideas.repec.org/p/hal/journl/hal-04206821.html
   My bibliography  Save this paper

Overcoming Free-Riding in Bandit Games

Author

Listed:
  • Johannes Hörner

    (TSE-R - Toulouse School of Economics - UT Capitole - Université Toulouse Capitole - UT - Université de Toulouse - EHESS - École des hautes études en sciences sociales - CNRS - Centre National de la Recherche Scientifique - INRAE - Institut National de Recherche pour l’Agriculture, l’Alimentation et l’Environnement, Yale University [New Haven], CEPR - Centre for Economic Policy Research)

  • Nicolas Klein

    (UdeM - Université de Montréal, CIREQ - Centre interuniversitaire de recherche en économie quantitative)

  • Sven Rady

    (CEPR - Centre for Economic Policy Research, Universität Bonn = University of Bonn)

Abstract

This article considers a class of experimentation games with Lévy bandits encompassing those of Bolton and Harris (1999, Econometrica, 67, 349–374) and Keller, Rady, and Cripps (2005, Econometrica, 73, 39–68). Its main result is that efficient (perfect Bayesian) equilibria exist whenever players' payoffs have a diffusion component. Hence, the trade-offs emphasized in the literature do not rely on the intrinsic nature of bandit models but on the commonly adopted solution concept (Markov perfect equilibrium). This is not an artefact of continuous time: we prove that efficient equilibria arise as limits of equilibria in the discrete-time game. Furthermore, it suffices to relax the solution concept to strongly symmetric equilibrium.

Suggested Citation

  • Johannes Hörner & Nicolas Klein & Sven Rady, 2022. "Overcoming Free-Riding in Bandit Games," Post-Print hal-04206821, HAL.
  • Handle: RePEc:hal:journl:hal-04206821
    DOI: 10.1093/restud/rdab078
    as

    Download full text from publisher

    To our knowledge, this item is not available for download. To find whether it is available, there are three options:
    1. Check below whether another version of this item is available online.
    2. Check on the provider's web page whether it is in fact available.
    3. Perform a search for a similarly titled item that would be available.

    Other versions of this item:

    References listed on IDEAS

    as
    1. Johannes Hörner & Larry Samuelson, 2013. "Incentives for experimenting agents," RAND Journal of Economics, RAND Corporation, vol. 44(4), pages 632-663, December.
    2. Drew Fudenberg & David K. Levine & Satoru Takahashi, 2008. "Perfect public equilibrium when players are patient," World Scientific Book Chapters, in: Drew Fudenberg & David K Levine (ed.), A Long-Run Collaboration On Long-Run Games, chapter 16, pages 345-367, World Scientific Publishing Co. Pte. Ltd..
    3. Bruno Biais & Thomas Mariotti & Guillaume Plantin & Jean-Charles Rochet, 2007. "Dynamic Security Design: Convergence to Continuous Time and Asset Pricing Implications," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 74(2), pages 345-390.
    4. Godfrey Keller & Sven Rady & Martin Cripps, 2005. "Strategic Experimentation with Exponential Bandits," Econometrica, Econometric Society, vol. 73(1), pages 39-68, January.
    5. Tomasz Sadzik & Ennio Stacchetti, 2015. "Agency Models With Frequent Actions," Econometrica, Econometric Society, vol. 83, pages 193-237, January.
    6. Simon, Leo K & Stinchcombe, Maxwell B, 1995. "Equilibrium Refinement for Infinite Normal-Form Games," Econometrica, Econometric Society, vol. 63(6), pages 1421-1443, November.
    7. , & ,, 2010. "Strategic experimentation with Poisson bandits," Theoretical Economics, Econometric Society, vol. 5(2), May.
    8. Heidhues, Paul & Rady, Sven & Strack, Philipp, 2015. "Strategic experimentation with private payoffs," Journal of Economic Theory, Elsevier, vol. 159(PA), pages 531-551.
    9. Dutta Prajit K., 1995. "A Folk Theorem for Stochastic Games," Journal of Economic Theory, Elsevier, vol. 66(1), pages 1-32, June.
    10. Abrea Dilip & Pearce David & Stacchetti Ennio, 1993. "Renegotiation and Symmetry in Repeated Games," Journal of Economic Theory, Elsevier, vol. 60(2), pages 217-240, August.
    11. Avinash K. Dixit & Robert S. Pindyck, 1994. "Investment under Uncertainty," Economics Books, Princeton University Press, edition 1, number 5474.
    12. repec:cwl:cwldpp:1726rrr is not listed on IDEAS
    13. repec:cwl:cwldpp:1726rr is not listed on IDEAS
    14. Bergin, James & MacLeod, W Bentley, 1993. "Continuous Time Repeated Games," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 34(1), pages 21-37, February.
    15. Abreu, Dilip & Pearce, David & Stacchetti, Ennio, 1986. "Optimal cartel equilibria with imperfect monitoring," Journal of Economic Theory, Elsevier, vol. 39(1), pages 251-269, June.
    16. Abreu, Dilip, 1986. "Extremal equilibria of oligopolistic supergames," Journal of Economic Theory, Elsevier, vol. 39(1), pages 191-225, June.
    17. Johannes Hörner & Takuo Sugaya & Satoru Takahashi & Nicolas Vieille, 2011. "Recursive Methods in Discounted Stochastic Games: An Algorithm for δ→ 1 and a Folk Theorem," Econometrica, Econometric Society, vol. 79(4), pages 1277-1318, July.
    18. Drew Fudenberg & David K. Levine, 2009. "Repeated Games with Frequent Signals," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 124(1), pages 233-265.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Hwang, Ilwoo, 2023. "Policy experimentation with repeated elections," Games and Economic Behavior, Elsevier, vol. 142(C), pages 623-644.
    2. Doruk Cetemen & Can Urgun & Leeat Yariv, 2023. "Collective Progress: Dynamics of Exit Waves," Journal of Political Economy, University of Chicago Press, vol. 131(9), pages 2402-2450.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Sven Rady & Nicolas Klein & Johannes Horner, 2013. "Strongly Symmetric Equilibria in Bandit Games," 2013 Meeting Papers 1107, Society for Economic Dynamics.
    2. Weng, Xi, 2015. "Dynamic pricing in the presence of individual learning," Journal of Economic Theory, Elsevier, vol. 155(C), pages 262-299.
    3. Johannes Hörner & Larry Samuelson, 2013. "Incentives for experimenting agents," RAND Journal of Economics, RAND Corporation, vol. 44(4), pages 632-663, December.
    4. Rodivilov, Alexander, 2022. "Monitoring innovation," Games and Economic Behavior, Elsevier, vol. 135(C), pages 297-326.
    5. Kimmo Berg, 2016. "Elementary Subpaths in Discounted Stochastic Games," Dynamic Games and Applications, Springer, vol. 6(3), pages 304-323, September.
    6. Xie, Yinxi & Xie, Yang, 2017. "Machiavellian experimentation," Journal of Comparative Economics, Elsevier, vol. 45(4), pages 685-711.
    7. Forand, Jean Guillaume, 2015. "Keeping your options open," Journal of Economic Dynamics and Control, Elsevier, vol. 53(C), pages 47-68.
    8. Mira Frick & Yuhta Ishii, 2015. "Innovation Adoption by Forward-Looking Social Learners," Cowles Foundation Discussion Papers 1877, Cowles Foundation for Research in Economics, Yale University.
    9. repec:cwl:cwldpp:1726rrr is not listed on IDEAS
    10. Wagner, Peter A. & Klein, Nicolas, 2022. "Strategic investment and learning with private information," Journal of Economic Theory, Elsevier, vol. 204(C).
    11. Osório António M., 2012. "A Folk Theorem for Games when Frequent Monitoring Decreases Noise," The B.E. Journal of Theoretical Economics, De Gruyter, vol. 12(1), pages 1-27, April.
    12. Fudenberg, Drew & Ishii, Yuhta & Kominers, Scott Duke, 2014. "Delayed-response strategies in repeated games with observation lags," Journal of Economic Theory, Elsevier, vol. 150(C), pages 487-514.
    13. Sofia Moroni, 2016. "Experimentation in Organizations," Working Paper 5876, Department of Economics, University of Pittsburgh.
    14. Boyarchenko, Svetlana, 2021. "Inefficiency of sponsored research," Journal of Mathematical Economics, Elsevier, vol. 95(C).
    15. Nicolas Klein & Sven Rady, 2011. "Negatively Correlated Bandits," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 78(2), pages 693-732.
    16. Pearce, David & Stacchetti, Ennio, 1997. "Time Consistent Taxation by a Government with Redistributive Goals," Journal of Economic Theory, Elsevier, vol. 72(2), pages 282-305, February.
    17. Vincent Anesi & T Renee Bowen, 2018. "Policy Experimentation, Redistribution and Voting Rules," Discussion Papers 2018-09, The Centre for Decision Research and Experimental Economics, School of Economics, University of Nottingham.
    18. Keller, Godfrey & Rady, Sven, 2015. "Breakdowns," Theoretical Economics, Econometric Society, vol. 10(1), January.
    19. Svetlana Boyarchenko, 2020. "Super- and submodularity of stopping games with random observations," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 70(4), pages 983-1022, November.
    20. Keller, Godfrey & Novák, Vladimír & Willems, Tim, 2019. "A note on optimal experimentation under risk aversion," Journal of Economic Theory, Elsevier, vol. 179(C), pages 476-487.
    21. Alessandro Lizzeri & Eran Shmaya & Leeat Yariv, 2024. "Disentangling Exploration from Exploitation," Papers 2404.19116, arXiv.org.

    More about this item

    Keywords

    Two-armed bandit; Bayesian learning; Strategic experimentation; Strongly symmetric equilibrium;
    All these keywords.

    JEL classification:

    • C73 - Mathematical and Quantitative Methods - - Game Theory and Bargaining Theory - - - Stochastic and Dynamic Games; Evolutionary Games
    • D83 - Microeconomics - - Information, Knowledge, and Uncertainty - - - Search; Learning; Information and Knowledge; Communication; Belief; Unawareness

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:hal:journl:hal-04206821. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: CCSD (email available below). General contact details of provider: https://hal.archives-ouvertes.fr/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.