IDEAS home Printed from https://ideas.repec.org/p/cpr/ceprdp/3814.html
   My bibliography  Save this paper

Strategic Experimentation with Exponential Bandits

Author

Listed:
  • Rady, Sven
  • Cripps, Martin William
  • Keller, R Godfrey

Abstract

This Paper studies a game of strategic experimentation with two-armed bandits whose risky arm might yield a pay-off only after some exponentially distributed random time. Because of free-riding, there is an inefficiently low level of experimentation in any equilibrium where the players use stationary Markovian strategies with posterior beliefs as the state variable. After characterizing the unique symmetric Markovian equilibrium of the game, which is in mixed strategies, we construct a variety of pure-strategy equilibria. There is no equilibrium where all players use simple cut-off strategies. Equilibria where players switch finitely often between the roles of experimenter and free-rider all lead to the same pattern of information acquisition; the efficiency of these equilibria depends on the way players share the burden of experimentation among them. In equilibria where players switch roles infinitely often, they can acquire an approximately efficient amount of information, but the rate at which it is acquired still remains inefficient; moreover, the expected pay-off of an experimenter exhibits the novel feature that it rises as players become more pessimistic. Finally, over the range of beliefs where players use both arms a positive fraction of the time, the symmetric equilibrium is dominated by any asymmetric one in terms of aggregate pay-offs.

Suggested Citation

  • Rady, Sven & Cripps, Martin William & Keller, R Godfrey, 2003. "Strategic Experimentation with Exponential Bandits," CEPR Discussion Papers 3814, C.E.P.R. Discussion Papers.
  • Handle: RePEc:cpr:ceprdp:3814
    as

    Download full text from publisher

    File URL: https://cepr.org/publications/DP3814
    Download Restriction: CEPR Discussion Papers are free to download for our researchers, subscribers and members. If you fall into one of these categories but have trouble downloading our papers, please contact us at subscribers@cepr.org
    ---><---

    As the access to this document is restricted, you may want to look for a different version below or search for a different version of it.

    Other versions of this item:

    References listed on IDEAS

    as
    1. Leslie M. Marx & Steven A. Matthews, 2000. "Dynamic Voluntary Contribution to a Public Project," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 67(2), pages 327-358.
    2. David A. Malueg & Shunichi O. Tsutsui, 1997. "Dynamic R&D Competition with Learning," RAND Journal of Economics, The RAND Corporation, vol. 28(4), pages 751-772, Winter.
    3. Patrick Bolton & Christopher Harris, 1999. "Strategic Experimentation," Econometrica, Econometric Society, vol. 67(2), pages 349-374, March.
    4. Dirk Bergemann & Ulrigh Hege, 2005. "The Financing of Innovation: Learning and Stopping," RAND Journal of Economics, The RAND Corporation, vol. 36(4), pages 719-752, Winter.
    5. Rothschild, Michael, 1974. "A two-armed bandit theory of market pricing," Journal of Economic Theory, Elsevier, vol. 9(2), pages 185-202, October.
    6. Ben Lockwood & Jonathan P. Thomas, 2002. "Gradualism and Irreversibility," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 69(2), pages 339-356.
    7. Anat R. Admati & Motty Perry, 1991. "Joint Projects without Commitment," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 58(2), pages 259-276.
    8. Bergemann, Dirk & Hege, Ulrich, 1998. "Venture capital financing, moral hazard, and learning," Journal of Banking & Finance, Elsevier, vol. 22(6-8), pages 703-735, August.
    9. Christopher Harris, 1993. "Generalized Solutions of Stochastic Differential Games in One Dimension," Papers 0044, Boston University - Industry Studies Programme.
    10. Keller, Godfrey & Rady, Sven, 2003. "Price Dispersion and Learning in a Dynamic Differentiated-Goods Duopoly," RAND Journal of Economics, The RAND Corporation, vol. 34(1), pages 138-165, Spring.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Martin Cripps & Godfrey Keller & Sven Rady, 2000. "Strategic Experimentation: The Case of the Poisson Bandits," Econometric Society World Congress 2000 Contributed Papers 0878, Econometric Society.
    2. Heidhues, Paul & Rady, Sven & Strack, Philipp, 2015. "Strategic experimentation with private payoffs," Journal of Economic Theory, Elsevier, vol. 159(PA), pages 531-551.
    3. Chen, Yi, 2020. "A revision game of experimentation on a common threshold," Journal of Economic Theory, Elsevier, vol. 186(C).
    4. Johannes Hörner & Larry Samuelson, 2013. "Incentives for experimenting agents," RAND Journal of Economics, RAND Corporation, vol. 44(4), pages 632-663, December.
    5. Sorensen, Morten, 2007. "Learning by Investing: Evidence from Venture Capital," SIFR Research Report Series 53, Institute for Financial Research.
    6. Alessandro Bonatti & Johannes Horner, 2011. "Collaborating," American Economic Review, American Economic Association, vol. 101(2), pages 632-663, April.
    7. Külpmann, Philipp, 2015. "Procrastination and projects," Center for Mathematical Economics Working Papers 544, Center for Mathematical Economics, Bielefeld University.
    8. Chen, Chia-Hui & Ishida, Junichiro, 2018. "Hierarchical experimentation," Journal of Economic Theory, Elsevier, vol. 177(C), pages 365-404.
    9. Matros, Alexander & Smirnov, Vladimir, 2011. "Treasure game," Working Papers 2011-10, University of Sydney, School of Economics, revised May 2014.
    10. , & ,, 2010. "Strategic experimentation with Poisson bandits," Theoretical Economics, Econometric Society, vol. 5(2), May.
    11. Nicolas Klein & Tymofiy Mylovanov, 2011. "Should the Flatterers be Avoided?," 2011 Meeting Papers 1273, Society for Economic Dynamics.
    12. Doruk Cetemen & Can Urgun & Leeat Yariv, 2023. "Collective Progress: Dynamics of Exit Waves," Journal of Political Economy, University of Chicago Press, vol. 131(9), pages 2402-2450.
    13. Besanko, David & Tong, Jian & Wu, Jianjun, 2016. "Subsidizing research programs with "if" and "when" uncertainty in the face of severe informational constraints," Discussion Paper Series In Economics And Econometrics 1605, Economics Division, School of Social Sciences, University of Southampton.
    14. Matros, Alexander & Smirnov, Vladimir, 2016. "Duplicative search," Games and Economic Behavior, Elsevier, vol. 99(C), pages 1-22.
    15. May Elsayyad & Florian Morath, 2016. "Technology Transfers For Climate Change," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 57(3), pages 1057-1084, August.
    16. Aghamolla, Cyrus & Hashimoto, Tadashi, 2020. "Information arrival, delay, and clustering in financial markets with dynamic freeriding," Journal of Financial Economics, Elsevier, vol. 138(1), pages 27-52.
    17. Georgiadis, George, 2017. "Deadlines and infrequent monitoring in the dynamic provision of public goods," Journal of Public Economics, Elsevier, vol. 152(C), pages 1-12.
    18. Khalil, Fahad & Lawarree, Jacques & Rodivilov, Alexander, 2020. "Learning from failures: Optimal contracts for experimentation and production," Journal of Economic Theory, Elsevier, vol. 190(C).
    19. Arthur Charpentier & Romuald Élie & Carl Remlinger, 2023. "Reinforcement Learning in Economics and Finance," Computational Economics, Springer;Society for Computational Economics, vol. 62(1), pages 425-462, June.
    20. Keller, Godfrey & Rady, Sven, 2020. "Undiscounted bandit games," Games and Economic Behavior, Elsevier, vol. 124(C), pages 43-61.

    More about this item

    Keywords

    Strategic experimentation; Two-armed bandits; Exponential distribution; Bayesian learning; Markov perfect equilibrium; Public goods;
    All these keywords.

    JEL classification:

    • C73 - Mathematical and Quantitative Methods - - Game Theory and Bargaining Theory - - - Stochastic and Dynamic Games; Evolutionary Games
    • D83 - Microeconomics - - Information, Knowledge, and Uncertainty - - - Search; Learning; Information and Knowledge; Communication; Belief; Unawareness
    • H41 - Public Economics - - Publicly Provided Goods - - - Public Goods
    • O32 - Economic Development, Innovation, Technological Change, and Growth - - Innovation; Research and Development; Technological Change; Intellectual Property Rights - - - Management of Technological Innovation and R&D

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:cpr:ceprdp:3814. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: the person in charge (email available below). General contact details of provider: https://www.cepr.org .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.