IDEAS home Printed from https://ideas.repec.org/p/cpr/ceprdp/3814.html
   My bibliography  Save this paper

Strategic Experimentation with Exponential Bandits

Author

Listed:
  • Cripps, Martin William
  • Keller, R Godfrey
  • Rady, Sven

Abstract

This Paper studies a game of strategic experimentation with two-armed bandits whose risky arm might yield a pay-off only after some exponentially distributed random time. Because of free-riding, there is an inefficiently low level of experimentation in any equilibrium where the players use stationary Markovian strategies with posterior beliefs as the state variable. After characterizing the unique symmetric Markovian equilibrium of the game, which is in mixed strategies, we construct a variety of pure-strategy equilibria. There is no equilibrium where all players use simple cut-off strategies. Equilibria where players switch finitely often between the roles of experimenter and free-rider all lead to the same pattern of information acquisition; the efficiency of these equilibria depends on the way players share the burden of experimentation among them. In equilibria where players switch roles infinitely often, they can acquire an approximately efficient amount of information, but the rate at which it is acquired still remains inefficient; moreover, the expected pay-off of an experimenter exhibits the novel feature that it rises as players become more pessimistic. Finally, over the range of beliefs where players use both arms a positive fraction of the time, the symmetric equilibrium is dominated by any asymmetric one in terms of aggregate pay-offs.

Suggested Citation

  • Cripps, Martin William & Keller, R Godfrey & Rady, Sven, 2003. "Strategic Experimentation with Exponential Bandits," CEPR Discussion Papers 3814, C.E.P.R. Discussion Papers.
  • Handle: RePEc:cpr:ceprdp:3814
    as

    Download full text from publisher

    File URL: http://www.cepr.org/active/publications/discussion_papers/dp.php?dpno=3814
    Download Restriction: CEPR Discussion Papers are free to download for our researchers, subscribers and members. If you fall into one of these categories but have trouble downloading our papers, please contact us at subscribers@cepr.org

    As the access to this document is restricted, you may want to look for a different version below or search for a different version of it.

    Other versions of this item:

    References listed on IDEAS

    as
    1. Leslie M. Marx & Steven A. Matthews, 2000. "Dynamic Voluntary Contribution to a Public Project," Review of Economic Studies, Oxford University Press, vol. 67(2), pages 327-358.
    2. Bergemann, Dirk & Hege, Ulrich, 1998. "Venture capital financing, moral hazard, and learning," Journal of Banking & Finance, Elsevier, vol. 22(6-8), pages 703-735, August.
    3. Ben Lockwood & Jonathan P. Thomas, 2002. "Gradualism and Irreversibility," Review of Economic Studies, Oxford University Press, vol. 69(2), pages 339-356.
    4. Harris, C., 1993. "Generalized Solutions of Stochastic Differential Games in One Dimension," Papers 44, Boston University - Industry Studies Programme.
    5. Dirk Bergemann & Ulrigh Hege, 2005. "The Financing of Innovation: Learning and Stopping," RAND Journal of Economics, The RAND Corporation, vol. 36(4), pages 719-752, Winter.
    6. Patrick Bolton & Christopher Harris, 1999. "Strategic Experimentation," Econometrica, Econometric Society, vol. 67(2), pages 349-374, March.
    7. Rothschild, Michael, 1974. "A two-armed bandit theory of market pricing," Journal of Economic Theory, Elsevier, vol. 9(2), pages 185-202, October.
    8. Anat R. Admati & Motty Perry, 1991. "Joint Projects without Commitment," Review of Economic Studies, Oxford University Press, vol. 58(2), pages 259-276.
    9. Keller, Godfrey & Rady, Sven, 2003. " Price Dispersion and Learning in a Dynamic Differentiated-Goods Duopoly," RAND Journal of Economics, The RAND Corporation, vol. 34(1), pages 138-165, Spring.
    Full references (including those not matched with items on IDEAS)

    More about this item

    Keywords

    bayesian learning; exponential distribution; markov perfect equilibrium; public goods; strategic experimentation; two-armed bandits;

    JEL classification:

    • C73 - Mathematical and Quantitative Methods - - Game Theory and Bargaining Theory - - - Stochastic and Dynamic Games; Evolutionary Games
    • D83 - Microeconomics - - Information, Knowledge, and Uncertainty - - - Search; Learning; Information and Knowledge; Communication; Belief; Unawareness
    • H41 - Public Economics - - Publicly Provided Goods - - - Public Goods
    • O32 - Economic Development, Innovation, Technological Change, and Growth - - Innovation; Research and Development; Technological Change; Intellectual Property Rights - - - Management of Technological Innovation and R&D

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:cpr:ceprdp:3814. See general information about how to correct material in RePEc.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: (). General contact details of provider: .

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service hosted by the Research Division of the Federal Reserve Bank of St. Louis . RePEc uses bibliographic data supplied by the respective publishers.