Strategic Experimentation with Poisson Bandits
We study a game of strategic experimentation with two-armed bandits where the risky arm distributes lump-sum payoffs according to a Poisson process. Its intensity is either high or low, and unknown to the players. We consider Markov perfect equilibria with beliefs as the state variable. As the belief process is piecewise deterministic, payoff functions solve differential-difference equations. There is no equilibrium where all players use cut-off strategies, and all equilibria exhibit an 'encouragement effect' relative to the single-agent optimum. We construct asymmetric equilibria in which players have symmetric continuation values at sufficiently optimistic beliefs yet take turns playing the risky arm before all experimentation stops. Owing to the encouragement effect, these equilibria Pareto dominate the unique symmetric one for sufficiently frequent turns. Rewarding the last experimenter with a higher continuation value increases the range of beliefs where players experiment, but may reduce average payoffs at more optimistic beliefs. Some equilibria exhibit an 'anticipation effect': as beliefs become more pessimistic, the continuation value of a single experimenter increases over some range because a lower belief means a shorter wait until another player takes over.
If you experience problems downloading a file, check if you have the proper application to view it first. In case of further problems read the IDEAS help page. Note that these files are not on the IDEAS site. Please be patient as the files may be large.
As the access to this document is restricted, you may want to look for a different version under "Related research" (further below) or search for a different version of it.
|Date of creation:||Apr 2009|
|Contact details of provider:|| Postal: Centre for Economic Policy Research, 77 Bastwick Street, London EC1V 3PZ.|
Phone: 44 - 20 - 7183 8801
Fax: 44 - 20 - 7183 8820
|Order Information:|| Email: |
References listed on IDEAS
Please report citation or reference errors to , or , if you are the registered author of the cited work, log in to your RePEc Author Service profile, click on "citations" and make appropriate adjustments.:
- Decamps, Jean-Paul & Mariotti, Thomas, 2004. "Investment timing and learning externalities," Journal of Economic Theory, Elsevier, vol. 118(1), pages 80-102, September.
- Nicolas Klein & Sven Rady, 2011.
"Negatively Correlated Bandits,"
Review of Economic Studies,
Oxford University Press, vol. 78(2), pages 693-732.
- Klein, Nicolas & Rady, Sven, 2008. "Negatively Correlated Bandits," CEPR Discussion Papers 6983, C.E.P.R. Discussion Papers.
- Klein, Nicolas & Rady, Sven, 2008. "Negatively Correlated Bandits," Discussion Paper Series of SFB/TR 15 Governance and the Efficiency of Economic Systems 243, Free University of Berlin, Humboldt University of Berlin, University of Bonn, University of Mannheim, University of Munich.
- Klein, Nicolas & Rady, Sven, 2008. "Negatively Correlated Bandits," Discussion Papers in Economics 5332, University of Munich, Department of Economics.
- Sven Rady & Nicolas Klein, 2008. "Negatively Correlated Bandits," 2008 Meeting Papers 136, Society for Economic Dynamics.
- Nicolas Klein & Sven Rady, 2008. "Negatively Correlated Bandits," Working Papers 040, Bavarian Graduate Program in Economics (BGPE).
- Godfrey Keller & Sven Rady & Martin Cripps, 2005. "Strategic Experimentation with Exponential Bandits," Econometrica, Econometric Society, vol. 73(1), pages 39-68, 01.
- Cripps, Martin William & Keller, R Godfrey & Rady, Sven, 2003. "Strategic Experimentation with Exponential Bandits," CEPR Discussion Papers 3814, C.E.P.R. Discussion Papers.
- Cripps, Martin & Keller, Godfrey & Rady, Sven, 2003. "Strategic Experimentation with Exponential Bandits," Discussion Papers in Economics 4, University of Munich, Department of Economics.
- Nicolas Klein, 2009. "Free-Riding And Delegation In Research Teams," 2009 Meeting Papers 253, Society for Economic Dynamics.
- Guiseppe Moscarini & Francesco Squintani, 2004. "Competitive Experimentation with Private Information," Cowles Foundation Discussion Papers 1489, Cowles Foundation for Research in Economics, Yale University.
- Patrick Bolton & Christopher Harris, 1999. "Strategic Experimentation," Econometrica, Econometric Society, vol. 67(2), pages 349-374, March.
- Rothschild, Michael, 1974. "A two-armed bandit theory of market pricing," Journal of Economic Theory, Elsevier, vol. 9(2), pages 185-202, October.
- Dirk Bergemann & Juuso Valimaki, 2006. "Bandit Problems," Cowles Foundation Discussion Papers 1551, Cowles Foundation for Research in Economics, Yale University. Full references (including those not matched with items on IDEAS)
When requesting a correction, please mention this item's handle: RePEc:cpr:ceprdp:7270. See general information about how to correct material in RePEc.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ()
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
If references are entirely missing, you can add them using this form.
If the full references list an item that is present in RePEc, but the system did not link to it, you can help with this form.
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your profile, as there may be some citations waiting for confirmation.
Please note that corrections may take a couple of weeks to filter through the various RePEc services.