Strategic Learning in Teams
This paper analyzes a two-player game of strategic experimentation with three-armed exponential bandits in continuous time. Players face replica bandits, with one arm that is safe in that it generates a known payoff, whereas the likelihood of the risky arms' yielding a positive payoff is initially unknown. It is common knowledge that the types of the two risky arms are perfectly negatively correlated. I show that the efficient policy is incentive-compatible if, and only if, the stakes are high enough. Moreover, learning will be complete in any Markov perfect equilibrium with continuous value functions if, and only if, the stakes exceed a certain threshold.
|Date of creation:||Jul 2010|
|Contact details of provider:|| Postal: Geschwister-Scholl-Platz 1, D-80539 Munich, Germany|
Web page: http://www.sfbtr15.de/
Handle: RePEc:trf:wpaper:333