In the canonical learning model, the multi-armed bandit with independent arms, a decision maker learns about the different alternatives only through his private experience. It is well known that any optimal experimentation strategy for this problem is ex-post inefficient: it sometimes leads the superior alternative to be dropped altogether. Many situations of interest, however, involve learning from individual experience and the experience of others. This paper shows how learning in society can overcome this inefficiency. We consider an economy populated with a continuum of infinitely lived agents where each one of them faces a multi-armed bandit. The unknown stochastic payoffs of each arm are the same for all agents. In each period, they are randomly and anonymously matched in pairs, and in any such match they observe their partner's current action choice and its outcome. We establish that if initial beliefs are sufficiently heterogeneous, then the fraction of the population choosing the superior arm converges to one in any perfect bayesian equilibrium of this game. We also show that the same conclusion holds when only action choices are observable within a match and the number of arms is two
Download Info
To download:
If you experience problems downloading a file, check if you have the
proper application to
view it first. Information about this may be contained
in the File-Format links below. In case of further problems read
the IDEAS help
page. Note that these files are not on the IDEAS
site. Please be patient as the files may be large.
Publisher Info
Paper provided by Society for Economic Dynamics in its series 2006 Meeting Papers with number
435.
Length: Date of creation: 03 Dec 2006 Date of revision: Handle: RePEc:red:sed006:435
Contact details of provider: Postal: Society for Economic Dynamics Anne Stubing CV Starr Center for Applied Economics 269 Mercer Street, Room 303 New York University New York, NY 10003 Fax: 1-860-486-4463 Email: Web page: http://www.EconomicDynamics.org/society.htm More information through EDIRC
For technical questions regarding this item, or to correct its listing, contact: (Christian Zimmermann).
Find related papers by JEL classification: C73 - Mathematical and Quantitative Methods - - Game Theory and Bargaining Theory - - - Stochastic and Dynamic Games; Evolutionary Games D82 - Microeconomics - - Information, Knowledge, and Uncertainty - - - Asymmetric and Private Information D83 - Microeconomics - - Information, Knowledge, and Uncertainty - - - Search, Learning, and Information
This paper has been announced in the following NEP Reports:
Cited by: (explanations, Please report citation or reference errors to , or , if you are the registered author of the cited work, log in to your RePEc Author Service profile, click on "citations" and make appropriate adjustments.)