The Speed of Learning in Noisy Games: Partial Reinforcement and the Sustainability of Cooperation
AbstractIn an experiment, players? ability to learn to cooperate in the repeated prisoner?s dilemma was substantially diminished when the payoffs were noisy, even though players could monitor one another?s past actions perfectly. In contrast, in one-time play against a succession of opponents, noisy payoffs increased cooperation, by slowing the rate at which cooperation decays. These observations are consistent with the robust observation from the psychology literature that partial reinforcement (adding randomness to the link between an action and its consequences while holding expected payoffs constant) slows learning. This effect is magnified in the repeated game: when others are slow to learn to cooperate, the benefits of cooperation are reduced, which further hampers cooperation. These results show that a small change in the payoff environment, which changes the speed of individual learning, can have a large effect on collective behavior. And they show that there may be interesting comparative dynamics that can be derived from careful attention to the fact that at least some economic behavior is learned from experience. (JEL C71, C72, C73, D83)
Download InfoIf you experience problems downloading a file, check if you have the proper application to view it first. In case of further problems read the IDEAS help page. Note that these files are not on the IDEAS site. Please be patient as the files may be large.
Bibliographic InfoArticle provided by American Economic Association in its journal American Economic Review.
Volume (Year): 96 (2006)
Issue (Month): 4 (September)
Other versions of this item:
- Roth, Alvin & Bereby-Meyer, Yoella, 2006. "The Speed of Learning in Noisy Games: Partial Reinforcement and the Sustainability of Cooperation," Scholarly Articles 2580381, Harvard University Department of Economics.
- C71 - Mathematical and Quantitative Methods - - Game Theory and Bargaining Theory - - - Cooperative Games
- C72 - Mathematical and Quantitative Methods - - Game Theory and Bargaining Theory - - - Noncooperative Games
- C73 - Mathematical and Quantitative Methods - - Game Theory and Bargaining Theory - - - Stochastic and Dynamic Games; Evolutionary Games
- D83 - Microeconomics - - Information, Knowledge, and Uncertainty - - - Search, Learning, and Information
Please report citation or reference errors to , or , if you are the registered author of the cited work, log in to your RePEc Author Service profile, click on "citations" and make appropriate adjustments.:
- Nick Feltovich, 2000. "Reinforcement-Based vs. Belief-Based Learning Models in Experimental Asymmetric-Information," Econometrica, Econometric Society, vol. 68(3), pages 605-642, May.
- Kahneman, Daniel & Tversky, Amos, 1979.
"Prospect Theory: An Analysis of Decision under Risk,"
Econometric Society, vol. 47(2), pages 263-91, March.
- Amos Tversky & Daniel Kahneman, 1979. "Prospect Theory: An Analysis of Decision under Risk," Levine's Working Paper Archive 7656, David K. Levine.
- Sainty, Barbara, 1999. "Achieving greater cooperation in a noisy prisoner's dilemma: an experimental investigation," Journal of Economic Behavior & Organization, Elsevier, vol. 39(4), pages 421-435, July.
- Alvin E. Roth & Tayfun Sonmez & M. Utku Unver, 2003.
NBER Working Papers
10002, National Bureau of Economic Research, Inc.
- Selten, Reinhard & Stoecker, Rolf, 1986. "End behavior in sequences of finite Prisoner's Dilemma supergames A learning theory approach," Journal of Economic Behavior & Organization, Elsevier, vol. 7(1), pages 47-70, March.
- Roth, Alvin E. & Erev, Ido, 1995. "Learning in extensive-form games: Experimental data and simple dynamic models in the intermediate term," Games and Economic Behavior, Elsevier, vol. 8(1), pages 164-212.
- Cooper, R. & DeJong, D.W. & Ross, T.W., 1992.
"Cooperation without Reputation: Experimental Evidence from Prisoner's Dilemma Games,"
36, Boston University - Industry Studies Programme.
- Cooper, Russell & DeJong, Douglas V. & Forsythe, Robert & Ross, Thomas W., 1996. "Cooperation without Reputation: Experimental Evidence from Prisoner's Dilemma Games," Games and Economic Behavior, Elsevier, vol. 12(2), pages 187-218, February.
- Russell Cooper & Douglas V. DeJong & Thomas W. Ross, 1992. "Cooperation without Reputation: Experimental Evidence from Prisoner's Dilemma Games," Papers 0036, Boston University - Industry Studies Programme.
- Esther Hauk & Rosemarie Nagel, 2000. "Choice of partners in multiple two-person prisoner's dilemma games: An experimental study," Economics Working Papers 487, Department of Economics and Business, Universitat Pompeu Fabra.
- Erev, Ido & Bereby-Meyer, Yoella & Roth, Alvin E., 1999. "The effect of adding a constant to all payoffs: experimental investigation, and implications for reinforcement learning models," Journal of Economic Behavior & Organization, Elsevier, vol. 39(1), pages 111-128, May.
- SÃ¶nmez, Tayfun & Pathak, Parag A. & Abdulkadiroglu, Atila & Roth, Alvin, 2005.
"The Boston Public School Match,"
2562764, Harvard University Department of Economics.
- Nick Feltovich & John Duffy, 1999.
"Does observation of others affect learning in strategic environments? An experimental study,"
International Journal of Game Theory,
Springer, vol. 28(1), pages 131-152.
- John Duffy & Nick Feltovich, 1997. "Does Observation of Others Affect Learning in Strategic Environments? An Experimental Study," Levine's Working Paper Archive 592, David K. Levine.
- Erev, Ido & Roth, Alvin E, 1998. "Predicting How People Play Games: Reinforcement Learning in Experimental Games with Unique, Mixed Strategy Equilibria," American Economic Review, American Economic Association, vol. 88(4), pages 848-81, September.
- Colin Camerer & Teck-Hua Ho, 1999. "Experience-weighted Attraction Learning in Normal Form Games," Econometrica, Econometric Society, vol. 67(4), pages 827-874, July.
- Milgrom,Paul, 2004.
"Putting Auction Theory to Work,"
Cambridge University Press, number 9780521551847, December.
- Miller, John H., 1996. "The coevolution of automata in the repeated Prisoner's Dilemma," Journal of Economic Behavior & Organization, Elsevier, vol. 29(1), pages 87-112, January.
- Green, Edward J & Porter, Robert H, 1984.
"Noncooperative Collusion under Imperfect Price Information,"
Econometric Society, vol. 52(1), pages 87-100, January.
- Edward J Green & Robert H Porter, 1997. "Noncooperative Collusion Under Imperfect Price Information," Levine's Working Paper Archive 1147, David K. Levine.
- Green, Edward J. & Porter, Robert H., 1982. "Noncooperative Collusion Under Imperfect Price Information," Working Papers 367, California Institute of Technology, Division of the Humanities and Social Sciences.
- Drew Fudenberg & David K. Levine, 1996.
"The Theory of Learning in Games,"
Levine's Working Paper Archive
624, David K. Levine.
This item has more than 25 citations. To prevent cluttering this page, these citations are listed on a separate page. reading list or among the top items on IDEAS.Access and download statisticsgeneral information about how to correct material in RePEc.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: (Jane Voros) or (Michael P. Albert).
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
If references are entirely missing, you can add them using this form.
If the full references list an item that is present in RePEc, but the system did not link to it, you can help with this form.
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your profile, as there may be some citations waiting for confirmation.
Please note that corrections may take a couple of weeks to filter through the various RePEc services.