IDEAS home Printed from https://ideas.repec.org/p/hrv/faseco/2580381.html
   My bibliography  Save this paper

The Speed of Learning in Noisy Games: Partial Reinforcement and the Sustainability of Cooperation

Author

Listed:
  • Roth, Alvin
  • Bereby-Meyer, Yoella

Abstract

In an experiment, players’ ability to learn to cooperate in the repeated prisoner’s dilemma was substantially diminished when the payoffs were noisy, even though players could monitor one another's past actions perfectly. In contrast, in one-time play against a succession of opponents, noisy payoffs increased cooperation, by slowing the rate at which cooperation decays. These observations are consistent with the robust observation from the psychology literature that partial reinforcement (adding randomness to the link between an action and its consequences while holding expected payoffs constant) slows learning. This effect is magnified in the repeated game: When others are slow to learn to cooperate, the benefits of cooperation are reduced, which further hampers cooperation. These results show that a small change in the payoff environment, which changes the speed of individual learning, can have a large effect on collective behavior. And they show that there may be interesting comparative dynamics that can be derived from careful attention to the fact that at least some economic behavior is learned from experience.

Suggested Citation

  • Roth, Alvin & Bereby-Meyer, Yoella, 2006. "The Speed of Learning in Noisy Games: Partial Reinforcement and the Sustainability of Cooperation," Scholarly Articles 2580381, Harvard University Department of Economics.
  • Handle: RePEc:hrv:faseco:2580381
    as

    Download full text from publisher

    File URL: http://dash.harvard.edu/bitstream/handle/1/2580381/Roth_Noisygames_retry.pdf
    Download Restriction: no
    ---><---

    Other versions of this item:

    References listed on IDEAS

    as
    1. Atila Abdulkadiroğlu & Parag A. Pathak & Alvin E. Roth, 2005. "The New York City High School Match," American Economic Review, American Economic Association, vol. 95(2), pages 364-367, May.
    2. Kreps, David M. & Milgrom, Paul & Roberts, John & Wilson, Robert, 1982. "Rational cooperation in the finitely repeated prisoners' dilemma," Journal of Economic Theory, Elsevier, vol. 27(2), pages 245-252, August.
    3. Alvin E. Roth & Tayfun Sönmez & M. Utku Ünver, 2004. "Kidney Exchange," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 119(2), pages 457-488.
    4. Atila Abdulkadiroğlu & Parag A. Pathak & Alvin E. Roth & Tayfun Sönmez, 2005. "The Boston Public School Match," American Economic Review, American Economic Association, vol. 95(2), pages 368-371, May.
    5. Dan Ariely & Axel Ockenfels & Alvin E. Roth, 2005. "An Experimental Analysis of Ending Rules in Internet Auctions," RAND Journal of Economics, The RAND Corporation, vol. 36(4), pages 890-907, Winter.
    6. Robert Wilson, 2002. "Architecture of Power Markets," Econometrica, Econometric Society, vol. 70(4), pages 1299-1340, July.
    7. Green, Edward J & Porter, Robert H, 1984. "Noncooperative Collusion under Imperfect Price Information," Econometrica, Econometric Society, vol. 52(1), pages 87-100, January.
    8. Miller, John H., 1996. "The coevolution of automata in the repeated Prisoner's Dilemma," Journal of Economic Behavior & Organization, Elsevier, vol. 29(1), pages 87-112, January.
    9. Erev, Ido & Bereby-Meyer, Yoella & Roth, Alvin E., 1999. "The effect of adding a constant to all payoffs: experimental investigation, and implications for reinforcement learning models," Journal of Economic Behavior & Organization, Elsevier, vol. 39(1), pages 111-128, May.
    10. Per Molander, 1985. "The Optimal Level of Generosity in a Selfish, Uncertain Environment," Journal of Conflict Resolution, Peace Science Society (International), vol. 29(4), pages 611-618, December.
    11. Cooper, Russell & DeJong, Douglas V. & Forsythe, Robert & Ross, Thomas W., 1996. "Cooperation without Reputation: Experimental Evidence from Prisoner's Dilemma Games," Games and Economic Behavior, Elsevier, vol. 12(2), pages 187-218, February.
    12. Nick Feltovich, 2000. "Reinforcement-Based vs. Belief-Based Learning Models in Experimental Asymmetric-Information," Econometrica, Econometric Society, vol. 68(3), pages 605-642, May.
    13. Roth, Alvin E. & Sonmez, Tayfun & Utku Unver, M., 2005. "Pairwise kidney exchange," Journal of Economic Theory, Elsevier, vol. 125(2), pages 151-188, December.
    14. Erev, Ido & Roth, Alvin E, 1998. "Predicting How People Play Games: Reinforcement Learning in Experimental Games with Unique, Mixed Strategy Equilibria," American Economic Review, American Economic Association, vol. 88(4), pages 848-881, September.
    15. Andreoni, James A & Miller, John H, 1993. "Rational Cooperation in the Finitely Repeated Prisoner's Dilemma: Experimental Evidence," Economic Journal, Royal Economic Society, vol. 103(418), pages 570-585, May.
    16. Matthew Rabin, 2000. "Risk Aversion and Expected-Utility Theory: A Calibration Theorem," Econometrica, Econometric Society, vol. 68(5), pages 1281-1292, September.
    17. Muriel Niederle & Alvin E. Roth, 2005. "The Gastroenterology Fellowship Market: Should There Be a Match?," American Economic Review, American Economic Association, vol. 95(2), pages 372-375, May.
    18. Jonathan Bendor, 1993. "Uncertainty and the Evolution of Cooperation," Journal of Conflict Resolution, Peace Science Society (International), vol. 37(4), pages 709-734, December.
    19. Esther Hauk & Rosemarie Nagel, 2000. "Choice of partners in multiple two-person prisoner's dilemma games: An experimental study," Economics Working Papers 487, Department of Economics and Business, Universitat Pompeu Fabra.
    20. Stahl, Dale O., 2000. "Rule Learning in Symmetric Normal-Form Games: Theory and Evidence," Games and Economic Behavior, Elsevier, vol. 32(1), pages 105-138, July.
    21. Nick Feltovich & John Duffy, 1999. "Does observation of others affect learning in strategic environments? An experimental study," International Journal of Game Theory, Springer;Game Theory Society, vol. 28(1), pages 131-152.
    22. Daniel Kahneman & Amos Tversky, 2013. "Prospect Theory: An Analysis of Decision Under Risk," World Scientific Book Chapters, in: Leonard C MacLean & William T Ziemba (ed.), HANDBOOK OF THE FUNDAMENTALS OF FINANCIAL DECISION MAKING Part I, chapter 6, pages 99-127, World Scientific Publishing Co. Pte. Ltd..
    23. Selten, Reinhard & Stoecker, Rolf, 1986. "End behavior in sequences of finite Prisoner's Dilemma supergames A learning theory approach," Journal of Economic Behavior & Organization, Elsevier, vol. 7(1), pages 47-70, March.
    24. Roth, Alvin E. & Erev, Ido, 1995. "Learning in extensive-form games: Experimental data and simple dynamic models in the intermediate term," Games and Economic Behavior, Elsevier, vol. 8(1), pages 164-212.
    25. Alvin E. Roth & Tayfun Sönmez, 2005. "A Kidney Exchange Clearinghouse in New England," American Economic Review, American Economic Association, vol. 95(2), pages 376-380, May.
    26. Colin Camerer & Teck-Hua Ho, 1999. "Experience-weighted Attraction Learning in Normal Form Games," Econometrica, Econometric Society, vol. 67(4), pages 827-874, July.
    27. Dale O. Stahl, 1999. "Evidence based rules and learning in symmetric normal-form games," International Journal of Game Theory, Springer;Game Theory Society, vol. 28(1), pages 111-130.
    28. Alvin E. Roth, 2002. "The Economist as Engineer: Game Theory, Experimentation, and Computation as Tools for Design Economics," Econometrica, Econometric Society, vol. 70(4), pages 1341-1378, July.
    29. Alvin E. Roth & Axel Ockenfels, 2002. "Last-Minute Bidding and the Rules for Ending Second-Price Auctions: Evidence from eBay and Amazon Auctions on the Internet," American Economic Review, American Economic Association, vol. 92(4), pages 1093-1103, September.
    30. Drew Fudenberg & David K. Levine, 1998. "The Theory of Learning in Games," MIT Press Books, The MIT Press, edition 1, volume 1, number 0262061945, December.
    31. Milgrom,Paul, 2004. "Putting Auction Theory to Work," Cambridge Books, Cambridge University Press, number 9780521536721.
    32. Sainty, Barbara, 1999. "Achieving greater cooperation in a noisy prisoner's dilemma: an experimental investigation," Journal of Economic Behavior & Organization, Elsevier, vol. 39(4), pages 421-435, July.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Grosskopf, Brit & Roth, Alvin E., 2009. "If you are offered the Right of First Refusal, should you accept? An investigation of contract design," Games and Economic Behavior, Elsevier, vol. 65(1), pages 176-204, January.
    2. Alvin E. Roth, 2009. "What Have We Learned from Market Design?," Innovation Policy and the Economy, University of Chicago Press, vol. 9(1), pages 79-112.
    3. Alvin E. Roth & Tayfun Sönmez & M. Utku Ünver, 2005. "Efficient Kidney Exchange: Coincidence of Wants in a Structured Market," Boston College Working Papers in Economics 621, Boston College Department of Economics.
    4. Alvin E. Roth, 2010. "Marketplace Institutions Related to the Timing of Transactions," NBER Working Papers 16556, National Bureau of Economic Research, Inc.
    5. Todd Guilfoos & Andreas Pape, 2016. "Predicting human cooperation in the Prisoner’s Dilemma using case-based decision theory," Theory and Decision, Springer, vol. 80(1), pages 1-32, January.
    6. Alvin Roth, 2008. "Deferred acceptance algorithms: history, theory, practice, and open questions," International Journal of Game Theory, Springer;Game Theory Society, vol. 36(3), pages 537-569, March.
    7. Committee, Nobel Prize, 2012. "Alvin E. Roth and Lloyd S. Shapley: Stable allocations and the practice of market design," Nobel Prize in Economics documents 2012-1, Nobel Prize Committee.
    8. Camerer, Colin F. & Ho, Teck-Hua & Chong, Juin-Kuan, 2002. "Sophisticated Experience-Weighted Attraction Learning and Strategic Teaching in Repeated Games," Journal of Economic Theory, Elsevier, vol. 104(1), pages 137-188, May.
    9. Axel Ockenfels, 2009. "Marktdesign und Experimentelle Wirtschaftsforschung," Perspektiven der Wirtschaftspolitik, Verein für Socialpolitik, vol. 10(s1), pages 31-53, May.
    10. Scott Duke Kominers & Alexander Teytelboym & Vincent P Crawford, 2017. "An invitation to market design," Oxford Review of Economic Policy, Oxford University Press and Oxford Review of Economic Policy Limited, vol. 33(4), pages 541-571.
    11. Alvin E. Roth, 2012. "Marketplace Institutions Related to the Timing of Transactions: Reply to Priest," Journal of Labor Economics, University of Chicago Press, vol. 30(2), pages 479-494.
    12. Atila Abdulkadiroglu & Parag A. Pathak & Alvin E. Roth & Tayfun Sönmez, 2006. "Changing the Boston School Choice Mechanism," Levine's Bibliography 122247000000001022, UCLA Department of Economics.
    13. Tayfun Sönmez & Alvin E. Roth & M. Utku Ünver, 2007. "Efficient Kidney Exchange: Coincidence of Wants in Markets with Compatibility-Based Preferences," American Economic Review, American Economic Association, vol. 97(3), pages 828-851, June.
    14. Duffy, John, 2006. "Agent-Based Models and Human Subject Experiments," Handbook of Computational Economics, in: Leigh Tesfatsion & Kenneth L. Judd (ed.), Handbook of Computational Economics, edition 1, volume 2, chapter 19, pages 949-1011, Elsevier.
    15. Camerer, Colin F. & Ho, Teck-Hua, 2015. "Behavioral Game Theory Experiments and Modeling," Handbook of Game Theory with Economic Applications,, Elsevier.
    16. Gary Bolton & Ben Greiner & Axel Ockenfels, 2013. "Engineering Trust: Reciprocity in the Production of Reputation Information," Management Science, INFORMS, vol. 59(2), pages 265-285, January.
    17. Ho, Teck H. & Camerer, Colin F. & Chong, Juin-Kuan, 2007. "Self-tuning experience weighted attraction learning in games," Journal of Economic Theory, Elsevier, vol. 133(1), pages 177-198, March.
    18. Albert Banal-Estañol & Augusto Rupérez-Micola, 2010. "Are agent-based simulations robust? The wholesale electricity trading case," Economics Working Papers 1214, Department of Economics and Business, Universitat Pompeu Fabra.
    19. Ed Hopkins, 2002. "Two Competing Models of How People Learn in Games," Econometrica, Econometric Society, vol. 70(6), pages 2141-2166, November.
    20. Yan Chen & Peter Cramton & John A. List & Axel Ockenfels, 2021. "Market Design, Human Behavior, and Management," Management Science, INFORMS, vol. 67(9), pages 5317-5348, September.

    More about this item

    JEL classification:

    • C71 - Mathematical and Quantitative Methods - - Game Theory and Bargaining Theory - - - Cooperative Games
    • C72 - Mathematical and Quantitative Methods - - Game Theory and Bargaining Theory - - - Noncooperative Games
    • C73 - Mathematical and Quantitative Methods - - Game Theory and Bargaining Theory - - - Stochastic and Dynamic Games; Evolutionary Games
    • D83 - Microeconomics - - Information, Knowledge, and Uncertainty - - - Search; Learning; Information and Knowledge; Communication; Belief; Unawareness

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:hrv:faseco:2580381. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Office for Scholarly Communication (email available below). General contact details of provider: https://edirc.repec.org/data/deharus.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.