IDEAS home Printed from https://ideas.repec.org/a/eee/jeborg/v69y2009i1p27-38.html
   My bibliography  Save this article

Learning to play 3×3 games: Neural networks as bounded-rational players

Author

Listed:
  • Sgroi, Daniel
  • Zizzo, Daniel John

Abstract

We present a neural network methodology for learning game-playing rules in general. Existing research suggests learning to find a Nash equilibrium in a new game is too difficult a task for a neural network, but says little about what it will do instead. We observe that a neural network trained to find Nash equilibria in a known subset of games will use self-taught rules developed endogenously when facing new games. These rules are close to payoff dominance and its best response. Our findings are consistent with existing experimental results, both in terms of subject's methodology and success rates.

Suggested Citation

  • Sgroi, Daniel & Zizzo, Daniel John, 2009. "Learning to play 3×3 games: Neural networks as bounded-rational players," Journal of Economic Behavior & Organization, Elsevier, vol. 69(1), pages 27-38, January.
  • Handle: RePEc:eee:jeborg:v:69:y:2009:i:1:p:27-38
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167-2681(08)00195-9
    Download Restriction: Full text for ScienceDirect subscribers only
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Stahl Dale O. & Wilson Paul W., 1995. "On Players' Models of Other Players: Theory and Experimental Evidence," Games and Economic Behavior, Elsevier, vol. 10(1), pages 218-254, July.
    2. Ben-porath, Elchanan, 1990. "The complexity of computing a best response automaton in repeated games with mixed strategies," Games and Economic Behavior, Elsevier, vol. 2(1), pages 1-12, March.
    3. Cho, In-Koo, 1994. "Bounded Rationality, Neural Network and Folk Theorem in Repeated Games with Discounting," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 4(6), pages 935-957, October.
    4. Cho, In-Koo & Sargent, Thomas J., 1996. "Neural networks for encoding and adapting in dynamic economies," Handbook of Computational Economics, in: H. M. Amman & D. A. Kendrick & J. Rust (ed.), Handbook of Computational Economics, edition 1, volume 1, chapter 9, pages 441-470, Elsevier.
    5. Roth, Alvin E. & Erev, Ido, 1995. "Learning in extensive-form games: Experimental data and simple dynamic models in the intermediate term," Games and Economic Behavior, Elsevier, vol. 8(1), pages 164-212.
    6. Gilboa, Itzhak & Zemel, Eitan, 1989. "Nash and correlated equilibria: Some complexity considerations," Games and Economic Behavior, Elsevier, vol. 1(1), pages 80-93, March.
    7. Gilboa,Itzhak & Schmeidler,David, 2001. "A Theory of Case-Based Decisions," Cambridge Books, Cambridge University Press, number 9780521802345, January.
    8. Colin Camerer & Teck-Hua Ho, 1999. "Experience-weighted Attraction Learning in Normal Form Games," Econometrica, Econometric Society, vol. 67(4), pages 827-874, July.
    9. Cho, In-Koo, 1996. "Perceptrons Play Repeated Games with Imperfect Monitoring," Games and Economic Behavior, Elsevier, vol. 16(1), pages 22-53, September.
    10. Herbert A. Simon, 1955. "A Behavioral Model of Rational Choice," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 69(1), pages 99-118.
    11. Slonim, Robert L., 1999. "Learning rules of thumb or learning more rational rules," Journal of Economic Behavior & Organization, Elsevier, vol. 38(2), pages 217-236, February.
    12. Stahl, Dale O., 2001. "Population rule learning in symmetric normal-form games: theory and evidence," Journal of Economic Behavior & Organization, Elsevier, vol. 45(1), pages 19-35, May.
    13. Barr, Jason & Saraceno, Francesco, 2005. "Cournot competition, organization and learning," Journal of Economic Dynamics and Control, Elsevier, vol. 29(1-2), pages 277-295, January.
    14. Rubinstein, Ariel, 1993. "On Price Recognition and Computational Complexity in a Monopolistic Model," Journal of Political Economy, University of Chicago Press, vol. 101(3), pages 473-484, June.
    15. Miller, John H., 1996. "The coevolution of automata in the repeated Prisoner's Dilemma," Journal of Economic Behavior & Organization, Elsevier, vol. 29(1), pages 87-112, January.
    16. Costa-Gomes, Miguel & Crawford, Vincent P & Broseta, Bruno, 2001. "Cognition and Behavior in Normal-Form Games: An Experimental Study," Econometrica, Econometric Society, vol. 69(5), pages 1193-1235, September.
    17. Ray Barrell & Francesco Saraceno, 2005. "Cournot Competition, Organization and Learning," Post-Print hal-03597732, HAL.
    18. Gilboa, Itzhak, 1988. "The complexity of computing best-response automata in repeated games," Journal of Economic Theory, Elsevier, vol. 45(2), pages 342-352, August.
    19. D. Sgroi & D. J. Zizzo, 2002. "Strategy Learning in 3x3 Games by Neural Networks," Cambridge Working Papers in Economics 0207, Faculty of Economics, University of Cambridge.
    20. Erev, Ido & Roth, Alvin E, 1998. "Predicting How People Play Games: Reinforcement Learning in Experimental Games with Unique, Mixed Strategy Equilibria," American Economic Review, American Economic Association, vol. 88(4), pages 848-881, September.
    21. Ho, Teck-Hua, 1996. "Finite automata play repeated prisoner's dilemma with information processing costs," Journal of Economic Dynamics and Control, Elsevier, vol. 20(1-3), pages 173-207.
    22. Daniel John Zizzo, 2002. "Neurobiological measurements of cardinal utility: Hedonimeters or learning algorithms?," Social Choice and Welfare, Springer;The Society for Social Choice and Welfare, vol. 19(3), pages 477-488.
    23. Stahl, Dale II & Wilson, Paul W., 1994. "Experimental evidence on players' models of other players," Journal of Economic Behavior & Organization, Elsevier, vol. 25(3), pages 309-327, December.
    24. Stahl, Dale O., 2000. "Rule Learning in Symmetric Normal-Form Games: Theory and Evidence," Games and Economic Behavior, Elsevier, vol. 32(1), pages 105-138, July.
    25. David Cooper & John H. Kagel, 2003. "Lessons Learned: Generalizing Learning Across Games," American Economic Review, American Economic Association, vol. 93(2), pages 202-207, May.
    26. Cho In-Koo, 1995. "Perceptrons Play the Repeated Prisoner's Dilemma," Journal of Economic Theory, Elsevier, vol. 67(1), pages 266-284, October.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Lensberg, Terje & Schenk-Hoppé, Klaus Reiner, 2021. "Cold play: Learning across bimatrix games," Journal of Economic Behavior & Organization, Elsevier, vol. 185(C), pages 419-441.
    2. Burka, Dávid & Puppe, Clemens & Szepesváry, László & Tasnádi, Attila, 2016. "Neural networks would 'vote' according to Borda's Rule," Corvinus Economics Working Papers (CEWP) 2016/13, Corvinus University of Budapest.
    3. Mohlin, Erik, 2012. "Evolution of theories of mind," Games and Economic Behavior, Elsevier, vol. 75(1), pages 299-318.
    4. Burka, Dávid & Puppe, Clemens & Szepesváry, László & Tasnádi, Attila, 2022. "Voting: A machine learning approach," European Journal of Operational Research, Elsevier, vol. 299(3), pages 1003-1017.
    5. Spiliopoulos, Leonidas, 2012. "Interactive learning in 2×2 normal form games by neural network agents," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 391(22), pages 5557-5562.
    6. Salle, Isabelle L., 2015. "Modeling expectations in agent-based models — An application to central bank's communication and monetary policy," Economic Modelling, Elsevier, vol. 46(C), pages 130-141.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Haruvy, Ernan & Stahl, Dale O., 2012. "Between-game rule learning in dissimilar symmetric normal-form games," Games and Economic Behavior, Elsevier, vol. 74(1), pages 208-221.
    2. Camerer, Colin F. & Ho, Teck-Hua & Chong, Juin-Kuan, 2002. "Sophisticated Experience-Weighted Attraction Learning and Strategic Teaching in Repeated Games," Journal of Economic Theory, Elsevier, vol. 104(1), pages 137-188, May.
    3. Teck-Hua Ho & So-Eun Park & Xuanming Su, 2021. "A Bayesian Level- k Model in n -Person Games," Management Science, INFORMS, vol. 67(3), pages 1622-1638, March.
    4. Daniel John Zizzo & Daniel Sgroi, 2001. "Bounded-Rational Behavior by Neural Networks in Normal Form Games," Economics Series Working Papers 2000-W30, University of Oxford, Department of Economics.
    5. Lensberg, Terje & Schenk-Hoppé, Klaus Reiner, 2021. "Cold play: Learning across bimatrix games," Journal of Economic Behavior & Organization, Elsevier, vol. 185(C), pages 419-441.
    6. D. Sgroi & D. J. Zizzo, 2002. "Strategy Learning in 3x3 Games by Neural Networks," Cambridge Working Papers in Economics 0207, Faculty of Economics, University of Cambridge.
    7. Breitmoser, Yves & Tan, Jonathan H.W. & Zizzo, Daniel John, 2014. "On the beliefs off the path: Equilibrium refinement due to quantal response and level-k," Games and Economic Behavior, Elsevier, vol. 86(C), pages 102-125.
    8. Ho, Teck H. & Camerer, Colin F. & Chong, Juin-Kuan, 2007. "Self-tuning experience weighted attraction learning in games," Journal of Economic Theory, Elsevier, vol. 133(1), pages 177-198, March.
    9. Camerer, Colin F. & Ho, Teck-Hua, 2015. "Behavioral Game Theory Experiments and Modeling," Handbook of Game Theory with Economic Applications,, Elsevier.
    10. Mariano Runco, 2013. "Estimating depth of reasoning in a repeated guessing game with no feedback," Experimental Economics, Springer;Economic Science Association, vol. 16(3), pages 402-413, September.
    11. Barr, Jason & Saraceno, Francesco, 2009. "Organization, learning and cooperation," Journal of Economic Behavior & Organization, Elsevier, vol. 70(1-2), pages 39-53, May.
    12. Rick, Scott & Weber, Roberto A., 2010. "Meaningful learning and transfer of learning in games played repeatedly without feedback," Games and Economic Behavior, Elsevier, vol. 68(2), pages 716-730, March.
    13. repec:hal:spmain:info:hdl:2441/6782 is not listed on IDEAS
    14. Stahl, Dale O., 2000. "Rule Learning in Symmetric Normal-Form Games: Theory and Evidence," Games and Economic Behavior, Elsevier, vol. 32(1), pages 105-138, July.
    15. Yoella Bereby-Meyer & Alvin E. Roth, 2006. "The Speed of Learning in Noisy Games: Partial Reinforcement and the Sustainability of Cooperation," American Economic Review, American Economic Association, vol. 96(4), pages 1029-1042, September.
    16. Iriberri, Nagore & García-Pola, Bernardo, 2019. "Naivete and Sophistication in Initial and Repeated Play in Games," CEPR Discussion Papers 14088, C.E.P.R. Discussion Papers.
    17. repec:hal:wpspec:info:hdl:2441/6782 is not listed on IDEAS
    18. Kawagoe, Toshiji & Takizawa, Hirokazu, 2012. "Level-k analysis of experimental centipede games," Journal of Economic Behavior & Organization, Elsevier, vol. 82(2), pages 548-566.
    19. repec:hal:wpspec:info:hdl:2441/9832 is not listed on IDEAS
    20. Asim Ansari & Ricardo Montoya & Oded Netzer, 2012. "Dynamic learning in behavioral games: A hidden Markov mixture of experts approach," Quantitative Marketing and Economics (QME), Springer, vol. 10(4), pages 475-503, December.
    21. Costa-Gomes, Miguel & Crawford, Vincent P & Broseta, Bruno, 2001. "Cognition and Behavior in Normal-Form Games: An Experimental Study," Econometrica, Econometric Society, vol. 69(5), pages 1193-1235, September.
    22. repec:hal:spmain:info:hdl:2441/9832 is not listed on IDEAS
    23. Duffy, John, 2006. "Agent-Based Models and Human Subject Experiments," Handbook of Computational Economics, in: Leigh Tesfatsion & Kenneth L. Judd (ed.), Handbook of Computational Economics, edition 1, volume 2, chapter 19, pages 949-1011, Elsevier.
    24. Fabrizio Germano, 2007. "Stochastic Evolution of Rules for Playing Finite Normal Form Games," Theory and Decision, Springer, vol. 62(4), pages 311-333, May.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:jeborg:v:69:y:2009:i:1:p:27-38. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/jebo .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.