
Deep Learning to Play Games

Authors

  • Daniele Condorelli
  • Massimiliano Furlan

Abstract

We train two neural networks adversarially to play normal-form games. At each iteration, a row network and a column network take a new randomly generated game and output individual mixed strategies. The parameters of each network are independently updated via stochastic gradient descent to minimize expected regret given the opponent's strategy. Our simulations demonstrate that the joint behavior of the networks converges to strategies close to Nash equilibria in almost all games. In all $2 \times 2$ and in 80% of $3 \times 3$ games with multiple equilibria, the networks select the risk-dominant equilibrium. Our results show how Nash equilibrium emerges from learning across heterogeneous games.
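To make the two central notions in the abstract concrete, the sketch below computes each player's expected regret in a bimatrix game and checks risk dominance in a $2 \times 2$ coordination game. It is only an illustration under stated assumptions, not the authors' implementation: regret is taken here to be the gap between the best pure-strategy payoff against the opponent's mixed strategy and the payoff actually obtained, and risk dominance follows the Harsanyi-Selten comparison of products of deviation losses. The function names and the stag-hunt payoffs (`STAG_A`, `STAG_B`) are hypothetical and chosen for this example.

```python
from typing import Optional, Sequence, Tuple

Matrix = Sequence[Sequence[float]]
Strategy = Sequence[float]


def regrets(A: Matrix, B: Matrix, p: Strategy, q: Strategy) -> Tuple[float, float]:
    """Expected regret of the row and column player (assumed definition).

    A[i][j] and B[i][j] are the row and column payoffs when row plays i
    and column plays j; p and q are the players' mixed strategies.
    """
    rows, cols = range(len(p)), range(len(q))
    # Expected payoffs under the strategy profile (p, q).
    u_row = sum(p[i] * q[j] * A[i][j] for i in rows for j in cols)
    u_col = sum(p[i] * q[j] * B[i][j] for i in rows for j in cols)
    # Best pure-strategy payoff against the opponent's mixed strategy.
    best_row = max(sum(q[j] * A[i][j] for j in cols) for i in rows)
    best_col = max(sum(p[i] * B[i][j] for i in rows) for j in cols)
    return best_row - u_row, best_col - u_col


def risk_dominant_2x2(A: Matrix, B: Matrix) -> Optional[int]:
    """Index k of the risk-dominant equilibrium (k, k) in a 2x2 game.

    Assumes (0, 0) and (1, 1) are both strict pure Nash equilibria.
    Returns None when the two products of deviation losses are equal.
    """
    loss_00 = (A[0][0] - A[1][0]) * (B[0][0] - B[0][1])
    loss_11 = (A[1][1] - A[0][1]) * (B[1][1] - B[1][0])
    if loss_00 == loss_11:
        return None
    return 0 if loss_00 > loss_11 else 1


# Hypothetical stag hunt: action 0 = stag, action 1 = hare.
STAG_A = [[4, 0], [3, 3]]  # row player's payoffs
STAG_B = [[4, 3], [0, 3]]  # column player's payoffs

if __name__ == "__main__":
    print(regrets(STAG_A, STAG_B, [0, 1], [0, 1]))  # both play hare
    print(regrets(STAG_A, STAG_B, [1, 0], [0, 1]))  # miscoordination
    print(risk_dominant_2x2(STAG_A, STAG_B))
```

A strategy profile is a Nash equilibrium exactly when both regrets are zero, which is why driving regret down is a natural training objective. In this stag hunt, (stag, stag) gives the higher payoffs but (hare, hare) is risk-dominant, the kind of equilibrium the abstract reports the networks selecting.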

Suggested Citation

  • Daniele Condorelli & Massimiliano Furlan, 2024. "Deep Learning to Play Games," Papers 2409.15197, arXiv.org.
  • Handle: RePEc:arx:papers:2409.15197

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2409.15197
    File Function: Latest version
    Download Restriction: no


    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Lensberg, Terje & Schenk-Hoppé, Klaus Reiner, 2021. "Cold play: Learning across bimatrix games," Journal of Economic Behavior & Organization, Elsevier, vol. 185(C), pages 419-441.
    2. Christoph Kuzmics & Daniel Rodenburger, 2020. "A case of evolutionarily stable attainable equilibrium in the laboratory," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 70(3), pages 685-721, October.
    3. Mohlin, Erik, 2012. "Evolution of theories of mind," Games and Economic Behavior, Elsevier, vol. 75(1), pages 299-318.
    4. Mengel, Friederike, 2012. "Learning across games," Games and Economic Behavior, Elsevier, vol. 74(2), pages 601-619.
    5. Grimm, Veronika & Mengel, Friederike, 2012. "An experiment on learning in a multiple games environment," Journal of Economic Theory, Elsevier, vol. 147(6), pages 2220-2259.
    6. Rossella Argenziano & Itzhak Gilboa, 2012. "History as a coordination device," Theory and Decision, Springer, vol. 73(4), pages 501-512, October.
    7. Anke Gerber & Thorsten Hens & Bodo Vogt, "undated". "Coordination in a Repeated Stochastic Game with Imperfect Monitoring," IEW - Working Papers 126, Institute for Empirical Research in Economics - University of Zurich.
    8. Battalio,R. & Samuelson,L. & Huyck,J. van, 1998. "Risk dominance, payoff dominance and probabilistic choice learning," Working papers 2, Wisconsin Madison - Social Systems.
    9. Gallice, Andrea, 2007. "Best Responding to What? A Behavioral Approach to One Shot Play in 2x2 Games," Discussion Papers in Economics 1365, University of Munich, Department of Economics.
    10. Christoph March, 2011. "Adaptive social learning," Working Papers halshs-00572528, HAL.
    11. He, Simin & Wu, Jiabin, 2020. "Compromise and coordination: An experimental study," Games and Economic Behavior, Elsevier, vol. 119(C), pages 216-233.
    12. Philippe Jehiel, 2022. "Analogy-Based Expectation Equilibrium and Related Concepts:Theory, Applications, and Beyond," Working Papers halshs-03735680, HAL.
    13. Friederike Mengel & Emanuela Sciubba, 2010. "Extrapolation in Games of Coordination and Dominance Solvable Games," Working Papers 2010.148, Fondazione Eni Enrico Mattei.
    14. Marco LiCalzi & Roland Mühlenbernd, 2022. "Feature-weighted categorized play across symmetric games," Experimental Economics, Springer;Economic Science Association, vol. 25(3), pages 1052-1078, June.
    15. Demichelis, Stefano & Ritzberger, Klaus, 2003. "From evolutionary to strategic stability," Journal of Economic Theory, Elsevier, vol. 113(1), pages 51-75, November.
    16. Alós-Ferrer, Carlos & Weidenholzer, Simon, 2008. "Contagion and efficiency," Journal of Economic Theory, Elsevier, vol. 143(1), pages 251-274, November.
    17. Spiliopoulos, Leonidas, 2012. "Pattern recognition and subjective belief learning in a repeated constant-sum game," Games and Economic Behavior, Elsevier, vol. 75(2), pages 921-935.
    18. Alos-Ferrer, Carlos & Weidenholzer, Simon, 2007. "Partial bandwagon effects and local interactions," Games and Economic Behavior, Elsevier, vol. 61(2), pages 179-197, November.
    19. Auriol, Emmanuelle & Platteau, Jean-Philippe & Camilotti, Giula, 2017. "Eradicating Women-Hurting Customs: What Role for Social Engineering?," CEPR Discussion Papers 12107, C.E.P.R. Discussion Papers.
    20. Ge Jiang & Simon Weidenholzer, 2017. "Local interactions under switching costs," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 64(3), pages 571-588, October.
