IDEAS home Printed from https://ideas.repec.org/p/wrk/wrkesp/99.html

Learning Correlated Equilibrium Via Neural Network Regret Minimisation

Author

Listed:
  • Sampat, Khushi

    (University of Warwick)

Abstract

This paper studies how decentralised neural agents trained by regret minimisation learn equilibrium behaviour in static games and whether such learning can be extended beyond Nash equilibria. The analysis proceeds in two parts. The first chapter examines equilibrium selection in coordination games with multiple Nash equilibria. Building on recent evidence that neural agents trained across large distributions of games systematically favour risk-dominant equilibria, the chapter introduces a structured pre-training curriculum designed to instil a bias toward payoffdominant outcomes in Stag Hunt environments. While pre-training successfully induces effcient coordination in these games, the results show that this bias is rapidly eroded under subsequent adversarial training on heterogeneous games, where play reverts to mixed or risk-sensitive equilibria. The second chapter investigates whether decentralised learners can acquire correlated equilibrium behaviour when coordination requires conditioning on private signals. Initial experiments demonstrate that standard personal regret objectives lead agents to ignore mediator signals and converge to unconditional Nash strategies. This limitation is overcome by replacing personal regret with a squared obedience (swap) regret objective. Under this modified objective, neural agents successfully learn signal-contingent behaviour and generalise correlated equilibrium strategies to unseen coordination games. Together, the findings clarify the capabilities and limitations of regret-based learning as a mechanism for equilibrium formation in strategic environments.

Suggested Citation

  • Sampat, Khushi, 2026. "Learning Correlated Equilibrium Via Neural Network Regret Minimisation," Warwick-Monash Economics Student Papers 99, Warwick Monash Economics Student Papers.
  • Handle: RePEc:wrk:wrkesp:99
    as

    Download full text from publisher

    File URL: https://warwick.ac.uk/fac/soc/economics/research/wmesp/manage/99-sampat.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. John C. Harsanyi & Reinhard Selten, 1988. "A General Theory of Equilibrium Selection in Games," MIT Press Books, The MIT Press, edition 1, volume 1, number 0262582384, December.
    2. Jacob K. Goeree & Charles A. Holt, 2001. "Ten Little Treasures of Game Theory and Ten Intuitive Contradictions," American Economic Review, American Economic Association, vol. 91(5), pages 1402-1422, December.
    3. Kreps, David M., 1990. "Game Theory and Economic Modelling," OUP Catalogue, Oxford University Press, number 9780198283812.
    4. Foster, Dean P. & Vohra, Rakesh V., 1997. "Calibrated Learning and Correlated Equilibrium," Games and Economic Behavior, Elsevier, vol. 21(1-2), pages 40-55, October.
    5. Kandori, Michihiro & Mailath, George J & Rob, Rafael, 1993. "Learning, Mutation, and Long Run Equilibria in Games," Econometrica, Econometric Society, vol. 61(1), pages 29-56, January.
    6. Robert J. Aumann, 2025. "Subjectivity and Correlation in Randomized Strategies," World Scientific Book Chapters, in: SELECTED CONTRIBUTIONS TO GAME THEORY, chapter 4, pages 73-113, World Scientific Publishing Co. Pte. Ltd..
    7. Young, H Peyton, 1993. "The Evolution of Conventions," Econometrica, Econometric Society, vol. 61(1), pages 57-84, January.
    8. Sergiu Hart & Andreu Mas-Colell, 2013. "A Simple Adaptive Procedure Leading To Correlated Equilibrium," World Scientific Book Chapters, in: Simple Adaptive Strategies From Regret-Matching to Uncoupled Dynamics, chapter 2, pages 17-46, World Scientific Publishing Co. Pte. Ltd..
    9. Drew Fudenberg & David K. Levine, 1998. "The Theory of Learning in Games," MIT Press Books, The MIT Press, edition 1, volume 1, number 0262061945, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Claudia Cerrone & Francesco Feri & Philip R. Neary, 2019. "Ignorance is bliss: a game of regret," Discussion Paper Series of the Max Planck Institute for Behavioral Economics 2019_10, Max Planck Institute for Behavioral Economics.
    2. Daniele Condorelli & Massimiliano Furlan, 2024. "Deep Learning Across Games," Papers 2409.15197, arXiv.org, revised May 2025.
    3. repec:osf:socarx:ymzrd_v1 is not listed on IDEAS
    4. Kets, Willemien & Kager, Wouter & Sandroni, Alvaro, 2022. "The value of a coordination game," Journal of Economic Theory, Elsevier, vol. 201(C).
    5. Michael Foley & Rory Smead & Patrick Forber & Christoph Riedl, 2021. "Avoiding the bullies: The resilience of cooperation among unequals," PLOS Computational Biology, Public Library of Science, vol. 17(4), pages 1-19, April.
    6. Tom Johnston & Michael Savery & Alex Scott & Bassel Tarbush, 2023. "Game Connectivity and Adaptive Dynamics," Papers 2309.10609, arXiv.org, revised Jun 2025.
    7. Michael Kosfeld, 2002. "Stochastic strategy adjustment in coordination games," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 20(2), pages 321-339.
    8. van Damme, E.E.C., 1995. "Game theory : The next stage," Other publications TiSEM 7779b0f9-bef5-45c7-ae6b-7, Tilburg University, School of Economics and Management.
    9. Alós-Ferrer, Carlos & Weidenholzer, Simon, 2008. "Contagion and efficiency," Journal of Economic Theory, Elsevier, vol. 143(1), pages 251-274, November.
    10. Alos-Ferrer, Carlos & Weidenholzer, Simon, 2008. "Erratum to "Partial bandwagon effects and local interactions" [Games Econ. Behav. 61 (2007) 179-197]," Games and Economic Behavior, Elsevier, vol. 62(1), pages 324-325, January.
    11. Ge Jiang & Simon Weidenholzer, 2017. "Local interactions under switching costs," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 64(3), pages 571-588, October.
    12. Anke Gerber & Thorsten Hens & Bodo Vogt, "undated". "Coordination in a Repeated Stochastic Game with Imperfect Monitoring," IEW - Working Papers 126, Institute for Empirical Research in Economics - University of Zurich.
    13. Pedro Dal Bo & Guillaume R. Frochette, 2011. "The Evolution of Cooperation in Infinitely Repeated Games: Experimental Evidence," American Economic Review, American Economic Association, vol. 101(1), pages 411-429, February.
    14. Soham R. Phade & Venkat Anantharam, 2023. "Learning in Games with Cumulative Prospect Theoretic Preferences," Dynamic Games and Applications, Springer, vol. 13(1), pages 265-306, March.
    15. Burkhard C. Schipper, 2022. "Strategic Teaching and Learning in Games," American Economic Journal: Microeconomics, American Economic Association, vol. 14(3), pages 321-352, August.
    16. Cassar, Alessandra, 2007. "Coordination and cooperation in local, random and small world networks: Experimental evidence," Games and Economic Behavior, Elsevier, vol. 58(2), pages 209-230, February.
    17. Emmanuelle Auriol & Jean-Philippe Platteau, 2017. "The explosive combination of religious decentralization and autocracy," The Economics of Transition, The European Bank for Reconstruction and Development, vol. 25(2), pages 313-350, April.
    18. Tassos Patokos, 2014. "Introducing Disappointment Dynamics and Comparing Behaviors in Evolutionary Games: Some Simulation Results," Games, MDPI, vol. 5(1), pages 1-25, January.
    19. Matros, Alexander & Moser, Scott, 2015. "Coordination in a changing environment," Journal of Economic Behavior & Organization, Elsevier, vol. 112(C), pages 64-84.
    20. van Damme, E.E.C., 2000. "Non-cooperative Games," Discussion Paper 2000-96, Tilburg University, Center for Economic Research.
    21. Simon Weidenholzer, 2010. "Coordination Games and Local Interactions: A Survey of the Game Theoretic Literature," Games, MDPI, vol. 1(4), pages 1-35, November.

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;
    ;
    ;
    ;
    ;
    ;

    JEL classification:

    • C72 - Mathematical and Quantitative Methods - - Game Theory and Bargaining Theory - - - Noncooperative Games
    • C63 - Mathematical and Quantitative Methods - - Mathematical Methods; Programming Models; Mathematical and Simulation Modeling - - - Computational Techniques
    • C73 - Mathematical and Quantitative Methods - - Game Theory and Bargaining Theory - - - Stochastic and Dynamic Games; Evolutionary Games
    • D83 - Microeconomics - - Information, Knowledge, and Uncertainty - - - Search; Learning; Information and Knowledge; Communication; Belief; Unawareness
    • C61 - Mathematical and Quantitative Methods - - Mathematical Methods; Programming Models; Mathematical and Simulation Modeling - - - Optimization Techniques; Programming Models; Dynamic Analysis

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:wrk:wrkesp:99. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sheila Kiggins (email available below). General contact details of provider: https://edirc.repec.org/data/dewaruk.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.