IDEAS home Printed from https://ideas.repec.org/a/eee/chsofr/v201y2025ip3s0960077925013955.html

Dilution, diffusion and symbiosis in spatial prisoner’s dilemma with reinforcement learning

Author

Listed:
  • Mangold, Gustavo C.
  • Vainstein, Mendeli H.
  • Fernandes, Heitor C.M.

Abstract

Recent studies on spatial prisoner’s dilemma games with reinforcement learning have shown that static agents can learn to cooperate through a variety of mechanisms, including noise injection, different learning algorithms, and access to neighbours’ payoff information. In this work, we use an independent multi-agent Q-learning algorithm to investigate the effects of dilution and mobility in the spatial version of the prisoner’s dilemma. Within this framework, different possible actions for the algorithm are defined, linking our results to those of the classical, non-reinforcement learning spatial prisoner’s dilemma. This highlights the algorithm’s versatility in modelling diverse game-theoretical scenarios and demonstrates its potential as a benchmarking tool. Our findings reveal a range of effects, including evidence that games with fixed update rules can be qualitatively equivalent to those with learned ones. Additionally, we observe the emergence of a symbiotic mutualistic effect between populations when multiple actions are defined.

Suggested Citation

  • Mangold, Gustavo C. & Vainstein, Mendeli H. & Fernandes, Heitor C.M., 2025. "Dilution, diffusion and symbiosis in spatial prisoner’s dilemma with reinforcement learning," Chaos, Solitons & Fractals, Elsevier, vol. 201(P3).
  • Handle: RePEc:eee:chsofr:v:201:y:2025:i:p3:s0960077925013955
    DOI: 10.1016/j.chaos.2025.117382
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0960077925013955
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.chaos.2025.117382?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. Takesue, Hirofumi, 2019. "Effects of updating rules on the coevolving prisoner’s dilemma," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 513(C), pages 399-408.
    2. Axelrod, Robert, 1981. "The Emergence of Cooperation among Egoists," American Political Science Review, Cambridge University Press, vol. 75(2), pages 306-318, June.
    3. Jorge M Pacheco & Flávio L Pinheiro & Francisco C Santos, 2009. "Population Structure Induces a Symmetry Breaking Favoring the Emergence of Cooperation," PLOS Computational Biology, Public Library of Science, vol. 5(12), pages 1-7, December.
    4. Viswanathan, G.M & Afanasyev, V & Buldyrev, Sergey V & Havlin, Shlomo & da Luz, M.G.E & Raposo, E.P & Stanley, H.Eugene, 2000. "Lévy flights in random searches," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 282(1), pages 1-12.
    5. Flores, Lucas S. & Amaral, Marco A. & Vainstein, Mendeli H. & Fernandes, Heitor C.M., 2022. "Cooperation in regular lattices," Chaos, Solitons & Fractals, Elsevier, vol. 164(C).
    6. Yang, Zhengzhi & Zheng, Lei & Perc, Matjaž & Li, Yumeng, 2024. "Interaction state Q-learning promotes cooperation in the spatial prisoner's dilemma game," Applied Mathematics and Computation, Elsevier, vol. 463(C).
    7. Zhang, Huizhen & An, Tianbo & Yan, Pingping & Hu, Kaipeng & An, Jinjin & Shi, Lijuan & Zhao, Jian & Wang, Jingrui, 2024. "Exploring cooperative evolution with tunable payoff’s loners using reinforcement learning," Chaos, Solitons & Fractals, Elsevier, vol. 178(C).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Huang, Yijie, 2025. "The evolution of cooperation in multi-games with reinforcement learning," Chaos, Solitons & Fractals, Elsevier, vol. 201(P2).
    2. An, Tianbo & Zhang, Huizhen & Zhang, Zhanshuo & Liu, Guanghui & Li, Jiayu & Chen, Liangyu & Wang, Zhen, 2025. "Cooperation dynamics driven by reinforcement learning with interactive diversity in structured populations," Chaos, Solitons & Fractals, Elsevier, vol. 201(P2).
    3. Pi, Jinxiu & Wang, Chun & Zhou, Die & Tang, Wei & Yang, Guanghui, 2024. "Evolutionary dynamics of N-person snowdrift game with two thresholds in well-mixed and structured populations," Chaos, Solitons & Fractals, Elsevier, vol. 180(C).
    4. Wang, Weining & Shang, Lihui & Wu, Yipeng & Hu, Mingjian & Wang, Weiyu, 2025. "Bio-inspired mechanism promotes cooperation in spatial public goods games," Chaos, Solitons & Fractals, Elsevier, vol. 200(P3).
    5. Yang, Yujin & Zhao, Dawei & Wang, Juan, 2025. "Evolution of cooperation in spatial public goods games driven by reinforcement learning and environmental feedback," Chaos, Solitons & Fractals, Elsevier, vol. 199(P1).
    6. Xie, Kai & Szolnoki, Attila, 2026. "Reinforcement learning in evolutionary game theory: A brief review of recent developments," Applied Mathematics and Computation, Elsevier, vol. 510(C).
    7. Shen, Shaofei & Zhang, Xuejun & Xu, Aobo & Duan, Taisen, 2024. "An adaptive exploration mechanism for Q-learning in spatial public goods games," Chaos, Solitons & Fractals, Elsevier, vol. 189(P1).
    8. Lee, Hsuan-Wei & Weng, Yi-Ning, 2025. "Granular Q-learning adaptation boosts collective welfare in multi-agent Prisoner’s Dilemma," Chaos, Solitons & Fractals, Elsevier, vol. 199(P1).
    9. Zheng, Guozhong & Zhang, Jiqiang & Deng, Shengfeng & Cai, Weiran & Chen, Li, 2024. "Evolution of cooperation in the public goods game with Q-learning," Chaos, Solitons & Fractals, Elsevier, vol. 188(C).
    10. Chen, Yunong & Belmonte, Andrew & Griffin, Christopher, 2021. "Imitation of success leads to cost of living mediated fairness in the Ultimatum Game," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 583(C).
    11. Wu, Yu’e & Zhang, Zhipeng & Yang, Guoli & Liu, Haixin & Zhang, Qingfeng, 2022. "Evolution of cooperation driven by diversity on a double-layer square lattice," Chaos, Solitons & Fractals, Elsevier, vol. 162(C).
    12. Zhou, Meng & Yang, Yanlong & Xiang, Shuwen, 2025. "Effect of community learning mechanism on cooperation in conflict societies," Chaos, Solitons & Fractals, Elsevier, vol. 192(C).
    13. M. Kleshnina & K. Kaveh & K. Chatterjee, 2020. "The role of behavioural plasticity in finite vs infinite populations," Papers 2009.13160, arXiv.org.
    14. Jorge Peña & Yannick Rochat, 2012. "Bipartite Graphs as Models of Population Structures in Evolutionary Multiplayer Games," PLOS ONE, Public Library of Science, vol. 7(9), pages 1-13, September.
    15. Peter S. Fader & John R. Hauser, 1988. "Implicit Coalitions in a Generalized Prisoner's Dilemma," Journal of Conflict Resolution, Peace Science Society (International), vol. 32(3), pages 553-582, September.
    16. Wu, Steven Y. & Roe, Brian E., 2007. "Discretionary Latitude and Relational Contracting," IZA Discussion Papers 2879, IZA Network @ LISER.
    17. Vogt, Carsten, 2000. "The evolution of cooperation in Prisoners' Dilemma with an endogenous learning mutant," Journal of Economic Behavior & Organization, Elsevier, vol. 42(3), pages 347-373, July.
    18. Wang, Jiaoyuan & Yang, Yanlong, 2026. "Research on the resilience of reputation mechanism to the cooperative environment in the face of external shocks," Applied Mathematics and Computation, Elsevier, vol. 510(C).
    19. Besir Ceka and Brian Burgo, 2014. "Discovering Cooperation: A Contractual Approach to Institutional Change in Regional International Organizations," EUI-RSCAS Working Papers p0388, European University Institute (EUI), Robert Schuman Centre of Advanced Studies (RSCAS).
    20. W. Bentley MacLeod & James M. Malcomson, 2023. "Implicit Contracts, Incentive Compatibility, and Involuntary Unemployment: Thirty Years On," Journal of Institutional and Theoretical Economics (JITE), Mohr Siebeck, Tübingen, vol. 179(3-4), pages 470-499.

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:chsofr:v:201:y:2025:i:p3:s0960077925013955. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Thayer, Thomas R. (email available below). General contact details of provider: https://www.journals.elsevier.com/chaos-solitons-and-fractals .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.