IDEAS home Printed from https://ideas.repec.org/a/eee/chsofr/v199y2025ip1s0960077925006411.html

Beyond cyclic dominance: Reinforcement learning promotes cooperation in the spatial rock–paper–scissors game

Author

Listed:
  • Si, Zehua
  • Ito, Takayuki

Abstract

Cyclic dominance is widely recognized as a fundamental mechanism for sustaining cooperation in structured populations. However, existing studies primarily employ strategy updating mechanisms based on imitation, neglecting the role of individual learning from experience. In this study, we investigate the spatial rock–paper–scissors game with destructive agents on a two-dimensional lattice network, incorporating reinforcement learning as the strategy updating mechanism. Specifically, we examine two distinct learning paradigms: self-regarding Q-learning, where individuals update their strategies based solely on personal experience, and neighbor-regarding Q-learning, which integrates both individual and neighbor experiences. Monte Carlo simulation results demonstrate that when Q-learning governs strategy updates, cyclic dominance disappears, leading the system to converge toward a single dominant strategy. The specific dominant strategy depends on the adopted learning paradigm: when individuals rely exclusively on their own experiences, destructive agents ultimately dominate the system. In contrast, when individuals incorporate experiences from their neighbors, full cooperation emerges. These findings suggest that the significance of cyclic dominance in promoting cooperation may have been overestimated. Instead, cooperation can be maintained through experiential learning, as long as the learning perspective is not overly constrained.

Suggested Citation

  • Si, Zehua & Ito, Takayuki, 2025. "Beyond cyclic dominance: Reinforcement learning promotes cooperation in the spatial rock–paper–scissors game," Chaos, Solitons & Fractals, Elsevier, vol. 199(P1).
  • Handle: RePEc:eee:chsofr:v:199:y:2025:i:p1:s0960077925006411
    DOI: 10.1016/j.chaos.2025.116628
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0960077925006411
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.chaos.2025.116628?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. Wu, Yu’e & Zhang, Zhipeng & Chang, Shuhua, 2019. "Reciprocal reward promotes the evolution of cooperation in structured populations," Chaos, Solitons & Fractals, Elsevier, vol. 119(C), pages 230-236.
    2. Ding, Hong & Zhang, Geng-shun & Wang, Shi-hao & Li, Juan & Wang, Zhen, 2019. "Q-learning boosts the evolution of cooperation in structured population by involving extortion," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 536(C).
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Xie, Kai & Szolnoki, Attila, 2026. "Reinforcement learning in evolutionary game theory: A brief review of recent developments," Applied Mathematics and Computation, Elsevier, vol. 510(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Yan, Zeyuan & Zhao, Hui & Liang, Shu & Li, Li & Song, Yanjie, 2024. "Inter-layer feedback mechanism with reinforcement learning boosts the evolution of cooperation in multilayer network," Chaos, Solitons & Fractals, Elsevier, vol. 185(C).
    2. Huang, Yijie, 2025. "The evolution of cooperation in multi-games with reinforcement learning," Chaos, Solitons & Fractals, Elsevier, vol. 201(P2).
    3. Yang, Zhengzhi & Zheng, Lei & Perc, Matjaž & Li, Yumeng, 2024. "Interaction state Q-learning promotes cooperation in the spatial prisoner's dilemma game," Applied Mathematics and Computation, Elsevier, vol. 463(C).
    4. Wang, Chengjie & Deng, Juan & Zhao, Hui & Li, Li, 2024. "Effect of Q-learning on the evolution of cooperation behavior in collective motion: An improved Vicsek model," Applied Mathematics and Computation, Elsevier, vol. 482(C).
    5. Li, Cong & Xu, Hedong & Fan, Suohai, 2021. "Evolutionary compromise game on assortative mixing networks," Applied Mathematics and Computation, Elsevier, vol. 390(C).
    6. Lin, Jingyan & Huang, Changwei & Dai, Qionglin & Yang, Junzhong, 2020. "Evolutionary game dynamics of combining the payoff-driven and conformity-driven update rules," Chaos, Solitons & Fractals, Elsevier, vol. 140(C).
    7. An, Tianbo & Zhang, Huizhen & Zhang, Zhanshuo & Liu, Guanghui & Li, Jiayu & Chen, Liangyu & Wang, Zhen, 2025. "Cooperation dynamics driven by reinforcement learning with interactive diversity in structured populations," Chaos, Solitons & Fractals, Elsevier, vol. 201(P2).
    8. Zhenghong Wu & Huan Huang & Qinghu Liao, 2021. "The study on the role of dedicators on promoting cooperation in public goods game," PLOS ONE, Public Library of Science, vol. 16(9), pages 1-17, September.
    9. Ma, Lili & Su, Zhao & Lin, Peng & Wang, Kai & Pang, Xingbo & Li, Lin & Chen, Lin, 2026. "The impact of peer incentives that integrate node similarity on the evolution of cooperation in complex networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 682(C).
    10. Xie, Kai & Szolnoki, Attila, 2026. "Reinforcement learning in evolutionary game theory: A brief review of recent developments," Applied Mathematics and Computation, Elsevier, vol. 510(C).
    11. Mao, Yajun & Rong, Zhihai & Wu, Zhi-Xi, 2021. "Effect of collective influence on the evolution of cooperation in evolutionary prisoner’s dilemma games," Applied Mathematics and Computation, Elsevier, vol. 392(C).
    12. Gao, Liyan & Pan, Qiuhui & He, Mingfeng, 2021. "Effects of defensive cooperation strategy on the evolution of cooperation in social dilemma," Applied Mathematics and Computation, Elsevier, vol. 399(C).
    13. Zhang, Xiaoyang & Chen, Tong & Chen, Qiao & Li, Xueya, 2020. "Increasing pool funds in public goods: The effects of deposit-based delayed rewards," Chaos, Solitons & Fractals, Elsevier, vol. 140(C).
    14. Feng, Sinan & Liu, Xuesong, 2024. "Effects of three-faced strategy on the evolution of cooperation in social dilemma," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 639(C).
    15. Shen, Shaofei & Zhang, Xuejun & Xu, Aobo & Duan, Taisen, 2024. "An adaptive exploration mechanism for Q-learning in spatial public goods games," Chaos, Solitons & Fractals, Elsevier, vol. 189(P1).
    16. Lyu, Ding & Liu, Hanxiao & Wang, Lin & Wang, Xiaofan, 2024. "Evolution of cooperation in a mixed cooperative–competitive structured population," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 652(C).
    17. Gao, Liyan & Pan, Qiuhui & He, Mingfeng, 2021. "Environmental-based defensive promotes cooperation in the prisoner’s dilemma game," Applied Mathematics and Computation, Elsevier, vol. 401(C).
    18. Ueda, Masahiko & Fujita, Ayaka, 2025. "Zero-determinant strategies in repeated continuously-relaxed games," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 670(C).
    19. Li, Cong & Xu, Hedong & Fan, Suohai, 2020. "Synergistic effects of self-optimization and imitation rules on the evolution of cooperation in the investor sharing game," Applied Mathematics and Computation, Elsevier, vol. 370(C).
    20. Zhu, Yuying & Xing, Bohua & Xia, Chengyi, 2025. "Q-learning update with second-order reputation promotes the evolution of trust within structured populations," Chaos, Solitons & Fractals, Elsevier, vol. 199(P1).

    More about this item

    Keywords

    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:chsofr:v:199:y:2025:i:p1:s0960077925006411. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Thayer, Thomas R. (email available below). General contact details of provider: https://www.journals.elsevier.com/chaos-solitons-and-fractals .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.