IDEAS home Printed from https://ideas.repec.org/a/eee/apmaco/v507y2025ics0096300325003169.html

Q-learning driven cooperative evolution with dual-reputation incentive mechanisms

Author

Listed:
  • Zhang, Qianwei
  • Zhang, Xinran

Abstract

Reinforcement learning, as a powerful framework for analyzing strategic dynamics in evolutionary games, has gained significant traction in game theory research. In this study, we propose a dual-reputation incentive mechanism that integrates individual and group reputation metrics within the spatial Prisoner's Dilemma paradigm, aiming to elucidate how adaptive Q-learning drives the evolution of cooperation. Our approach combines traditional game payoffs with reputation-based rewards through a novel Q-learning reward function, strategically decomposing reputation into two components: individual rewards (quantifying an agent’s behavioral history) and group rewards (reflecting the collective reputation of their local neighborhood). Simulations demonstrate that when individual reputation rewards are prioritized, agents optimize long-term gains by dynamically adjusting strategies under strong motivational incentives, which ultimately enhances global cooperation levels. Microscopic analysis reveals that individual reputation incentives promote high-density cooperator clusters and facilitate cooperative behavior propagation. Furthermore, when a high weight is assigned to individual reputation rewards, evolutionary analysis demonstrates that cooperative Q-values consistently exceeds defective ones, indicating the emergence of cooperation as an evolutionarily stable strategy. This research provides theoretical insights for designing reputation-aware reinforcement learning systems to foster cooperation in real-world social dilemmas.

Suggested Citation

  • Zhang, Qianwei & Zhang, Xinran, 2025. "Q-learning driven cooperative evolution with dual-reputation incentive mechanisms," Applied Mathematics and Computation, Elsevier, vol. 507(C).
  • Handle: RePEc:eee:apmaco:v:507:y:2025:i:c:s0096300325003169
    DOI: 10.1016/j.amc.2025.129590
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0096300325003169
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.amc.2025.129590?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. Xie, Kai & Szolnoki, Attila, 2025. "Reputation in public goods cooperation under double Q-learning protocol," Chaos, Solitons & Fractals, Elsevier, vol. 196(C).
    2. Zheng, Guozhong & Zhang, Jiqiang & Deng, Shengfeng & Cai, Weiran & Chen, Li, 2024. "Evolution of cooperation in the public goods game with Q-learning," Chaos, Solitons & Fractals, Elsevier, vol. 188(C).
    3. Szolnoki, Attila & Danku, Zsuzsa, 2018. "Dynamic-sensitive cooperation in the presence of multiple strategy updating rules," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 511(C), pages 371-377.
    4. Cassar, Alessandra, 2007. "Coordination and cooperation in local, random and small world networks: Experimental evidence," Games and Economic Behavior, Elsevier, vol. 58(2), pages 209-230, February.
    5. Li, MingYuan & Kang, HongWei & Sun, XingPing & Shen, Yong & Chen, QingYi, 2022. "Replicator dynamics of public goods game with tax-based punishment," Chaos, Solitons & Fractals, Elsevier, vol. 164(C).
    6. Francisco C. Santos & Marta D. Santos & Jorge M. Pacheco, 2008. "Social diversity promotes the emergence of cooperation in public goods games," Nature, Nature, vol. 454(7201), pages 213-216, July.
    7. Christoph Hauert & Michael Doebeli, 2004. "Spatial structure often inhibits the evolution of cooperation in the snowdrift game," Nature, Nature, vol. 428(6983), pages 643-646, April.
    8. Wang, Jianwei & He, Jialu & Yu, Fengyuan, 2021. "Heterogeneity of reputation increment driven by individual influence promotes cooperation in spatial social dilemma," Chaos, Solitons & Fractals, Elsevier, vol. 146(C).
    9. Zhen Wang & Lin Wang & Zi-Yu Yin & Cheng-Yi Xia, 2012. "Inferring Reputation Promotes the Evolution of Cooperation in Spatial Social Dilemma Games," PLOS ONE, Public Library of Science, vol. 7(7), pages 1-9, July.
    10. Guo, Shiqiang & Wang, Juan & Zhao, Dawei & Xia, Chengyi, 2023. "Role of second-order reputation evaluation in the multi-player snowdrift game on scale-free simplicial complexes," Chaos, Solitons & Fractals, Elsevier, vol. 172(C).
    11. Hisashi Ohtsuki & Christoph Hauert & Erez Lieberman & Martin A. Nowak, 2006. "A simple rule for the evolution of cooperation on graphs and social networks," Nature, Nature, vol. 441(7092), pages 502-505, May.
    12. Zou, Kuan & Huang, Changwei, 2024. "Incorporating reputation into reinforcement learning can promote cooperation on hypergraphs," Chaos, Solitons & Fractals, Elsevier, vol. 186(C).
    13. Martin A. Nowak & Akira Sasaki & Christine Taylor & Drew Fudenberg, 2004. "Emergence of cooperation and evolutionary stability in finite populations," Nature, Nature, vol. 428(6983), pages 646-650, April.
    14. Wang, Pai & Yang, Zhihu, 2024. "The double-edged sword effect of conformity on cooperation in spatial Prisoner’s Dilemma Games with reinforcement learning," Chaos, Solitons & Fractals, Elsevier, vol. 187(C).
    15. Shen, Yong & Ma, Yujie & Kang, Hongwei & Sun, Xingping & Chen, Qingyi, 2024. "Learning and propagation: Evolutionary dynamics in spatial public goods games through combined Q-learning and Fermi rule," Chaos, Solitons & Fractals, Elsevier, vol. 187(C).
    16. Bi, Yan & Yang, Hui, 2023. "Based on reputation consistent strategy times promotes cooperation in spatial prisoner’s dilemma game," Applied Mathematics and Computation, Elsevier, vol. 444(C).
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Xie, Kai & Szolnoki, Attila, 2026. "Reinforcement learning in evolutionary game theory: A brief review of recent developments," Applied Mathematics and Computation, Elsevier, vol. 510(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Xie, Kai & Szolnoki, Attila, 2026. "Reinforcement learning in evolutionary game theory: A brief review of recent developments," Applied Mathematics and Computation, Elsevier, vol. 510(C).
    2. Xie, Kai & Szolnoki, Attila, 2025. "Reputation in public goods cooperation under double Q-learning protocol," Chaos, Solitons & Fractals, Elsevier, vol. 196(C).
    3. Bi, Yan & Hao, Qingyi & Liu, Kui, 2025. "The influence of reputation levels and disparities on the evolution of cooperation in spatial prisoner's dilemma game," Applied Mathematics and Computation, Elsevier, vol. 501(C).
    4. Wang, Pai & Yang, Zhihu, 2024. "The double-edged sword effect of conformity on cooperation in spatial Prisoner’s Dilemma Games with reinforcement learning," Chaos, Solitons & Fractals, Elsevier, vol. 187(C).
    5. Zhang, Xinran & Zhang, Qianwei & Liu, Jiaqi, 2025. "The selective imitation based on influence mechanism in evolutionary dynamics of the Prisoner’s dilemma game," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 661(C).
    6. Wang, Jianwei & Xu, Wenshu & Yu, Fengyuan & He, Jialu & Chen, Wei & Dai, Wenhui, 2024. "Evolution of cooperation under corrupt institutions," Chaos, Solitons & Fractals, Elsevier, vol. 184(C).
    7. Hu, Zhengyang & Zhu, Yuying & Zhao, Dawei & Xia, Chengyi, 2026. "The higher-order networked N-player trust game driven by reputation and reinforcement learning," Chaos, Solitons & Fractals, Elsevier, vol. 202(P2).
    8. Bi, Yan & Hao, Qingyi & Wu, Wenjun, 2024. "The warning effect of persistent defection strategy promotes cooperation in spatial prisoner’s dilemma game," Chaos, Solitons & Fractals, Elsevier, vol. 189(P1).
    9. Sarkar, Bijan, 2021. "The cooperation–defection evolution on social networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 584(C).
    10. Qi Su & Lei Zhou & Long Wang, 2019. "Evolutionary multiplayer games on graphs with edge diversity," PLOS Computational Biology, Public Library of Science, vol. 15(4), pages 1-22, April.
    11. Flávio L Pinheiro & Jorge M Pacheco & Francisco C Santos, 2012. "From Local to Global Dilemmas in Social Networks," PLOS ONE, Public Library of Science, vol. 7(2), pages 1-6, February.
    12. Shang, Lihui & Hu, Mingjian, 2025. "Enhancement of cooperation induced by taxation mechanism with progressive tax rates in spatial public goods games," Chaos, Solitons & Fractals, Elsevier, vol. 199(P2).
    13. Feng, Meiling & Li, Xuezhu & Zhao, Dawei & Xia, Chengyi, 2023. "Evolutionary dynamics with the second-order reputation in the networked N-player trust game," Chaos, Solitons & Fractals, Elsevier, vol. 175(P2).
    14. Li, Jiaying & Lv, Shaojie & Zhao, Changheng, 2025. "Public goods games with environmental feedbacks in well-mixed and structured populations," Chaos, Solitons & Fractals, Elsevier, vol. 201(P2).
    15. Du, Faqi & Fu, Feng, 2013. "Quantifying the impact of noise on macroscopic organization of cooperation in spatial games," Chaos, Solitons & Fractals, Elsevier, vol. 56(C), pages 35-44.
    16. Anzhi Sheng & Aming Li & Long Wang, 2023. "Evolutionary dynamics on sequential temporal networks," PLOS Computational Biology, Public Library of Science, vol. 19(8), pages 1-19, August.
    17. Li, Yan & Ye, Hang, 2015. "Effect of migration based on strategy and cost on the evolution of cooperation," Chaos, Solitons & Fractals, Elsevier, vol. 76(C), pages 156-165.
    18. Yao Meng & Sean P. Cornelius & Yang-Yu Liu & Aming Li, 2024. "Dynamics of collective cooperation under personalised strategy updates," Nature Communications, Nature, vol. 15(1), pages 1-11, December.
    19. Konno, Tomohiko, 2013. "An imperfect competition on scale-free networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 392(21), pages 5453-5460.
    20. Te Wu & Feng Fu & Long Wang, 2011. "Moving Away from Nasty Encounters Enhances Cooperation in Ecological Prisoner's Dilemma Game," PLOS ONE, Public Library of Science, vol. 6(11), pages 1-7, November.

    More about this item

    Keywords

    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:apmaco:v:507:y:2025:i:c:s0096300325003169. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: https://www.journals.elsevier.com/applied-mathematics-and-computation .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.