Printed from https://ideas.repec.org/a/eee/apmaco/v463y2024ics0096300323005337.html

Interaction state Q-learning promotes cooperation in the spatial prisoner's dilemma game

Author

Listed:
  • Yang, Zhengzhi
  • Zheng, Lei
  • Perc, Matjaž
  • Li, Yumeng

Abstract

Many recent studies have used reinforcement learning methods to investigate the behavior of agents in evolutionary games. Q-learning, in particular, has become a mainstream method during this development. Here we introduce Q-learning agents into the evolutionary prisoner's dilemma game on a square lattice. Specifically, we associate the state space of Q-learning agents with the strategies of their neighbors, and we introduce a neighboring reward information sharing mechanism. We thus provide Q-learning agents with the payoff information of their neighbors, in addition to their strategies, which has not been done in previous studies. Through simulations, we show that considering neighborhood payoff information can significantly promote cooperation in the population. Moreover, we show that for an appropriate strength of neighborhood payoff information sharing, a chessboard pattern emerges on the lattice. We analyze in detail the reasons for the emergence of the chessboard pattern and the increase in cooperation frequency, and we also provide a theoretical analysis based on the pair approximation method. We hope that our research will inspire effective approaches for resolving social dilemmas by means of sharing more information among reinforcement learning agents during evolutionary games.
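The mechanism described in the abstract — Q-learning agents whose state is their neighbors' strategy profile, with rewards blended with neighborhood payoff information — can be sketched as follows. This is a minimal illustrative sketch, not the paper's implementation: the state encoding (number of cooperating neighbors), the sharing weight `w`, and all parameter values are assumptions for illustration.

```python
import random

class QLearningAgent:
    """Q-learning agent whose state is the cooperation profile of its
    neighborhood, encoded here (an assumption) as the count of
    cooperating neighbors. Action 0 = defect, 1 = cooperate."""

    def __init__(self, alpha=0.1, gamma=0.9, epsilon=0.05, n_neighbors=4):
        self.alpha = alpha      # learning rate
        self.gamma = gamma      # discount factor
        self.epsilon = epsilon  # exploration rate
        # Q-table: one row per state (0..n_neighbors cooperating neighbors),
        # two columns for the actions defect/cooperate.
        self.q = [[0.0, 0.0] for _ in range(n_neighbors + 1)]

    def choose(self, state):
        """Epsilon-greedy action selection."""
        if random.random() < self.epsilon:
            return random.randint(0, 1)
        return 0 if self.q[state][0] > self.q[state][1] else 1

    def update(self, state, action, reward, next_state):
        """Standard Q-learning temporal-difference update."""
        best_next = max(self.q[next_state])
        td_error = reward + self.gamma * best_next - self.q[state][action]
        self.q[state][action] += self.alpha * td_error


def shared_reward(own_payoff, neighbor_payoffs, w=0.5):
    """Blend an agent's own payoff with the mean payoff of its neighbors.
    `w` is a hypothetical sharing-strength parameter standing in for the
    paper's information-sharing strength; w=0 recovers plain Q-learning."""
    if not neighbor_payoffs:
        return own_payoff
    return (1 - w) * own_payoff + w * sum(neighbor_payoffs) / len(neighbor_payoffs)
```

In a full simulation each lattice site would hold one such agent; after a round of prisoner's dilemma games with its four neighbors, the agent would call `shared_reward` with its own payoff and its neighbors' payoffs, then `update` with the old and new neighborhood states. The claimed chessboard pattern would emerge from these coupled updates at suitable sharing strengths.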

Suggested Citation

  • Yang, Zhengzhi & Zheng, Lei & Perc, Matjaž & Li, Yumeng, 2024. "Interaction state Q-learning promotes cooperation in the spatial prisoner's dilemma game," Applied Mathematics and Computation, Elsevier, vol. 463(C).
  • Handle: RePEc:eee:apmaco:v:463:y:2024:i:c:s0096300323005337
    DOI: 10.1016/j.amc.2023.128364

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0096300323005337
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.amc.2023.128364?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    1. Cao, Xian-Bin & Du, Wen-Bo & Rong, Zhi-Hai, 2010. "The evolutionary public goods game on scale-free networks with heterogeneous investment," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 389(6), pages 1273-1280.
    2. Takahiro Ezaki & Yutaka Horita & Masanori Takezawa & Naoki Masuda, 2016. "Reinforcement Learning Explains Conditional Cooperation and Its Moody Cousin," PLOS Computational Biology, Public Library of Science, vol. 12(7), pages 1-13, July.
    3. Christoph Hauert & Michael Doebeli, 2004. "Spatial structure often inhibits the evolution of cooperation in the snowdrift game," Nature, Nature, vol. 428(6983), pages 643-646, April.
    4. Li, Yumeng & Zhang, Jun & Perc, Matjaž, 2018. "Effects of compassion on the evolution of cooperation in spatial social dilemmas," Applied Mathematics and Computation, Elsevier, vol. 320(C), pages 437-443.
    5. Li, Yumeng & Wang, Hanchen & Du, Wenbo & Perc, Matjaž & Cao, Xianbin & Zhang, Jun, 2019. "Resonance-like cooperation due to transaction costs in the prisoner’s dilemma game," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 521(C), pages 248-257.
    6. Oriol Vinyals & Igor Babuschkin & Wojciech M. Czarnecki & Michaël Mathieu & Andrew Dudzik & Junyoung Chung & David H. Choi & Richard Powell & Timo Ewalds & Petko Georgiev & Junhyuk Oh & Dan Horgan & M, 2019. "Grandmaster level in StarCraft II using multi-agent reinforcement learning," Nature, Nature, vol. 575(7782), pages 350-354, November.
    7. Theodor Cimpeanu & Francisco C. Santos & The Anh Han, 2023. "Does Spending More Always Ensure Higher Cooperation? An Analysis of Institutional Incentives on Heterogeneous Networks," Dynamic Games and Applications, Springer, vol. 13(4), pages 1236-1255, December.
    8. Ding, Hong & Zhang, Geng-shun & Wang, Shi-hao & Li, Juan & Wang, Zhen, 2019. "Q-learning boosts the evolution of cooperation in structured population by involving extortion," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 536(C).
    9. Cimpeanu, Theodor & Di Stefano, Alessandro & Perret, Cedric & Han, The Anh, 2023. "Social diversity reduces the complexity and cost of fostering fairness," Chaos, Solitons & Fractals, Elsevier, vol. 167(C).
    10. Du, Wen-Bo & Zheng, Hao-Ran & Hu, Mao-Bin, 2008. "Evolutionary prisoner’s dilemma game on weighted scale-free networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 387(14), pages 3796-3800.
    11. Wang, Shengxian & Chen, Xiaojie & Xiao, Zhilong & Szolnoki, Attila, 2022. "Decentralized incentives for general well-being in networked public goods game," Applied Mathematics and Computation, Elsevier, vol. 431(C).
    12. Wang, Hanchen & Sun, Yichun & Zheng, Lei & Du, Wenbo & Li, Yumeng, 2018. "The public goods game on scale-free networks with heterogeneous investment," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 509(C), pages 396-404.
    13. Lee, Hsuan-Wei & Cleveland, Colin & Szolnoki, Attila, 2022. "Mercenary punishment in structured populations," Applied Mathematics and Computation, Elsevier, vol. 417(C).
    14. Marco Alberto Javarone & Daniele Marinazzo, 2017. "Evolutionary dynamics of group formation," PLOS ONE, Public Library of Science, vol. 12(11), pages 1-10, November.
    15. David Silver & Julian Schrittwieser & Karen Simonyan & Ioannis Antonoglou & Aja Huang & Arthur Guez & Thomas Hubert & Lucas Baker & Matthew Lai & Adrian Bolton & Yutian Chen & Timothy Lillicrap & Fan , 2017. "Mastering the game of Go without human knowledge," Nature, Nature, vol. 550(7676), pages 354-359, October.
    16. Wang, Chaoqian & Szolnoki, Attila, 2023. "Inertia in spatial public goods games under weak selection," Applied Mathematics and Computation, Elsevier, vol. 449(C).
    17. Geng, Yini & Liu, Yifan & Lu, Yikang & Shen, Chen & Shi, Lei, 2022. "Reinforcement learning explains various conditional cooperation," Applied Mathematics and Computation, Elsevier, vol. 427(C).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Sun, Jiaqin & Fan, Ruguo & Luo, Ming & Zhang, Yingqing & Dong, Lili, 2018. "The evolution of cooperation in spatial prisoner’s dilemma game with dynamic relationship-based preferential learning," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 512(C), pages 598-611.
    2. Yang, Han-Xin & Yang, Jing, 2019. "Reputation-based investment strategy promotes cooperation in public goods games," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 523(C), pages 886-893.
    3. Lee, Hsuan-Wei & Cleveland, Colin & Szolnoki, Attila, 2023. "Group-size dependent synergy in heterogeneous populations," Chaos, Solitons & Fractals, Elsevier, vol. 167(C).
    4. Molnar, Grant & Hammond, Caroline & Fu, Feng, 2023. "Reactive means in the iterated Prisoner’s dilemma," Applied Mathematics and Computation, Elsevier, vol. 458(C).
    5. Ping Zhu & Guiyi Wei, 2014. "Stochastic Heterogeneous Interaction Promotes Cooperation in Spatial Prisoner's Dilemma Game," PLOS ONE, Public Library of Science, vol. 9(4), pages 1-10, April.
    6. Qinghu Liao & Wenwen Dong & Boxin Zhao, 2023. "A New Strategy to Solve “the Tragedy of the Commons” in Sustainable Grassland Ecological Compensation: Experience from Inner Mongolia, China," Sustainability, MDPI, vol. 15(12), pages 1-24, June.
    7. Yu, Fengyuan & Wang, Jianwei & Chen, Wei & He, Jialu, 2023. "Increased cooperation potential and risk under suppressed strategy differentiation," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 621(C).
    8. Zha, Jiajing & Li, Cong & Fan, Suohai, 2022. "The effect of stability-based strategy updating on cooperation in evolutionary social dilemmas," Applied Mathematics and Computation, Elsevier, vol. 413(C).
    9. Lee, Hsuan-Wei & Cleveland, Colin & Szolnoki, Attila, 2021. "Small fraction of selective cooperators can elevate general wellbeing significantly," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 582(C).
    10. Quan, Ji & Zhou, Yawen & Wang, Xianjia & Yang, Jian-Bo, 2020. "Information fusion based on reputation and payoff promotes cooperation in spatial public goods game," Applied Mathematics and Computation, Elsevier, vol. 368(C).
    11. Chen, Qiao & Chen, Tong & Wang, Yongjie, 2019. "Cleverly handling the donation information can promote cooperation in public goods game," Applied Mathematics and Computation, Elsevier, vol. 346(C), pages 363-373.
    12. Zhu, Wenqiang & Pan, Qiuhui & Song, Sha & He, Mingfeng, 2023. "Effects of exposure-based reward and punishment on the evolution of cooperation in prisoner’s dilemma game," Chaos, Solitons & Fractals, Elsevier, vol. 172(C).
    13. Lee, Hsuan-Wei & Cleveland, Colin & Szolnoki, Attila, 2023. "Restoring spatial cooperation with myopic agents in a three-strategy social dilemma," Applied Mathematics and Computation, Elsevier, vol. 458(C).
    14. Li, Wenqing & Ni, Shaoquan, 2022. "Train timetabling with the general learning environment and multi-agent deep reinforcement learning," Transportation Research Part B: Methodological, Elsevier, vol. 157(C), pages 230-251.
    15. Geng, Yini & Liu, Yifan & Lu, Yikang & Shen, Chen & Shi, Lei, 2022. "Reinforcement learning explains various conditional cooperation," Applied Mathematics and Computation, Elsevier, vol. 427(C).
    16. Qingyan Li & Tao Lin & Qianyi Yu & Hui Du & Jun Li & Xiyue Fu, 2023. "Review of Deep Reinforcement Learning and Its Application in Modern Renewable Power System Control," Energies, MDPI, vol. 16(10), pages 1-23, May.
    17. Michael Curry & Alexander Trott & Soham Phade & Yu Bai & Stephan Zheng, 2022. "Analyzing Micro-Founded General Equilibrium Models with Many Agents using Deep Reinforcement Learning," Papers 2201.01163, arXiv.org, revised Feb 2022.
    18. Dong Liu & Feng Xiao & Jian Luo & Fan Yang, 2023. "Deep Reinforcement Learning-Based Holding Control for Bus Bunching under Stochastic Travel Time and Demand," Sustainability, MDPI, vol. 15(14), pages 1-18, July.
    19. Jia, Chun-Xiao & Liu, Run-Ran, 2022. "A moderate self-interest preference promotes cooperation in spatial public goods game," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 589(C).
    20. Song, Sha & Pan, Qiuhui & Zhu, Wenqiang & He, Mingfeng, 2023. "Evolution of cooperation in games with dual attribute strategy," Chaos, Solitons & Fractals, Elsevier, vol. 175(P1).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:apmaco:v:463:y:2024:i:c:s0096300323005337. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows you to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form.

    If you know of missing items citing this one, you can help us create those links by adding the relevant references in the same way as above, for each referring item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: https://www.journals.elsevier.com/applied-mathematics-and-computation .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.