IDEAS home Printed from https://ideas.repec.org/a/eee/chsofr/v199y2025ip2s0960077925007751.html

PPO-ACT: Proximal policy optimization with adversarial curriculum transfer for spatial public goods games

Author

Listed:
  • Yang, Zhaoqilin
  • Li, Chanchan
  • Wang, Xin
  • Tian, Youliang

Abstract

This study investigates cooperation evolution mechanisms in the spatial public goods game. A novel deep reinforcement learning framework, Proximal Policy Optimization with Adversarial Curriculum Transfer (PPO-ACT), is proposed to model agent strategy optimization in dynamic environments. Traditional evolutionary game models often exhibit limitations in modeling long-term decision-making processes. Imitation-based rules (e.g., Fermi) lack strategic foresight, while tabular methods (e.g., Q-learning) fail to capture spatial–temporal correlations. Deep reinforcement learning effectively addresses these limitation by bridging policy gradient methods with evolutionary game theory. Our study pioneers the application of proximal policy optimization’s continuous strategy optimization capability to public goods games through a two-stage adversarial curriculum transfer training paradigm. The experimental results show that PPO-ACT performs better in critical enhancement factor regimes. Compared to conventional standard proximal policy optimization methods, Q-learning and Fermi update rules, achieve earlier cooperation phase transitions and maintain stable cooperative equilibria. This framework exhibits better robustness when handling challenging scenarios like all-defector initial conditions. Systematic comparisons reveal the unique advantage of policy gradient methods in population-scale cooperation, i.e., achieving spatiotemporal payoff coordination through value function propagation. Our work provides a new computational framework for studying cooperation emergence in complex systems, algorithmically validating the punishment promotes cooperation hypothesis while offering methodological insights for multi-agent system strategy design.

Suggested Citation

  • Yang, Zhaoqilin & Li, Chanchan & Wang, Xin & Tian, Youliang, 2025. "PPO-ACT: Proximal policy optimization with adversarial curriculum transfer for spatial public goods games," Chaos, Solitons & Fractals, Elsevier, vol. 199(P2).
  • Handle: RePEc:eee:chsofr:v:199:y:2025:i:p2:s0960077925007751
    DOI: 10.1016/j.chaos.2025.116762
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0960077925007751
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.chaos.2025.116762?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. Duh, Maja & Gosak, Marko & Perc, Matjaž, 2021. "Public goods games on random hyperbolic graphs with mixing," Chaos, Solitons & Fractals, Elsevier, vol. 144(C).
    2. Cao, Xian-Bin & Du, Wen-Bo & Rong, Zhi-Hai, 2010. "The evolutionary public goods game on scale-free networks with heterogeneous investment," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 389(6), pages 1273-1280.
    3. Lipowski, Adam & Gontarek, Krzysztof & Ausloos, Marcel, 2009. "Statistical mechanics approach to a reinforcement learning model with memory," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 388(9), pages 1849-1856.
    4. Zhang, Haifeng & Yang, Hanxin & Du, Wenbo & Wang, Binghong & Cao, Xianbin, 2010. "Evolutionary public goods games on scale-free networks with unequal payoff allocation mechanism," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 389(5), pages 1099-1104.
    5. Dawes, Robyn M & Thaler, Richard H, 1988. "Anomalies: Cooperation," Journal of Economic Perspectives, American Economic Association, vol. 2(3), pages 187-197, Summer.
    6. Valerio Capraro & Roberto Di Paolo & Matjaz Perc & Veronica Pizziol, 2024. "Language-based game theory in the age of artificial intelligence," Papers 2403.08944, arXiv.org.
    7. Lee, Hsuan-Wei & Cleveland, Colin & Szolnoki, Attila, 2024. "Supporting punishment via taxation in a structured population," Chaos, Solitons & Fractals, Elsevier, vol. 178(C).
    8. Liu, Jinzhuo & Meng, Haoran & Wang, Wei & Li, Tong & Yu, Yong, 2018. "Synergy punishment promotes cooperation in spatial public good game," Chaos, Solitons & Fractals, Elsevier, vol. 109(C), pages 214-218.
    9. Shen, Yong & Ma, Yujie & Kang, Hongwei & Sun, Xingping & Chen, Qingyi, 2024. "Learning and propagation: Evolutionary dynamics in spatial public goods games through combined Q-learning and Fermi rule," Chaos, Solitons & Fractals, Elsevier, vol. 187(C).
    10. Izquierdo, Luis R. & Izquierdo, Segismundo S. & Gotts, Nicholas M. & Polhill, J. Gary, 2007. "Transient and asymptotic dynamics of reinforcement learning in games," Games and Economic Behavior, Elsevier, vol. 61(2), pages 259-276, November.
    11. Isamu Okada & Hitoshi Yamamoto & Fujio Toriumi & Tatsuya Sasaki, 2015. "The Effect of Incentives and Meta-incentives on the Evolution of Cooperation," PLOS Computational Biology, Public Library of Science, vol. 11(5), pages 1-17, May.
    12. Quan, Ji & Zhou, Yawen & Wang, Xianjia & Yang, Jian-Bo, 2020. "Information fusion based on reputation and payoff promotes cooperation in spatial public goods game," Applied Mathematics and Computation, Elsevier, vol. 368(C).
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Yang, Zhaoqilin & Wang, Xin & Zhang, Ruichen & Li, Chanchan & Tian, Youliang, 2025. "TUC-PPO: Team Utility-Constrained Proximal Policy Optimization for spatial public goods games," Chaos, Solitons & Fractals, Elsevier, vol. 199(P3).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Yang, Zhaoqilin & Wang, Xin & Zhang, Ruichen & Li, Chanchan & Tian, Youliang, 2025. "TUC-PPO: Team Utility-Constrained Proximal Policy Optimization for spatial public goods games," Chaos, Solitons & Fractals, Elsevier, vol. 199(P3).
    2. Shen, Yong & Ma, Yujie & Kang, Hongwei & Sun, Xingping & Chen, Qingyi, 2024. "Learning and propagation: Evolutionary dynamics in spatial public goods games through combined Q-learning and Fermi rule," Chaos, Solitons & Fractals, Elsevier, vol. 187(C).
    3. Kang, Hongwei & Jiang, Chao & Shen, Yong & Sun, Xingping & Chen, Qingyi, 2025. "Neighbor-aware reinforcement learning fosters cooperation in spatial public goods games," Chaos, Solitons & Fractals, Elsevier, vol. 199(P3).
    4. Quan, Ji & Tang, Caixia & Wang, Xianjia, 2021. "Reputation-based discount effect in imitation on the evolution of cooperation in spatial public goods games," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 563(C).
    5. Sun, Xingping & Zhu, Haoran & Kang, Hongwei & Bi, Yanzheng & Shen, Yong & Chen, Qingyi, 2025. "The impact of memory reputation-induced tax and reward allocation on spatial public goods games," Chaos, Solitons & Fractals, Elsevier, vol. 195(C).
    6. Li, Cong & Xu, Hedong & Fan, Suohai, 2020. "Synergistic effects of self-optimization and imitation rules on the evolution of cooperation in the investor sharing game," Applied Mathematics and Computation, Elsevier, vol. 370(C).
    7. Ping Zhu & Guiyi Wei, 2014. "Stochastic Heterogeneous Interaction Promotes Cooperation in Spatial Prisoner's Dilemma Game," PLOS ONE, Public Library of Science, vol. 9(4), pages 1-10, April.
    8. Qinghu Liao & Wenwen Dong & Boxin Zhao, 2023. "A New Strategy to Solve “the Tragedy of the Commons” in Sustainable Grassland Ecological Compensation: Experience from Inner Mongolia, China," Sustainability, MDPI, vol. 15(12), pages 1-24, June.
    9. Quan, Ji & Yang, Xiukang & Wang, Xianjia, 2018. "Spatial public goods game with continuous contributions based on Particle Swarm Optimization learning and the evolution of cooperation," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 505(C), pages 973-983.
    10. Fan, Ruguo & Zhang, Yingqing & Luo, Ming & Zhang, Hongjuan, 2017. "Promotion of cooperation induced by heterogeneity of both investment and payoff allocation in spatial public goods game," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 465(C), pages 454-463.
    11. Li, Dandan & Wu, Qiongzi & Han, Dun, 2025. "On evolution of agent behavior under limited gaming time with reinforcement learning," Chaos, Solitons & Fractals, Elsevier, vol. 194(C).
    12. Zha, Jiajing & Li, Cong & Fan, Suohai, 2022. "The effect of stability-based strategy updating on cooperation in evolutionary social dilemmas," Applied Mathematics and Computation, Elsevier, vol. 413(C).
    13. Sun, Jiaqin & Fan, Ruguo & Luo, Ming & Zhang, Yingqing & Dong, Lili, 2018. "The evolution of cooperation in spatial prisoner’s dilemma game with dynamic relationship-based preferential learning," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 512(C), pages 598-611.
    14. Yu, Fengyuan & Wang, Jianwei & He, Jialu, 2022. "Inequal dependence on members stabilizes cooperation in spatial public goods game," Chaos, Solitons & Fractals, Elsevier, vol. 165(P1).
    15. Tian, Lin-Lin & Li, Ming-Chu & Lu, Kun & Zhao, Xiao-Wei & Wang, Zhen, 2013. "The influence of age-driven investment on cooperation in spatial public goods games," Chaos, Solitons & Fractals, Elsevier, vol. 54(C), pages 65-70.
    16. Griffin, Christopher & Semonsen, Justin & Belmonte, Andrew, 2022. "Generalized Hamiltonian dynamics and chaos in evolutionary games on networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 597(C).
    17. Wang, Hanchen & Sun, Yichun & Zheng, Lei & Du, Wenbo & Li, Yumeng, 2018. "The public goods game on scale-free networks with heterogeneous investment," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 509(C), pages 396-404.
    18. Lee, Hsuan-Wei & Cleveland, Colin & Szolnoki, Attila, 2021. "Small fraction of selective cooperators can elevate general wellbeing significantly," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 582(C).
    19. Quan, Ji & Zhou, Yawen & Wang, Xianjia & Yang, Jian-Bo, 2020. "Information fusion based on reputation and payoff promotes cooperation in spatial public goods game," Applied Mathematics and Computation, Elsevier, vol. 368(C).
    20. Kurokawa, Shun, 2019. "How memory cost, switching cost, and payoff non-linearity affect the evolution of persistence," Applied Mathematics and Computation, Elsevier, vol. 341(C), pages 174-192.

    More about this item

    Keywords

    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:chsofr:v:199:y:2025:i:p2:s0960077925007751. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Thayer, Thomas R. (email available below). General contact details of provider: https://www.journals.elsevier.com/chaos-solitons-and-fractals .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.