IDEAS home Printed from https://ideas.repec.org/a/eee/apmaco/v427y2022ics0096300322002569.html
   My bibliography  Save this article

Reinforcement learning explains various conditional cooperation

Author

Listed:
  • Geng, Yini
  • Liu, Yifan
  • Lu, Yikang
  • Shen, Chen
  • Shi, Lei

Abstract

Recent studies show that different update rules are invariant regarding the evolutionary outcomes for a well-mixed population or homogeneous network. In this paper, we investigate how the Q-learning algorithm, one of the reinforcement learning methods, affects the evolutionary outcomes in square lattice. Especially, we consider the mixed strategy update rule, among which some agents adopt Q-learning method to update their strategies, the proportion of these agents (these agents are denoted as Artificial Intelligence (AI)) is controlled by a simple parameter ρ. The rest of other agents, the proportion is denoted by 1 − ρ, adopt the Fermi function to update their strategies. Through extensive numerical simulations, we found that the mixed strategy-update rule can facilitate cooperation compared with the pure Fermi- function-based update rule. Besides, if the proportion of AI is moderate, cooperators among the whole population exhibit conditional behavior and moody conditional behavior. However, if the whole population adopts the pure Fermi-function-based strategy update rule or the pure Q-learning-based strategy update rule, then cooperators among the whole population exhibit the hump-shaped conditional behavior. Our results provide a new insight to understand the evolution of cooperation from AI's view.

Suggested Citation

  • Geng, Yini & Liu, Yifan & Lu, Yikang & Shen, Chen & Shi, Lei, 2022. "Reinforcement learning explains various conditional cooperation," Applied Mathematics and Computation, Elsevier, vol. 427(C).
  • Handle: RePEc:eee:apmaco:v:427:y:2022:i:c:s0096300322002569
    DOI: 10.1016/j.amc.2022.127182
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0096300322002569
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.amc.2022.127182?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Li, Zhibin & Jia, Danyang & Guo, Hao & Geng, Yini & Shen, Chen & Wang, Zhen & Li, Xuelong, 2019. "The effect of multigame on cooperation in spatial network," Applied Mathematics and Computation, Elsevier, vol. 351(C), pages 162-167.
    2. Ananish Chaudhuri & Tirnud Paichayontvijit, 2006. "Conditional cooperation and voluntary contributions to a public good," Economics Bulletin, AccessEcon, vol. 3(8), pages 1-14.
    3. Hu, Mao-Bin & Jiang, Rui & Wu, Qing-Song & Wu, Yong-Hong, 2007. "Simulating the wealth distribution with a Richest-Following strategy on scale-free network," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 381(C), pages 467-472.
    4. Fischbacher, Urs & Gachter, Simon & Fehr, Ernst, 2001. "Are people conditionally cooperative? Evidence from a public goods experiment," Economics Letters, Elsevier, vol. 71(3), pages 397-404, June.
    5. Claudia Keser & Frans Van Winden, 2000. "Conditional Cooperation and Voluntary Contributions to Public Goods," Scandinavian Journal of Economics, Wiley Blackwell, vol. 102(1), pages 23-39, March.
    6. Makowsky, Michael D. & Orman, Wafa Hakim & Peart, Sandra J., 2014. "Playing with other people's money: Contributions to public goods by trustees," Journal of Behavioral and Experimental Economics (formerly The Journal of Socio-Economics), Elsevier, vol. 53(C), pages 44-55.
    7. Takahiro Ezaki & Yutaka Horita & Masanori Takezawa & Naoki Masuda, 2016. "Reinforcement Learning Explains Conditional Cooperation and Its Moody Cousin," PLOS Computational Biology, Public Library of Science, vol. 12(7), pages 1-13, July.
    8. V. S. Borkar, 2002. "Reinforcement Learning In Markovian Evolutionary Games," Advances in Complex Systems (ACS), World Scientific Publishing Co. Pte. Ltd., vol. 5(01), pages 55-72.
    9. Muller, Laurent & Sefton, Martin & Steinberg, Richard & Vesterlund, Lise, 2008. "Strategic behavior and learning in repeated voluntary contribution experiments," Journal of Economic Behavior & Organization, Elsevier, vol. 67(3-4), pages 782-793, September.
    10. repec:ebl:ecbull:v:3:y:2006:i:8:p:1-14 is not listed on IDEAS
    11. Dui, Hongyan & Meng, Xueyu & Xiao, Hui & Guo, Jianjun, 2020. "Analysis of the cascading failure for scale-free networks based on a multi-strategy evolutionary game," Reliability Engineering and System Safety, Elsevier, vol. 199(C).
    12. Bruno S. Frey & Stephan Meier, 2004. "Social Comparisons and Pro-social Behavior: Testing "Conditional Cooperation" in a Field Experiment," American Economic Review, American Economic Association, vol. 94(5), pages 1717-1722, December.
    13. K. M. Ariful Kabir & Jun Tanimoto & Zhen Wang, 2018. "Influence of bolstering network reciprocity in the evolutionary spatial Prisoner’s Dilemma game: a perspective," The European Physical Journal B: Condensed Matter and Complex Systems, Springer;EDP Sciences, vol. 91(12), pages 1-10, December.
    14. Saral, Ali Seyhun, 2020. "Evolution of Conditional Cooperation in Prisoner's Dilemma," OSF Preprints wcpkz, Center for Open Science.
    15. Oriol Vinyals & Igor Babuschkin & Wojciech M. Czarnecki & Michaël Mathieu & Andrew Dudzik & Junyoung Chung & David H. Choi & Richard Powell & Timo Ewalds & Petko Georgiev & Junhyuk Oh & Dan Horgan & M, 2019. "Grandmaster level in StarCraft II using multi-agent reinforcement learning," Nature, Nature, vol. 575(7782), pages 350-354, November.
    16. Jin Zhang & Dietmar Maringer, 2016. "Using a Genetic Algorithm to Improve Recurrent Reinforcement Learning for Equity Trading," Computational Economics, Springer;Society for Computational Economics, vol. 47(4), pages 551-567, April.
    17. Guo, Tian & Guo, Mi & Zhang, Yan & Liang, Shuanglu, 2019. "The effect of aspiration on the evolution of cooperation in spatial multigame," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 525(C), pages 27-32.
    18. Karen Evelyn Hauge & Kjell Arne Brekke & Karine Nyborg & Jo Thori Lind, 2019. "Sustaining cooperation through self-sorting: The good, the bad, and the conditional," Proceedings of the National Academy of Sciences, Proceedings of the National Academy of Sciences, vol. 116(12), pages 5299-5304, March.
    19. Hongyan Cheng & Qionglin Dai & Haihong Li & Xiaolan Qian & Mei Zhang & Junzhong Yang, 2013. "Effects of directional migration on prisoner’s dilemma game in a square domain," The European Physical Journal B: Condensed Matter and Complex Systems, Springer;EDP Sciences, vol. 86(4), pages 1-6, April.
    20. Bruno Frey & Stephan Meier, 2004. "In a field experiment," Natural Field Experiments 00243, The Field Experiments Website.
    21. Tanimoto, Jun, 2015. "The impact of initial cooperation fraction on the evolutionary fate in a spatial prisoner's dilemma game," Applied Mathematics and Computation, Elsevier, vol. 263(C), pages 171-188.
    22. Nagashima, Keisuke & Tanimoto, Jun, 2019. "A stochastic Pairwise Fermi rule modified by utilizing the average in payoff differences of neighbors leads to increased network reciprocity in spatial prisoner's dilemma games," Applied Mathematics and Computation, Elsevier, vol. 361(C), pages 661-669.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Jiang, Bei & Yuan, Lin & Zou, Rongcheng & Su, Rui & Mi, Yuqiang, 2023. "The effect of migration on vaccination dilemma in networked populations," Chaos, Solitons & Fractals, Elsevier, vol. 170(C).
    2. Di, Changyan & Zhou, Qingguo & Shen, Jun & Wang, Jinqiang & Zhou, Rui & Wang, Tianyi, 2023. "The coupling effect between the environment and strategies drives the emergence of group cooperation," Chaos, Solitons & Fractals, Elsevier, vol. 176(C).
    3. Yang, Zhengzhi & Zheng, Lei & Perc, Matjaž & Li, Yumeng, 2024. "Interaction state Q-learning promotes cooperation in the spatial prisoner's dilemma game," Applied Mathematics and Computation, Elsevier, vol. 463(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Simon Gaechter, 2006. "Conditional cooperation: Behavioral regularities from the lab and the field and their policy implications," Discussion Papers 2006-03, The Centre for Decision Research and Experimental Economics, School of Economics, University of Nottingham.
    2. Urs Fischbacher & Simon Gachter, 2010. "Social Preferences, Beliefs, and the Dynamics of Free Riding in Public Goods Experiments," American Economic Review, American Economic Association, vol. 100(1), pages 541-556, March.
    3. Urs Fischbacher & Simon G�chter, 2005. "Heterogeneous social preferences and the dynamics of free riding in public goods," IEW - Working Papers 261, Institute for Empirical Research in Economics - University of Zurich.
    4. M. Vittoria Levati & Ro’i Zultan, 2011. "Cycles of Conditional Cooperation in a Real-Time Voluntary Contribution Mechanism," Games, MDPI, vol. 2(1), pages 1-15, January.
    5. Simon Gaechter & Elke Renner, 2014. "Leaders as Role Models for the Voluntary Provision of Public Goods," Discussion Papers 2014-11, The Centre for Decision Research and Experimental Economics, School of Economics, University of Nottingham.
    6. Simon Gaechter & Daniele Nosenzo & Elke Renner & Martin Sefton, 2009. "Sequential versus simultaneous contributions to public goods: Experimental evidence," Discussion Papers 2009-07, The Centre for Decision Research and Experimental Economics, School of Economics, University of Nottingham.
    7. Yin, Xile & Li, Jianbiao & Bao, Te, 2019. "Does overconfidence promote cooperation? Theory and experimental evidence," Journal of Behavioral and Experimental Economics (formerly The Journal of Socio-Economics), Elsevier, vol. 79(C), pages 119-133.
    8. Simon Gaechter & Daniele Nosenzo & Elke Renner & Martin Sefton, 2009. "Sequential versus simultaneous contributions to public goods: Experimental evidence," Discussion Papers 2009-07, The Centre for Decision Research and Experimental Economics, School of Economics, University of Nottingham.
    9. Gächter, Simon & Renner, Elke, 2018. "Leaders as role models and ‘belief managers’ in social dilemmas," Journal of Economic Behavior & Organization, Elsevier, vol. 154(C), pages 321-334.
    10. Josephine G. Gatua, 2021. "Information and cooperation in preventive health behavior: The case of bed net use in rural Kenya," Health Economics, John Wiley & Sons, Ltd., vol. 30(9), pages 2124-2143, September.
    11. Ananish Chaudhuri, 2011. "Sustaining cooperation in laboratory public goods experiments: a selective survey of the literature," Experimental Economics, Springer;Economic Science Association, vol. 14(1), pages 47-83, March.
    12. Drouvelis, Michalis & Marx, Benjamin M., 2022. "Can charitable appeals identify and exploit belief heterogeneity?," Journal of Economic Behavior & Organization, Elsevier, vol. 198(C), pages 631-649.
    13. Dong, Bin & Dulleck, Uwe & Torgler, Benno, 2012. "Conditional corruption," Journal of Economic Psychology, Elsevier, vol. 33(3), pages 609-627.
    14. Thorsten Chmura & Christoph Engel & Markus Englerth, 2013. "Selfishness As a Potential Cause of Crime. A Prison Experiment," Discussion Paper Series of the Max Planck Institute for Research on Collective Goods 2013_05, Max Planck Institute for Research on Collective Goods.
    15. Vyrastekova, Jana & Funaki, Yukihiko, 2018. "Cooperation in a sequential dilemma game: How much transparency is good for cooperation?," Journal of Behavioral and Experimental Economics (formerly The Journal of Socio-Economics), Elsevier, vol. 77(C), pages 88-95.
    16. Richter, Andries & Grasman, Johan, 2013. "The transmission of sustainable harvesting norms when agents are conditionally cooperative," Ecological Economics, Elsevier, vol. 93(C), pages 202-209.
    17. Raúl López-Pérez & Aldo Ramírez-Almudio, 2020. "Why people give to their governments: The role of outcome-oriented norms," Working Papers 2007, Instituto de Políticas y Bienes Públicos (IPP), CSIC.
    18. Antoni Bosch-Domènech & Joaquim Silvestre, 2017. "The role of frames, numbers and risk in the frequency of cooperation," International Review of Economics, Springer;Happiness Economics and Interpersonal Relations (HEIRS), vol. 64(3), pages 245-267, September.
    19. Bernd Irlenbusch & Janna Ter Meer, 2015. "Lying in public good games with and without punishment," Cologne Graduate School Working Paper Series 06-02, Cologne Graduate School in Management, Economics and Social Sciences.
    20. Raphaële Préget & Phu Nguyen-Van & Marc Willinger, 2016. "Who are the voluntary leaders? Experimental evidence from a sequential contribution game," Theory and Decision, Springer, vol. 81(4), pages 581-599, November.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:apmaco:v:427:y:2022:i:c:s0096300322002569. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: https://www.journals.elsevier.com/applied-mathematics-and-computation .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.