Reinforcement learning explains various conditional cooperation

My bibliography Save this article

Reinforcement learning explains various conditional cooperation

Author

Listed:

Geng, Yini
Liu, Yifan
Lu, Yikang
Shen, Chen
Shi, Lei

Registered:

Abstract

Recent studies show that different update rules are invariant regarding the evolutionary outcomes for a well-mixed population or homogeneous network. In this paper, we investigate how the Q-learning algorithm, one of the reinforcement learning methods, affects the evolutionary outcomes in square lattice. Especially, we consider the mixed strategy update rule, among which some agents adopt Q-learning method to update their strategies, the proportion of these agents (these agents are denoted as Artificial Intelligence (AI)) is controlled by a simple parameter ρ. The rest of other agents, the proportion is denoted by 1 − ρ, adopt the Fermi function to update their strategies. Through extensive numerical simulations, we found that the mixed strategy-update rule can facilitate cooperation compared with the pure Fermi- function-based update rule. Besides, if the proportion of AI is moderate, cooperators among the whole population exhibit conditional behavior and moody conditional behavior. However, if the whole population adopts the pure Fermi-function-based strategy update rule or the pure Q-learning-based strategy update rule, then cooperators among the whole population exhibit the hump-shaped conditional behavior. Our results provide a new insight to understand the evolution of cooperation from AI's view.

Suggested Citation

Geng, Yini & Liu, Yifan & Lu, Yikang & Shen, Chen & Shi, Lei, 2022. "Reinforcement learning explains various conditional cooperation," Applied Mathematics and Computation, Elsevier, vol. 427(C).

Handle: RePEc:eee:apmaco:v:427:y:2022:i:c:s0096300322002569
DOI: 10.1016/j.amc.2022.127182

Download full text from publisher

As the access to this document is restricted, you may want to search for a different version of it.

References listed on IDEAS

Li, Zhibin & Jia, Danyang & Guo, Hao & Geng, Yini & Shen, Chen & Wang, Zhen & Li, Xuelong, 2019. "The effect of multigame on cooperation in spatial network," Applied Mathematics and Computation, Elsevier, vol. 351(C), pages 162-167.
Ananish Chaudhuri & Tirnud Paichayontvijit, 2006. "Conditional cooperation and voluntary contributions to a public good," Economics Bulletin, AccessEcon, vol. 3(8), pages 1-14.
Hu, Mao-Bin & Jiang, Rui & Wu, Qing-Song & Wu, Yong-Hong, 2007. "Simulating the wealth distribution with a Richest-Following strategy on scale-free network," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 381(C), pages 467-472.
Fischbacher, Urs & Gachter, Simon & Fehr, Ernst, 2001. "Are people conditionally cooperative? Evidence from a public goods experiment," Economics Letters, Elsevier, vol. 71(3), pages 397-404, June.
- Urs Fischbacher & Simon Gaechter & Ernst Fehr, "undated". "Are People Conditionally Cooperative? Evidence from a Public Goods Experiment," IEW - Working Papers 016, Institute for Empirical Research in Economics - University of Zurich.
Claudia Keser & Frans Van Winden, 2000. "Conditional Cooperation and Voluntary Contributions to Public Goods," Scandinavian Journal of Economics, Wiley Blackwell, vol. 102(1), pages 23-39, March.
- Claudia Keser & Frans A.A.M. van Winden, 2000. "Conditional Cooperation and Voluntary Contributions to Public Goods," Tinbergen Institute Discussion Papers 00-011/1, Tinbergen Institute.
Makowsky, Michael D. & Orman, Wafa Hakim & Peart, Sandra J., 2014. "Playing with other people's money: Contributions to public goods by trustees," Journal of Behavioral and Experimental Economics (formerly The Journal of Socio-Economics), Elsevier, vol. 53(C), pages 44-55.
Takahiro Ezaki & Yutaka Horita & Masanori Takezawa & Naoki Masuda, 2016. "Reinforcement Learning Explains Conditional Cooperation and Its Moody Cousin," PLOS Computational Biology, Public Library of Science, vol. 12(7), pages 1-13, July.
V. S. Borkar, 2002. "Reinforcement Learning In Markovian Evolutionary Games," Advances in Complex Systems (ACS), World Scientific Publishing Co. Pte. Ltd., vol. 5(01), pages 55-72.
Muller, Laurent & Sefton, Martin & Steinberg, Richard & Vesterlund, Lise, 2008. "Strategic behavior and learning in repeated voluntary contribution experiments," Journal of Economic Behavior & Organization, Elsevier, vol. 67(3-4), pages 782-793, September.
- Laurent Muller & Martin Sefton & Richard Steinberg & Lise Vesterlund, 2005. "Strategic Behavior and Learning in Repeated Voluntary-Contribution Experiments," Discussion Papers 2005-13, The Centre for Decision Research and Experimental Economics, School of Economics, University of Nottingham.
- Laurent Muller & Martin Sefton & Richard Steinberg & Lise Vesterlund, 2008. "Strategic Behavior and Learning in Repeated Voluntary-Contribution Experiments," Post-Print hal-00614682, HAL.
repec:ebl:ecbull:v:3:y:2006:i:8:p:1-14 is not listed on IDEAS
Dui, Hongyan & Meng, Xueyu & Xiao, Hui & Guo, Jianjun, 2020. "Analysis of the cascading failure for scale-free networks based on a multi-strategy evolutionary game," Reliability Engineering and System Safety, Elsevier, vol. 199(C).
Bruno S. Frey & Stephan Meier, 2004. "Social Comparisons and Pro-social Behavior: Testing "Conditional Cooperation" in a Field Experiment," American Economic Review, American Economic Association, vol. 94(5), pages 1717-1722, December.
- Bruno S. Frey & Stephan Meier, "undated". "Social Comparisons and Pro-social Behavior - Testing "Conditional Cooperation" in a Field Experiment," IEW - Working Papers 162, Institute for Empirical Research in Economics - University of Zurich.
K. M. Ariful Kabir & Jun Tanimoto & Zhen Wang, 2018. "Influence of bolstering network reciprocity in the evolutionary spatial Prisoner’s Dilemma game: a perspective," The European Physical Journal B: Condensed Matter and Complex Systems, Springer;EDP Sciences, vol. 91(12), pages 1-10, December.
Saral, Ali Seyhun, 2020. "Evolution of Conditional Cooperation in Prisoner's Dilemma," OSF Preprints wcpkz, Center for Open Science.
Oriol Vinyals & Igor Babuschkin & Wojciech M. Czarnecki & Michaël Mathieu & Andrew Dudzik & Junyoung Chung & David H. Choi & Richard Powell & Timo Ewalds & Petko Georgiev & Junhyuk Oh & Dan Horgan & M, 2019. "Grandmaster level in StarCraft II using multi-agent reinforcement learning," Nature, Nature, vol. 575(7782), pages 350-354, November.
Jin Zhang & Dietmar Maringer, 2016. "Using a Genetic Algorithm to Improve Recurrent Reinforcement Learning for Equity Trading," Computational Economics, Springer;Society for Computational Economics, vol. 47(4), pages 551-567, April.
Guo, Tian & Guo, Mi & Zhang, Yan & Liang, Shuanglu, 2019. "The effect of aspiration on the evolution of cooperation in spatial multigame," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 525(C), pages 27-32.
Karen Evelyn Hauge & Kjell Arne Brekke & Karine Nyborg & Jo Thori Lind, 2019. "Sustaining cooperation through self-sorting: The good, the bad, and the conditional," Proceedings of the National Academy of Sciences, Proceedings of the National Academy of Sciences, vol. 116(12), pages 5299-5304, March.
Hongyan Cheng & Qionglin Dai & Haihong Li & Xiaolan Qian & Mei Zhang & Junzhong Yang, 2013. "Effects of directional migration on prisoner’s dilemma game in a square domain," The European Physical Journal B: Condensed Matter and Complex Systems, Springer;EDP Sciences, vol. 86(4), pages 1-6, April.
Bruno Frey & Stephan Meier, 2004. "In a field experiment," Natural Field Experiments 00243, The Field Experiments Website.
Tanimoto, Jun, 2015. "The impact of initial cooperation fraction on the evolutionary fate in a spatial prisoner's dilemma game," Applied Mathematics and Computation, Elsevier, vol. 263(C), pages 171-188.
Nagashima, Keisuke & Tanimoto, Jun, 2019. "A stochastic Pairwise Fermi rule modified by utilizing the average in payoff differences of neighbors leads to increased network reciprocity in spatial prisoner's dilemma games," Applied Mathematics and Computation, Elsevier, vol. 361(C), pages 661-669.

Full references (including those not matched with items on IDEAS)

Citations

Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.

Cited by:

Jiang, Bei & Yuan, Lin & Zou, Rongcheng & Su, Rui & Mi, Yuqiang, 2023. "The effect of migration on vaccination dilemma in networked populations," Chaos, Solitons & Fractals, Elsevier, vol. 170(C).
Di, Changyan & Zhou, Qingguo & Shen, Jun & Wang, Jinqiang & Zhou, Rui & Wang, Tianyi, 2023. "The coupling effect between the environment and strategies drives the emergence of group cooperation," Chaos, Solitons & Fractals, Elsevier, vol. 176(C).
Yang, Zhengzhi & Zheng, Lei & Perc, Matjaž & Li, Yumeng, 2024. "Interaction state Q-learning promotes cooperation in the spatial prisoner's dilemma game," Applied Mathematics and Computation, Elsevier, vol. 463(C).

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Simon Gaechter, 2006. "Conditional cooperation: Behavioral regularities from the lab and the field and their policy implications," Discussion Papers 2006-03, The Centre for Decision Research and Experimental Economics, School of Economics, University of Nottingham.
- Simon Gaechter, 2006. "Conditional cooperation: Behavioral regularities from the lab and the field and their policy implications," Discussion Papers 2006-03, The Centre for Decision Research and Experimental Economics, School of Economics, University of Nottingham.
Urs Fischbacher & Simon Gachter, 2010. "Social Preferences, Beliefs, and the Dynamics of Free Riding in Public Goods Experiments," American Economic Review, American Economic Association, vol. 100(1), pages 541-556, March.
- Urs Fischbacher & Simon Gaechter, 2008. "Social Preferences, Beliefs, and the Dynamics of Free Riding in Public Good Experiments," CESifo Working Paper Series 2491, CESifo.
- Urs Fischbacher & Simon Gaechter, 2009. "Social Preferences, Beliefs, and the Dynamics of Free Riding in Public Good Experiments," Discussion Papers 2009-04, The Centre for Decision Research and Experimental Economics, School of Economics, University of Nottingham.
Urs Fischbacher & Simon Gï¿½chter, 2005. "Heterogeneous social preferences and the dynamics of free riding in public goods," IEW - Working Papers 261, Institute for Empirical Research in Economics - University of Zurich.
- Urs Fischbacher & Simon Gaechter, 2006. "Heterogeneous social preferences and the dynamics of free riding in public goods," Discussion Papers 2006-01, The Centre for Decision Research and Experimental Economics, School of Economics, University of Nottingham.
- Fischbacher, Urs & Gächter, Simon, 2006. "Heterogeneous Social Preferences and the Dynamics of Free Riding in Public Goods," IZA Discussion Papers 2011, Institute of Labor Economics (IZA).
M. Vittoria Levati & Ro’i Zultan, 2011. "Cycles of Conditional Cooperation in a Real-Time Voluntary Contribution Mechanism," Games, MDPI, vol. 2(1), pages 1-15, January.
- M. Vittoria Levati & Ro'i Zultan, 2009. "Cycles of conditional cooperation in a real-time voluntary contribution mechanism," Jena Economics Research Papers 2009-029, Friedrich-Schiller-University Jena.
- M. Vittoria Levati & Ro'i Zultan, 2009. "Cycles of Conditional Cooperation in a Real-Time Voluntary Contribution Mechanism," Discussion Paper Series dp511, The Federmann Center for the Study of Rationality, the Hebrew University, Jerusalem.
Simon Gaechter & Elke Renner, 2014. "Leaders as Role Models for the Voluntary Provision of Public Goods," Discussion Papers 2014-11, The Centre for Decision Research and Experimental Economics, School of Economics, University of Nottingham.
- Simon Gaechter & Elke Renner, 2014. "Leaders as Role Models for the Voluntary Provision of Public Goods," CESifo Working Paper Series 5049, CESifo.
- Gächter, Simon & Renner, Elke, 2014. "Leaders as Role Models for the Voluntary Provision of Public Goods," IZA Discussion Papers 8580, Institute of Labor Economics (IZA).
Simon Gaechter & Daniele Nosenzo & Elke Renner & Martin Sefton, 2009. "Sequential versus simultaneous contributions to public goods: Experimental evidence," Discussion Papers 2009-07, The Centre for Decision Research and Experimental Economics, School of Economics, University of Nottingham.
- Simon Gaechter & Daniele Nosenzo & Elke Renner & Martin Sefton, 2009. "Sequential versus Simultaneous Contributions to Public Goods: Experimental Evidence," CESifo Working Paper Series 2602, CESifo.
- Simon Gaechter & Daniele Nosenzo & Elke Renner & Martin Sefton, 2009. "Sequential versus simultaneous contributions to public goods: Experimental evidence," Discussion Papers 2009-17, The Centre for Decision Research and Experimental Economics, School of Economics, University of Nottingham.
Yin, Xile & Li, Jianbiao & Bao, Te, 2019. "Does overconfidence promote cooperation? Theory and experimental evidence," Journal of Behavioral and Experimental Economics (formerly The Journal of Socio-Economics), Elsevier, vol. 79(C), pages 119-133.
Simon Gaechter & Daniele Nosenzo & Elke Renner & Martin Sefton, 2009. "Sequential versus simultaneous contributions to public goods: Experimental evidence," Discussion Papers 2009-07, The Centre for Decision Research and Experimental Economics, School of Economics, University of Nottingham.
- Simon Gaechter & Daniele Nosenzo & Elke Renner & Martin Sefton, 2009. "Sequential versus simultaneous contributions to public goods: Experimental evidence," Discussion Papers 2009-07, The Centre for Decision Research and Experimental Economics, School of Economics, University of Nottingham.
- Simon Gaechter & Daniele Nosenzo & Elke Renner & Martin Sefton, 2009. "Sequential versus simultaneous contributions to public goods: Experimental evidence," Discussion Papers 2009-17, The Centre for Decision Research and Experimental Economics, School of Economics, University of Nottingham.
- Simon Gaechter & Daniele Nosenzo & Elke Renner & Martin Sefton, 2009. "Sequential versus simultaneous contributions to public goods: Experimental evidence," Discussion Papers 2009-17, The Centre for Decision Research and Experimental Economics, School of Economics, University of Nottingham.
- Simon Gaechter & Daniele Nosenzo & Elke Renner & Martin Sefton, 2009. "Sequential versus Simultaneous Contributions to Public Goods: Experimental Evidence," CESifo Working Paper Series 2602, CESifo.
Gächter, Simon & Renner, Elke, 2018. "Leaders as role models and ‘belief managers’ in social dilemmas," Journal of Economic Behavior & Organization, Elsevier, vol. 154(C), pages 321-334.
Josephine G. Gatua, 2021. "Information and cooperation in preventive health behavior: The case of bed net use in rural Kenya," Health Economics, John Wiley & Sons, Ltd., vol. 30(9), pages 2124-2143, September.
Ananish Chaudhuri, 2011. "Sustaining cooperation in laboratory public goods experiments: a selective survey of the literature," Experimental Economics, Springer;Economic Science Association, vol. 14(1), pages 47-83, March.
Drouvelis, Michalis & Marx, Benjamin M., 2022. "Can charitable appeals identify and exploit belief heterogeneity?," Journal of Economic Behavior & Organization, Elsevier, vol. 198(C), pages 631-649.
- Michalis Drouvelis & Benjamin M. Marx, 2021. "Can Charitable Appeals Identify and Exploit Belief Heterogeneity?," CESifo Working Paper Series 8855, CESifo.
Dong, Bin & Dulleck, Uwe & Torgler, Benno, 2012. "Conditional corruption," Journal of Economic Psychology, Elsevier, vol. 33(3), pages 609-627.
- Bin Dong & Uwe Dulleck & Benno Torgler, 2008. "Conditional Corruption," CREMA Working Paper Series 2008-29, Center for Research in Economics, Management and the Arts (CREMA).
Thorsten Chmura & Christoph Engel & Markus Englerth, 2013. "Selfishness As a Potential Cause of Crime. A Prison Experiment," Discussion Paper Series of the Max Planck Institute for Research on Collective Goods 2013_05, Max Planck Institute for Research on Collective Goods.
Vyrastekova, Jana & Funaki, Yukihiko, 2018. "Cooperation in a sequential dilemma game: How much transparency is good for cooperation?," Journal of Behavioral and Experimental Economics (formerly The Journal of Socio-Economics), Elsevier, vol. 77(C), pages 88-95.
Richter, Andries & Grasman, Johan, 2013. "The transmission of sustainable harvesting norms when agents are conditionally cooperative," Ecological Economics, Elsevier, vol. 93(C), pages 202-209.
- Andries Richter & Johan Grasman, 2013. "The Transmission of Sustainable Harvesting Norms When Agents Are Conditionally Cooperative," Working Papers 2013.80, Fondazione Eni Enrico Mattei.
- Richter, Andries & Grasman, Johan, 2013. "The Transmission of Sustainable Harvesting Norms When Agents Are Conditionally Cooperative," Economy and Society 158733, Fondazione Eni Enrico Mattei (FEEM).
Raúl López-Pérez & Aldo Ramírez-Almudio, 2020. "Why people give to their governments: The role of outcome-oriented norms," Working Papers 2007, Instituto de Políticas y Bienes Públicos (IPP), CSIC.
Antoni Bosch-Domènech & Joaquim Silvestre, 2017. "The role of frames, numbers and risk in the frequency of cooperation," International Review of Economics, Springer;Happiness Economics and Interpersonal Relations (HEIRS), vol. 64(3), pages 245-267, September.
Bernd Irlenbusch & Janna Ter Meer, 2015. "Lying in public good games with and without punishment," Cologne Graduate School Working Paper Series 06-02, Cologne Graduate School in Management, Economics and Social Sciences.
Raphaële Préget & Phu Nguyen-Van & Marc Willinger, 2016. "Who are the voluntary leaders? Experimental evidence from a sequential contribution game," Theory and Decision, Springer, vol. 81(4), pages 581-599, November.
- Raphaële Préget & Phu Nguyen-Van & Marc Willinger, 2012. "Who are the Voluntary Leaders? Experimental Evidence from a Sequential Contribution Game," Working Papers of BETA 2012-21, Bureau d'Economie Théorique et Appliquée, UDS, Strasbourg.
- Raphaële Préget & Phu Nguyen-Van & Marc Willinger, 2012. "Who are the Voluntary Leaders? Experimental Evidence from a Sequential Contribution Game," Working Papers 12-34, LAMETA, Universtiy of Montpellier, revised Nov 2012.
- Raphaële Préget & Phu Nguyen Van & Marc Willinger, 2016. "Who are the voluntary leaders? Experimental evidence from a sequential contribution game," Post-Print hal-01300195, HAL.

More about this item

Keywords

Evolutionary games; Q-learning; Conditional cooperation;
All these keywords.

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:apmaco:v:427:y:2022:i:c:s0096300322002569. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: https://www.journals.elsevier.com/applied-mathematics-and-computation .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Reinforcement learning explains various conditional cooperation

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Citations

Most related items

More about this item

Keywords

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data