IDEAS home Printed from https://ideas.repec.org/a/eee/chsofr/v200y2025ip2s096007792500997x.html

Q-learning promotes the evolution of fairness and generosity in the ultimatum game

Author

Listed:
  • Wu, Binjie
  • Shen, Shaofei
  • Wang, Jiafeng
  • Wan, Haibin

Abstract

The traditional Q-learning algorithm has been widely applied to the study of cooperation in social dilemmas, however, few studies have utilized it in the context of the Ultimatum Game. To address this gap, this paper investigates the evolutionary Ultimatum Game by proposing a strategy-adjustment-based Q-learning algorithm. Through Monte Carlo simulations, we quantitatively confirm the significant influence of sensitivity factors (denoted as βp and βq) on fairness and generosity. Notably, compared to the conventional situation, the introduction of sensitivity factors, especially when βp≫βq, leads to a marked increase in levels of fairness and generosity. Additionally, when βp≪βq, the population gravitates toward empathy-driven strategies, further enhancing fairness. Conversely, we find that when βp and βq are approximately equal, fairness is undermined. These evolutionary dynamics provide deeper insights into the mechanisms underlying fairness and generosity in human behavior.

Suggested Citation

  • Wu, Binjie & Shen, Shaofei & Wang, Jiafeng & Wan, Haibin, 2025. "Q-learning promotes the evolution of fairness and generosity in the ultimatum game," Chaos, Solitons & Fractals, Elsevier, vol. 200(P2).
  • Handle: RePEc:eee:chsofr:v:200:y:2025:i:p2:s096007792500997x
    DOI: 10.1016/j.chaos.2025.116984
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S096007792500997X
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.chaos.2025.116984?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. Güth, Werner & Kocher, Martin G., 2014. "More than thirty years of ultimatum bargaining experiments: Motives, variations, and a survey of the recent literature," Journal of Economic Behavior & Organization, Elsevier, vol. 108(C), pages 396-409.
    2. Zhao, Yakun & Xiong, Tianyu & Zheng, Lei & Li, Yumeng & Chen, Xiaojie, 2020. "The effect of similarity on the evolution of fairness in the ultimatum game," Chaos, Solitons & Fractals, Elsevier, vol. 131(C).
    3. Ernst Fehr & Urs Fischbacher, 2003. "The nature of human altruism," Nature, Nature, vol. 425(6960), pages 785-791, October.
    4. Zhang, Huizhen & An, Tianbo & Yan, Pingping & Hu, Kaipeng & An, Jinjin & Shi, Lijuan & Zhao, Jian & Wang, Jingrui, 2024. "Exploring cooperative evolution with tunable payoff’s loners using reinforcement learning," Chaos, Solitons & Fractals, Elsevier, vol. 178(C).
    5. Huck, Steffen & Oechssler, Jorg, 1999. "The Indirect Evolutionary Approach to Explaining Fair Allocations," Games and Economic Behavior, Elsevier, vol. 28(1), pages 13-24, July.
    6. Deng, Lili & Wang, Rugen & Liao, Ying & Xu, Ronghua & Wang, Cheng, 2025. "The reputation-based reward mechanism promotes the evolution of fairness," Applied Mathematics and Computation, Elsevier, vol. 486(C).
    7. Guth, Werner & Schmittberger, Rolf & Schwarze, Bernd, 1982. "An experimental analysis of ultimatum bargaining," Journal of Economic Behavior & Organization, Elsevier, vol. 3(4), pages 367-388, December.
    8. Volodymyr Mnih & Koray Kavukcuoglu & David Silver & Andrei A. Rusu & Joel Veness & Marc G. Bellemare & Alex Graves & Martin Riedmiller & Andreas K. Fidjeland & Georg Ostrovski & Stig Petersen & Charle, 2015. "Human-level control through deep reinforcement learning," Nature, Nature, vol. 518(7540), pages 529-533, February.
    9. Shen, Shaofei & Zhang, Xuejun & Xu, Aobo & Duan, Taisen, 2024. "An adaptive exploration mechanism for Q-learning in spatial public goods games," Chaos, Solitons & Fractals, Elsevier, vol. 189(P1).
    10. Vriend, Nicolaas J., 1996. "Rational behavior and economic theory," Journal of Economic Behavior & Organization, Elsevier, vol. 29(2), pages 263-285, March.
    11. Perc, Matjaž & Grigolini, Paolo, 2013. "Collective behavior and evolutionary games – An introduction," Chaos, Solitons & Fractals, Elsevier, vol. 56(C), pages 1-5.
    12. Geng, Yini & Liu, Yifan & Lu, Yikang & Shen, Chen & Shi, Lei, 2022. "Reinforcement learning explains various conditional cooperation," Applied Mathematics and Computation, Elsevier, vol. 427(C).
    13. Yanling Zhang & Jian Liu & Aming Li, 2019. "Effects of Empathy on the Evolutionary Dynamics of Fairness in Group-Structured Systems," Complexity, Hindawi, vol. 2019, pages 1-13, November.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Deng, Lili & Wang, Rugen & Liao, Ying & Xu, Ronghua & Wang, Cheng, 2025. "The reputation-based reward mechanism promotes the evolution of fairness," Applied Mathematics and Computation, Elsevier, vol. 486(C).
    2. Deng, Lili & Wang, Hongsi & Wang, Rugen & Xu, Ronghua & Wang, Cheng, 2024. "The adaptive adjustment of node weights based on reputation and memory promotes fairness," Chaos, Solitons & Fractals, Elsevier, vol. 180(C).
    3. Zhang, Yanling & Yang, Shuo & Chen, Xiaojie & Bai, Yanbing & Xie, Guangming, 2023. "Reputation update of responders efficiently promotes the evolution of fairness in the ultimatum game," Chaos, Solitons & Fractals, Elsevier, vol. 169(C).
    4. Xiaofeng Wang & Xiaojie Chen & Long Wang, 2020. "Evolution of egalitarian social norm by resource management," PLOS ONE, Public Library of Science, vol. 15(1), pages 1-16, January.
    5. Pau Juan-Bartroli & Jos'e Ignacio Rivero-Wildemauwe, 2025. "Social preferences or moral concerns: What drives rejections in the Ultimatum game?," Papers 2510.22086, arXiv.org.
    6. Deng, Lili & Li, Weiwei & Wang, Rugen & Wang, Cheng, 2025. "The impact of reputation-based dynamic reward mechanism on the evolution of fairness," Chaos, Solitons & Fractals, Elsevier, vol. 199(P3).
    7. Konrad, Kai A. & Morath, Florian, 2016. "Bargaining with incomplete information: Evolutionary stability in finite populations," Journal of Mathematical Economics, Elsevier, vol. 65(C), pages 118-131.
    8. Gagen, Michael, 2013. "Isomorphic Strategy Spaces in Game Theory," MPRA Paper 46176, University Library of Munich, Germany.
    9. Ou Li & Yan Shi & Kuangran Li, 2025. "Red, rather than blue can promote fairness in decision-making," Humanities and Social Sciences Communications, Palgrave Macmillan, vol. 12(1), pages 1-10, December.
    10. Mohamed I. Gomaa & Stuart Mestelman & Mohamed Shehata, 2014. "Social Distance, Reputation, Risk Attitude, Value Orientation and Equity in Economic Exchanges," Department of Economics Working Papers 2014-07, McMaster University.
    11. Xie, Kai & Szolnoki, Attila, 2026. "Reinforcement learning in evolutionary game theory: A brief review of recent developments," Applied Mathematics and Computation, Elsevier, vol. 510(C).
    12. Capraro, Valerio & Rodriguez-Lara, Ismael, 2025. "Moral preferences in ultimatum and impunity games," Journal of Behavioral and Experimental Economics (formerly The Journal of Socio-Economics), Elsevier, vol. 117(C).
    13. Sylvie Thoron, 2016. "Morality Beyond Social Preferences: Smithian Sympathy, Social Neuroscience and the Nature of Social Consciousness [La moralité au delà des préférences sociales. La sympathie Smithienne, les neurosciences sociales et la nature d’une conscience soci," Post-Print hal-01645043, HAL.
    14. Lambsdorff, Johann Graf & Grubiak, Kevin & Werner, Katharina, 2023. "Intrinsic Motivation vs. Corruption? Experimental Evidence on the Performance of Officials," MPRA Paper 118153, University Library of Munich, Germany.
    15. Christian Korth, 2009. "Reciprocity—An Indirect Evolutionary Analysis," Lecture Notes in Economics and Mathematical Systems, in: Fairness in Bargaining and Markets, chapter 0, pages 35-55, Springer.
    16. Noemí Navarro & Róbert F. Veszteg, 2025. "How robust is the equal split? Transferable utility and three-person bargaining in the laboratory," The Journal of Economic Inequality, Springer;Society for the Study of Economic Inequality, vol. 23(3), pages 909-931, September.
    17. Fabian Dvorak & Regina Stumpf & Sebastian Fehrler & Urs Fischbacher, 2024. "Generative AI Triggers Welfare-Reducing Decisions in Humans," Papers 2401.12773, arXiv.org.
    18. Wang, Lu & Ye, Shun-qiang & Jones, Michael C. & Ye, Ye & Wang, Meng & Xie, Neng-gang, 2015. "The evolutionary analysis of the ultimatum game based on the net-profit decision," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 430(C), pages 32-38.
    19. Cárdenas Juan Camilo & Casas-Casas Andrés & Méndez Nathalie Méndez, 2014. "The Hidden Face of Justice: Fairness, Discrimination and Distribution in Transitional Justice Processes," Peace Economics, Peace Science, and Public Policy, De Gruyter, vol. 20(1), pages 33-60, January.
    20. M. Christian Lehmann, 2022. "Fairness preferences as a cause of inefficient war," Journal of Behavioral Economics for Policy, Society for the Advancement of Behavioral Economics (SABE), vol. 6(1), pages 33-36, December.

    More about this item

    Keywords

    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:chsofr:v:200:y:2025:i:p2:s096007792500997x. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Thayer, Thomas R. (email available below). General contact details of provider: https://www.journals.elsevier.com/chaos-solitons-and-fractals .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.