IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0102840.html
   My bibliography  Save this article

Evolution with Reinforcement Learning in Negotiation

Author

Listed:
  • Yi Zou
  • Wenjie Zhan
  • Yuan Shao

Abstract

Adaptive behavior depends less on the details of the negotiation process and makes more robust predictions in the long term as compared to in the short term. However, the extant literature on population dynamics for behavior adjustment has only examined the current situation. To offset this limitation, we propose a synergy of evolutionary algorithm and reinforcement learning to investigate long-term collective performance and strategy evolution. The model adopts reinforcement learning with a tradeoff between historical and current information to make decisions when the strategies of agents evolve through repeated interactions. The results demonstrate that the strategies in populations converge to stable states, and the agents gradually form steady negotiation habits. Agents that adopt reinforcement learning perform better in payoff, fairness, and stableness than their counterparts using classic evolutionary algorithm.

Suggested Citation

  • Yi Zou & Wenjie Zhan & Yuan Shao, 2014. "Evolution with Reinforcement Learning in Negotiation," PLOS ONE, Public Library of Science, vol. 9(7), pages 1-7, July.
  • Handle: RePEc:plo:pone00:0102840
    DOI: 10.1371/journal.pone.0102840
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0102840
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0102840&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0102840?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Iain D. Couzin & Jens Krause & Nigel R. Franks & Simon A. Levin, 2005. "Effective leadership and decision-making in animal groups on the move," Nature, Nature, vol. 433(7025), pages 513-516, February.
    2. Jing Han & Lin Wang, 2013. "Nondestructive Intervention to Multi-Agent Systems through an Intelligent Agent," PLOS ONE, Public Library of Science, vol. 8(5), pages 1-11, May.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Simon Levin & Anastasios Xepapadeas, 2021. "On the Coevolution of Economic and Ecological Systems," Annual Review of Resource Economics, Annual Reviews, vol. 13(1), pages 355-377, October.
    2. Becco, Ch. & Vandewalle, N. & Delcourt, J. & Poncin, P., 2006. "Experimental evidences of a structural and dynamical transition in fish school," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 367(C), pages 487-493.
    3. Long-Hai Wang & Alexander Ulrich Ernst & Duo An & Ashim Kumar Datta & Boris Epel & Mrignayani Kotecha & Minglin Ma, 2021. "A bioinspired scaffold for rapid oxygenation of cell encapsulation systems," Nature Communications, Nature, vol. 12(1), pages 1-16, December.
    4. Richard P Mann, 2011. "Bayesian Inference for Identifying Interaction Rules in Moving Animal Groups," PLOS ONE, Public Library of Science, vol. 6(8), pages 1-10, August.
    5. Ma, Jian & Song, Wei-guo & Zhang, Jun & Lo, Siu-ming & Liao, Guang-xuan, 2010. "k-Nearest-Neighbor interaction induced self-organized pedestrian counter flow," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 389(10), pages 2101-2117.
    6. Andrew Hoegh & Frank T. Manen & Mark Haroldson, 2021. "Agent-Based Models for Collective Animal Movement: Proximity-Induced State Switching," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 26(4), pages 560-579, December.
    7. Tamás Nepusz & Tamás Vicsek, 2013. "Hierarchical Self-Organization of Non-Cooperating Individuals," PLOS ONE, Public Library of Science, vol. 8(12), pages 1-9, December.
    8. Amos Korman & Efrat Greenwald & Ofer Feinerman, 2014. "Confidence Sharing: An Economic Strategy for Efficient Information Flows in Animal Groups," PLOS Computational Biology, Public Library of Science, vol. 10(10), pages 1-10, October.
    9. Roy Harpaz & Minh Nguyet Nguyen & Armin Bahl & Florian Engert, 2021. "Precise visuomotor transformations underlying collective behavior in larval zebrafish," Nature Communications, Nature, vol. 12(1), pages 1-14, December.
    10. Li, Qing & Zhang, Lingwei & Jia, Yongnan & Lu, Tianzhao & Chen, Xiaojie, 2022. "Modeling, analysis, and optimization of three-dimensional restricted visual field metric-free swarms," Chaos, Solitons & Fractals, Elsevier, vol. 157(C).
    11. Mathew Titus & George Hagstrom & James R Watson, 2021. "Unsupervised manifold learning of collective behavior," PLOS Computational Biology, Public Library of Science, vol. 17(2), pages 1-20, February.
    12. Sophie Lardy & Daniel Fortin & Olivier Pays, 2016. "Increased Exploration Capacity Promotes Group Fission in Gregarious Foraging Herbivores," PLOS ONE, Public Library of Science, vol. 11(12), pages 1-14, December.
    13. Fan, Kangqi & Pedrycz, Witold, 2016. "Opinion evolution influenced by informed agents," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 462(C), pages 431-441.
    14. De Rosis, Alessandro, 2014. "Hydrodynamic effects on a predator approaching a group of preys," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 414(C), pages 329-339.
    15. Shao, Zhi-Gang & Yang, Yan-Yan, 2015. "Effective strategies of collective evacuation from an enclosed space," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 427(C), pages 34-39.
    16. Panpan Yang & Maode Yan & Jiacheng Song & Ye Tang, 2019. "Self-Organized Fission-Fusion Control Algorithm for Flocking Systems Based on Intermittent Selective Interaction," Complexity, Hindawi, vol. 2019, pages 1-12, February.
    17. Huepe, Cristián & Aldana, Maximino, 2008. "New tools for characterizing swarming systems: A comparison of minimal models," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 387(12), pages 2809-2822.
    18. Federico Pratissoli & Andreagiovanni Reina & Yuri Kaszubowski Lopes & Carlo Pinciroli & Genki Miyauchi & Lorenzo Sabattini & Roderich Groß, 2023. "Coherent movement of error-prone individuals through mechanical coupling," Nature Communications, Nature, vol. 14(1), pages 1-13, December.
    19. Milad Haghani & Majid Sarvi & Zahra Shahhoseini & Maik Boltes, 2016. "How Simple Hypothetical-Choice Experiments Can Be Utilized to Learn Humans’ Navigational Escape Decisions in Emergencies," PLOS ONE, Public Library of Science, vol. 11(11), pages 1-24, November.
    20. Kim, Jong-Ho & Park, Jea-Hyun, 2022. "Clustering phenomenon of the singular Cucker–Smale model with finite communication weight and variable coupling strength," Chaos, Solitons & Fractals, Elsevier, vol. 164(C).

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0102840. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.