IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0313951.html

How social reinforcement learning can lead to metastable polarisation and the voter model

Author

Listed:
  • Benedikt V Meylahn
  • Janusz M Meylahn

Abstract

Previous explanations for the persistence of opinion polarization have typically relied on modelling assumptions that build in the possibility of polarization from the outset (i.e., assumptions that allow a pair of agents to drift apart in their opinions, such as repulsive interactions or bounded confidence). An exception is a recent simulation study showing that polarization persists when agents form their opinions using social reinforcement learning. Our goal is to highlight the usefulness of reinforcement learning for modelling opinion dynamics, while showing that caution is required when selecting the tools used to study such a model. We show that the polarization observed in the model of the simulation study cannot persist indefinitely: the model reaches consensus asymptotically with probability one. By constructing a link between the reinforcement learning model and the voter model, we argue that the observed polarization is metastable. Finally, we show that a slight modification of the agents' learning process changes the model from non-ergodic to ergodic. Our results show that reinforcement learning may be a powerful method for modelling polarization in opinion dynamics, but that the tools appropriate for analysing such models (objects of study such as the stationary distribution or the time to absorption) depend crucially on the models' properties (such as ergodicity or transience). These properties are determined by the details of the learning process and may be difficult to identify from simulations alone.
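The voter model that the abstract links to the reinforcement-learning dynamics can be illustrated with a minimal simulation sketch. This is not the authors' model or code; it is a standard voter model on a complete graph, written here only to show the absorption-into-consensus behaviour the abstract refers to (the function name `voter_model` and all parameter choices are illustrative assumptions):

```python
import random

def voter_model(n=20, steps=200000, seed=0):
    """Simulate the voter model on a complete graph of n agents.

    Each agent holds opinion 0 or 1. At every step a randomly chosen
    agent copies the opinion of another randomly chosen agent. The two
    consensus states (all 0, all 1) are absorbing, so a finite
    population reaches consensus with probability one; any polarized
    state can only be metastable.
    """
    rng = random.Random(seed)
    opinions = [rng.randint(0, 1) for _ in range(n)]
    for t in range(steps):
        total = sum(opinions)
        if total == 0 or total == n:  # absorbed: consensus reached
            return opinions, t
        i, j = rng.sample(range(n), 2)  # two distinct agents
        opinions[i] = opinions[j]       # agent i copies agent j
    return opinions, steps

opinions, t = voter_model()
print("consensus:", len(set(opinions)) == 1, "after", t, "steps")
```

For small populations the chain is absorbed quickly; the abstract's point is that the reinforcement-learning model inherits this kind of absorbing behaviour, so polarization observed in finite simulations need not persist asymptotically.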

Suggested Citation

  • Benedikt V Meylahn & Janusz M Meylahn, 2024. "How social reinforcement learning can lead to metastable polarisation and the voter model," PLOS ONE, Public Library of Science, vol. 19(12), pages 1-23, December.
  • Handle: RePEc:plo:pone00:0313951
    DOI: 10.1371/journal.pone.0313951

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0313951
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0313951&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0313951?utm_source=ideas

    References listed on IDEAS

    1. Tinggui Chen & Qianqian Li & Peihua Fu & Jianjun Yang & Chonghuan Xu & Guodong Cong & Gongfa Li, 2020. "Public Opinion Polarization by Individual Revenue from the Social Preference Theory," IJERPH, MDPI, vol. 17(3), pages 1-29, February.
    2. Borgers, Tilman & Sarin, Rajiv, 1997. "Learning Through Reinforcement and Replicator Dynamics," Journal of Economic Theory, Elsevier, vol. 77(1), pages 1-14, November.
    3. Sven Banisch & Ricardo Lima & Tanya Araújo, 2012. "Agent based models and opinion dynamics as Markov chains," Working Papers Department of Economics 2012/10, ISEG - Lisbon School of Economics and Management, Department of Economics, Universidade de Lisboa.
    4. Adrián Carro & Raúl Toral & Maxi San Miguel, 2016. "The noisy voter model on complex networks," Papers 1602.06935, arXiv.org, revised Apr 2016.
    5. Guillaume Deffuant & David Neau & Frédéric Amblard & Gérard Weisbuch, 2000. "Mixing beliefs among interacting agents," Advances in Complex Systems (ACS), World Scientific Publishing Co. Pte. Ltd., vol. 3(01n04), pages 87-98.
    6. J. M. Meylahn & L. Janssen & Hassan Zargarzadeh, 2022. "Limiting Dynamics for Q-Learning with Memory One in Symmetric Two-Player, Two-Action Games," Complexity, Hindawi, vol. 2022, pages 1-20, November.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Agnieszka Kowalska-Styczeń & Krzysztof Malarz, 2020. "Noise induced unanimity and disorder in opinion formation," PLOS ONE, Public Library of Science, vol. 15(7), pages 1-22, July.
    2. Lücke, Marvin & Heitzig, Jobst & Koltai, Péter & Molkenthin, Nora & Winkelmann, Stefanie, 2023. "Large population limits of Markov processes on random networks," Stochastic Processes and their Applications, Elsevier, vol. 166(C).
    3. Wu, Dong & Zou, Fan, 2025. "Dominant design selected by users: Dynamic interaction and convergence of users," Technovation, Elsevier, vol. 140(C).
    4. Lee, Woosub & Yang, Seong-Gyu & Kim, Beom Jun, 2022. "The effect of media on opinion formation," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 595(C).
    5. Ponti, Giovanni, 2000. "Cycles of Learning in the Centipede Game," Games and Economic Behavior, Elsevier, vol. 30(1), pages 115-141, January.
    6. Philippe Jehiel & Aviman Satpathy, 2024. "Learning to be Indifferent in Complex Decisions: A Coarse Payoff-Assessment Model," Papers 2412.09321, arXiv.org, revised Dec 2024.
    7. Antonio Cabrales & Rosemarie Nagel & Roc Armenter, 2007. "Equilibrium selection through incomplete information in coordination games: an experimental study," Experimental Economics, Springer;Economic Science Association, vol. 10(3), pages 221-234, September.
    8. Sven Banisch & Eckehard Olbrich, 2021. "An Argument Communication Model of Polarization and Ideological Alignment," Journal of Artificial Societies and Social Simulation, Journal of Artificial Societies and Social Simulation, vol. 24(1), pages 1-1.
    9. Dehai Liu & Hongyi Li & Weiguo Wang & Chuang Zhou, 2015. "Scenario forecast model of long term trends in rural labor transfer based on evolutionary games," Journal of Evolutionary Economics, Springer, vol. 25(3), pages 649-670, July.
    10. Ianni, A., 2002. "Reinforcement learning and the power law of practice: some analytical results," Discussion Paper Series In Economics And Econometrics 203, Economics Division, School of Social Sciences, University of Southampton.
    11. Shang, Lihui & Zhao, Mingming & Ai, Jun & Su, Zhan, 2021. "Opinion evolution in the Sznajd model on interdependent chains," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 565(C).
    12. Hopkins, Ed, 1999. "Learning, Matching, and Aggregation," Games and Economic Behavior, Elsevier, vol. 26(1), pages 79-110, January.
    13. Tsakas, Elias & Voorneveld, Mark, 2009. "The target projection dynamic," Games and Economic Behavior, Elsevier, vol. 67(2), pages 708-719, November.
    14. Floriana Gargiulo & José J Ramasco, 2012. "Influence of Opinion Dynamics on the Evolution of Games," PLOS ONE, Public Library of Science, vol. 7(11), pages 1-7, November.
    15. Lu, Xi & Mo, Hongming & Deng, Yong, 2015. "An evidential opinion dynamics model based on heterogeneous social influential power," Chaos, Solitons & Fractals, Elsevier, vol. 73(C), pages 98-107.
    16. Antonio Morales, 2005. "On the Role of the Group Composition for Achieving Optimality," Annals of Operations Research, Springer, vol. 137(1), pages 387-397, July.
    17. DeJong, D.V. & Blume, A. & Neumann, G., 1998. "Learning in Sender-Receiver Games," Other publications TiSEM 4a8b4f46-f30b-4ad2-bb0c-1, Tilburg University, School of Economics and Management.
    18. Gaunersdorfer, A. & Hommes, C.H. & Wagener, F.O.O., 2000. "Bifurcation Routes to Volatility Clustering," CeNDEF Working Papers 00-04, Universiteit van Amsterdam, Center for Nonlinear Dynamics in Economics and Finance.
    19. Norman, Thomas W.L., 2009. "Rapid evolution under inertia," Games and Economic Behavior, Elsevier, vol. 66(2), pages 865-879, July.
    20. Fernando Lozano & Jaime Lozano & Mario García, 2007. "An artificial economy based on reinforcement learning and agent based modeling," Documentos de Trabajo 3907, Universidad del Rosario.


    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.