On the convergence of reinforcement learning

Citations

Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.

Cited by:

repec:plo:pone00:0208095 is not listed on IDEAS
Beggs, Alan, 2022. "Reference points and learning," Journal of Mathematical Economics, Elsevier, vol. 100(C).
- Alan Beggs, 2015. "Reference Points and Learning," Economics Series Working Papers 767, University of Oxford, Department of Economics.
Maxwell Pak & Bing Xu, 2016. "Generalized reinforcement learning in perfect-information games," International Journal of Game Theory, Springer;Game Theory Society, vol. 45(4), pages 985-1011, November.
Friedman, Daniel & Huck, Steffen & Oprea, Ryan & Weidenholzer, Simon, 2015. "From imitation to collusion: Long-run learning in a low-information environment," Journal of Economic Theory, Elsevier, vol. 155(C), pages 185-205.
- Daniel Friedman & Steffen Huck & Ryan Oprea & Simon Weidenholzer, 2012. "From Imitation to Collusion: Long-run Learning in a Low-Information Environment," Levine's Working Paper Archive 786969000000000457, David K. Levine.
- Friedman, Daniel & Huck, Steffen & Oprea, Ryan & Weidenholzer, Simon, 2012. "From imitation to collusion: Long-run learning in a low-information environment," Discussion Papers, Research Unit: Economics of Change SP II 2012-301r, WZB Berlin Social Science Center.
- Friedman, Daniel & Huck, Steffen & Oprea, Ryan & Weidenholzer, Simon, 2012. "From imitation to collusion: Long-run learning in a low-information environment," Discussion Papers, Research Unit: Economics of Change SP II 2012-301, WZB Berlin Social Science Center.
- Friedman, D & Huck, S & Oprea, R & Weidenholzer, S, 2012. "From Imitation to Collusion: Long-run Learning in a Low-Information Environment," Economics Discussion Papers 8954, University of Essex, Department of Economics.
Köke, Sonja & Lange, Andreas & Nicklisch, Andreas, 2015. "Adversity is a school of wisdomː Experimental evidence on cooperative protection against stochastic losses," WiSo-HH Working Paper Series 22, University of Hamburg, Faculty of Business, Economics and Social Sciences, WISO Research Laboratory.
Mertikopoulos, Panayotis & Sandholm, William H., 2024. "Nested replicator dynamics, nested logit choice, and similarity-based learning," Journal of Economic Theory, Elsevier, vol. 220(C).
Sonja Köke & Andreas Lange & Andreas Nicklisch, 2026. "Cooperative protection against stochastic losses: Experimental evidence on behavioral dynamics," Journal of Evolutionary Economics, Springer, vol. 36(2), pages 1-29, August.
Josephson, Jens, 2008. "A numerical analysis of the evolutionary stability of learning rules," Journal of Economic Dynamics and Control, Elsevier, vol. 32(5), pages 1569-1599, May.
- Josephson, Jens, 2001. "A Numerical Analysis of the Evolutionary Stability of Learning Rules," SSE/EFI Working Paper Series in Economics and Finance 474, Stockholm School of Economics.
Ding, Jieyao & Nicklisch, Andreas, 2013. "On the impulse in impulse learning," Economics Letters, Elsevier, vol. 121(2), pages 294-297.
Nicklisch, Andreas & Köke, Sonja & Lange, Andreas, 2016. "Is Adversity a School of Wisdom? Experimental Evidence on Cooperative Protection Against Stochastic Losses," VfS Annual Conference 2016 (Augsburg): Demographic Change 145716, Verein für Socialpolitik / German Economic Association.
Chmura, Thorsten & Goerg, Sebastian J. & Selten, Reinhard, 2012. "Learning in experimental 2×2 games," Games and Economic Behavior, Elsevier, vol. 76(1), pages 44-73.
- Chmura, Thorsten & Goerg, Sebastian J. & Selten, Reinhard, 2008. "Learning in experimental 2×2 games," Bonn Econ Discussion Papers 18/2008, University of Bonn, Bonn Graduate School of Economics (BGSE).
- Thorsten Chmura & Sebastian Goerg & Reinhard Selten, 2011. "Learning in experimental 2 x 2 games," Discussion Paper Series of the Max Planck Institute for Behavioral Economics 2011_26, Max Planck Institute for Behavioral Economics.
Naoki Funai, 2013. "An Adaptive Learning Model in Coordination Games," Discussion Papers 13-14, Department of Economics, University of Birmingham.
Izquierdo, Luis R. & Izquierdo, Segismundo S. & Gotts, Nicholas M. & Polhill, J. Gary, 2007. "Transient and asymptotic dynamics of reinforcement learning in games," Games and Economic Behavior, Elsevier, vol. 61(2), pages 259-276, November.
Jacques Durieu & Philippe Solal, 2012. "Models of Adaptive Learning in Game Theory," Chapters, in: Richard Arena & Agnès Festré & Nathalie Lazaric (ed.), Handbook of Knowledge and Economics, chapter 11, Edward Elgar Publishing.
- Jacques Durieu & Philippe Solal, 2012. "Models of adaptive learning in game theory," Post-Print halshs-00667674, HAL.
Jieyao Ding & Andreas Nicklisch, 2013. "On the Impulse in Impulse Learning," Discussion Paper Series of the Max Planck Institute for Behavioral Economics 2013_02, Max Planck Institute for Behavioral Economics.
Chernov, G. & Susin, I., 2019. "Models of learning in games: An overview," Journal of the New Economic Association, New Economic Association, vol. 44(4), pages 77-125.
Schuster, Stephan, 2010. "Network Formation with Adaptive Agents," MPRA Paper 27388, University Library of Munich, Germany.
Mele, Antonio & Molnár, Krisztina & Santoro, Sergio, 2020. "On the perils of stabilizing prices when agents are learning," Journal of Monetary Economics, Elsevier, vol. 115(C), pages 339-353.
- Mele, Antonio & Molnar, Krisztina & Santoro, Sergio, 2014. "On the perils of stabilizing prices when agents are learning," Discussion Paper Series in Economics 1/2015, Norwegian School of Economics, Department of Economics.
- Antonio Mele & Krisztina Molnar & Sergio Santoro, 2015. "On the perils of stabilizing prices when agents are learning," School of Economics Discussion Papers 0215, School of Economics, University of Surrey.
- Mele, Antonio & Molnar, Krisztina & Santoro, Sergio, 2018. "On the perils of stabilizing prices when agents are learning," Discussion Paper Series in Economics 22/2018, Norwegian School of Economics, Department of Economics.
- Antonio Mele & Krisztina Molnár & Sergio Santoro, 2015. "On the Perils of Stabilizing Prices when Agents are Learning," CESifo Working Paper Series 5173, CESifo.
Masiliūnas, Aidas, 2023. "Learning in rent-seeking contests with payoff risk and foregone payoff information," Games and Economic Behavior, Elsevier, vol. 140(C), pages 50-72.
Cominetti, Roberto & Melo, Emerson & Sorin, Sylvain, 2010. "A payoff-based learning procedure and its application to traffic games," Games and Economic Behavior, Elsevier, vol. 70(1), pages 71-83, September.
Hopkins, Ed & Posch, Martin, 2005. "Attainability of boundary points under reinforcement learning," Games and Economic Behavior, Elsevier, vol. 53(1), pages 110-125, October.
- Ed Hopkins & Martin Posch, 2003. "Attainability of Boundary Points under Reinforcement Learning," Levine's Working Paper Archive 506439000000000350, David K. Levine.
- Ed Hopkins & Martin Posch, 2003. "Attainability of Boundary Points under Reinforcement Learning," Edinburgh School of Economics Discussion Paper Series 79, Edinburgh School of Economics, University of Edinburgh.
repec:esx:essedp:715 is not listed on IDEAS
Alanyali, Murat, 2010. "A note on adjusted replicator dynamics in iterated games," Journal of Mathematical Economics, Elsevier, vol. 46(1), pages 86-98, January.
Ianni, Antonella, 2014. "Learning strict Nash equilibria through reinforcement," Journal of Mathematical Economics, Elsevier, vol. 50(C), pages 148-155.
- Ianni, Antonella, 2011. "Learning Strict Nash Equilibria through Reinforcement," MPRA Paper 33936, University Library of Munich, Germany.
Hofbauer, Josef & Hopkins, Ed, 2005. "Learning in perturbed asymmetric games," Games and Economic Behavior, Elsevier, vol. 52(1), pages 133-152, July.
- Josef Hofbauer & Ed Hopkins, 2000. "Learning in Perturbed Asymmetric Games," Edinburgh School of Economics Discussion Paper Series 53, Edinburgh School of Economics, University of Edinburgh.
Jaspersen, Johannes G. & Montibeller, Gilberto, 2020. "On the learning patterns and adaptive behavior of terrorist organizations," European Journal of Operational Research, Elsevier, vol. 282(1), pages 221-234.
Albert Banal-Estañol & Augusto Rupérez Micola, 2009. "Composition of Electricity Generation Portfolios, Pivotal Dynamics, and Market Prices," Management Science, INFORMS, vol. 55(11), pages 1813-1831, November.
- Augusto Rupérez-Micola & Albert Banal-Estañol, 2007. "Composition of electricity generation portfolios, pivotal dynamics and market prices," Economics Working Papers 1083, Department of Economics and Business, Universitat Pompeu Fabra.
Filippo Massari & Jonathan Newton, 2026. "Rational beliefs when the truth is not an option," International Journal of Game Theory, Springer;Game Theory Society, vol. 55(1), pages 1-26, June.
Conor Mayo-Wilson & Kevin Zollman & David Danks, 2013. "Wisdom of crowds versus groupthink: learning in groups and in isolation," International Journal of Game Theory, Springer;Game Theory Society, vol. 42(3), pages 695-723, August.
Fortini, Sandra & Petrone, Sonia & Sporysheva, Polina, 2018. "On a notion of partially conditionally identically distributed sequences," Stochastic Processes and their Applications, Elsevier, vol. 128(3), pages 819-846.
Oyarzun, Carlos & Ruf, Johannes, 2014. "Convergence in models with bounded expected relative hazard rates," Journal of Economic Theory, Elsevier, vol. 154(C), pages 229-244.
Naoki Funai, 2019. "Convergence results on stochastic adaptive learning," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 68(4), pages 907-934, November.
March, Christoph, 2019. "The behavioral economics of artificial intelligence: Lessons from experiments with computer players," BERG Working Paper Series 154, Bamberg University, Bamberg Economic Research Group.
- Christoph March, 2019. "The Behavioral Economics of Artificial Intelligence: Lessons from Experiments with Computer Players," CESifo Working Paper Series 7926, CESifo.
Manxi Wu & Saurabh Amin & Asuman Ozdaglar, 2021. "Multi-agent Bayesian Learning with Best Response Dynamics: Convergence and Stability," Papers 2109.00719, arXiv.org.
Naoki Funai, 2013. "An Adaptive Learning Model in Coordination Games," Games, MDPI, vol. 4(4), pages 1-22, November.
Karl D. Lewis & A. J. Shaiju, 2024. "Asymmetric Replicator Dynamics on Polish Spaces: Invariance, Stability, and Convergence," Dynamic Games and Applications, Springer, vol. 14(5), pages 1160-1190, November.
Jonathan Newton, 2018. "Evolutionary Game Theory: A Renaissance," Games, MDPI, vol. 9(2), pages 1-67, May.
March, Christoph, 2021. "Strategic interactions between humans and artificial intelligence: Lessons from experiments with computer players," Journal of Economic Psychology, Elsevier, vol. 87(C).
Ilaria Brunetti & Yezekael Hayel & Eitan Altman, 2018. "State-Policy Dynamics in Evolutionary Games," Dynamic Games and Applications, Springer, vol. 8(1), pages 93-116, March.
Manxi Wu & Saurabh Amin, 2019. "Securing Infrastructure Facilities: When Does Proactive Defense Help?," Dynamic Games and Applications, Springer, vol. 9(4), pages 984-1025, December.
Funai, Naoki, 2022. "Reinforcement learning with foregone payoff information in normal form games," Journal of Economic Behavior & Organization, Elsevier, vol. 200(C), pages 638-660.
Nazaria Solferino & Viviana Solferino & Serena F. Taurino, 2018. "The economics analysis of a Q-learning model of cooperation with punishment and risk taking preferences," Journal of Economic Interaction and Coordination, Springer;Society for Economic Science with Heterogeneous Interacting Agents, vol. 13(3), pages 601-613, October.
Mario Bravo, 2016. "An Adjusted Payoff-Based Procedure for Normal Form Games," Mathematics of Operations Research, INFORMS, vol. 41(4), pages 1469-1483, November.
Enrique Fatas & Antonio J. Morales & Ainhoa Jaramillo-Gutiérrez, 2026. "Social aspiration reinforcement learning in Cournot games," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 81(1), pages 485-524, February.
Georgios Chasparis & Jeff Shamma & Anders Rantzer, 2015. "Nonconvergence to saddle boundary points under perturbed reinforcement learning," International Journal of Game Theory, Springer;Game Theory Society, vol. 44(3), pages 667-699, August.
Panayotis Mertikopoulos & William H. Sandholm, 2016. "Learning in Games via Reinforcement and Regularization," Mathematics of Operations Research, INFORMS, vol. 41(4), pages 1297-1324, November.
Benoit Duvocelle & Panayotis Mertikopoulos & Mathias Staudigl & Dries Vermeulen, 2023. "Multiagent Online Learning in Time-Varying Games," Mathematics of Operations Research, INFORMS, vol. 48(2), pages 914-941, May.
Erik Mohlin & Robert Ostling & Joseph Tao-yi Wang, 2014. "Learning by Imitation in Games: Theory, Field, and Laboratory," Economics Series Working Papers 734, University of Oxford, Department of Economics.
Mario Bravo & Mathieu Faure, 2013. "Reinforcement Learning with Restrictions on the Action Set," AMSE Working Papers 1335, Aix-Marseille School of Economics, France, revised 01 Jul 2013.
- Mario Bravo & Mathieu Faure, 2015. "Reinforcement Learning with Restrictions on the Action Set," Post-Print hal-01457301, HAL.
Roger Waldeck & Eric Darmon, 2006. "Can boundedly rational sellers learn to play Nash?," Journal of Economic Interaction and Coordination, Springer;Society for Economic Science with Heterogeneous Interacting Agents, vol. 1(2), pages 147-169, November.
Leslie, David S. & Collins, E.J., 2006. "Generalised weakened fictitious play," Games and Economic Behavior, Elsevier, vol. 56(2), pages 285-298, August.
Manxi Wu & Saurabh Amin & Asuman Ozdaglar, 2025. "Convergence and Stability of Coupled Belief-Strategy Learning Dynamics in Continuous Games," Mathematics of Operations Research, INFORMS, vol. 50(1), pages 459-481, February.
Norman, Thomas W.L., 2023. "Pigouvian algorithmic platform design," Journal of Economic Behavior & Organization, Elsevier, vol. 212(C), pages 322-332.
Giacomo Aletti & Caterina May & Piercesare Secchi, 2012. "A Functional Equation Whose Unknown is $\mathcal{P}([0,1])$ Valued," Journal of Theoretical Probability, Springer, vol. 25(4), pages 1207-1232, December.
Han, Jungsuk & Sangiorgi, Francesco, 2018. "Searching for information," Journal of Economic Theory, Elsevier, vol. 175(C), pages 342-373.
- Han, Jungsuk & Sangiorgi, Francesco, 2015. "Searching for Information," Working Paper Series 300, Sveriges Riksbank (Central Bank of Sweden).
Kuang Xu & Se-Young Yun, 2020. "Reinforcement with Fading Memories," Mathematics of Operations Research, INFORMS, vol. 45(4), pages 1258-1288, November.
Pemantle, Robin & Skyrms, Brian, 2004. "Network formation by reinforcement learning: the long and medium run," Mathematical Social Sciences, Elsevier, vol. 48(3), pages 315-327, November.
Michael Foley & Rory Smead & Patrick Forber & Christoph Riedl, 2021. "Avoiding the bullies: The resilience of cooperation among unequals," PLOS Computational Biology, Public Library of Science, vol. 17(4), pages 1-19, April.
- Michael Foley & Rory Smead & Patrick Forber & Christoph Riedl, 2021. "Avoiding the bullies: The resilience of cooperation among unequals," Papers 2104.08636, arXiv.org.
Oyarzun, Carlos & Sarin, Rajiv, 2013. "Learning and risk aversion," Journal of Economic Theory, Elsevier, vol. 148(1), pages 196-225.
- Carlos Oyarzun & Rajiv Sarin, 2005. "Learning and Risk Aversion," Levine's Bibliography 784828000000000482, UCLA Department of Economics.
- Carlos Oyarzun & Rajiv Sarin, 2012. "Learning and Risk Aversion," Levine's Working Paper Archive 786969000000000572, David K. Levine.
Georgios Chasparis & Jeff Shamma, 2012. "Distributed Dynamic Reinforcement of Efficient Outcomes in Multiagent Coordination and Network Formation," Dynamic Games and Applications, Springer, vol. 2(1), pages 18-50, March.

Browse Econ Literature

More features

On the convergence of reinforcement learning

Citations

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data