IDEAS home Printed from https://ideas.repec.org/r/eee/jetheo/v122y2005i1p1-36.html
   My bibliography  Save this item

On the convergence of reinforcement learning

Citations

Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
as


Cited by:

  1. Hofbauer, Josef & Hopkins, Ed, 2005. "Learning in perturbed asymmetric games," Games and Economic Behavior, Elsevier, vol. 52(1), pages 133-152, July.
  2. Köke, Sonja & Lange, Andreas & Nicklisch, Andreas, 2015. "Adversity is a school of wisdomː Experimental evidence on cooperative protection against stochastic losses," WiSo-HH Working Paper Series 22, University of Hamburg, Faculty of Business, Economics and Social Sciences, WISO Research Laboratory.
  3. Josephson, Jens, 2008. "A numerical analysis of the evolutionary stability of learning rules," Journal of Economic Dynamics and Control, Elsevier, vol. 32(5), pages 1569-1599, May.
  4. Mario Bravo & Mathieu Faure, 2013. "Reinforcement Learning with Restrictions on the Action Set," AMSE Working Papers 1335, Aix-Marseille School of Economics, France, revised 01 Jul 2013.
  5. Ding, Jieyao & Nicklisch, Andreas, 2013. "On the impulse in impulse learning," Economics Letters, Elsevier, vol. 121(2), pages 294-297.
  6. Nicklisch, Andreas & Köke, Sonja & Lange, Andreas, 2016. "Is Adversity a School of Wisdom? Experimental Evidence on Cooperative Protection Against Stochastic Losses," VfS Annual Conference 2016 (Augsburg): Demographic Change 145716, Verein für Socialpolitik / German Economic Association.
  7. Chmura, Thorsten & Goerg, Sebastian J. & Selten, Reinhard, 2012. "Learning in experimental 2×2 games," Games and Economic Behavior, Elsevier, vol. 76(1), pages 44-73.
  8. Naoki Funai, 2013. "An Adaptive Learning Model in Coordination Games," Discussion Papers 13-14, Department of Economics, University of Birmingham.
  9. Jieyao Ding & Andreas Nicklisch, 2013. "On the Impulse in Impulse Learning," Discussion Paper Series of the Max Planck Institute for Research on Collective Goods 2013_02, Max Planck Institute for Research on Collective Goods.
  10. Chernov, G. & Susin, I., 2019. "Models of learning in games: An overview," Journal of the New Economic Association, New Economic Association, vol. 44(4), pages 77-125.
  11. Ianni, Antonella, 2014. "Learning strict Nash equilibria through reinforcement," Journal of Mathematical Economics, Elsevier, vol. 50(C), pages 148-155.
  12. Friedman, Daniel & Huck, Steffen & Oprea, Ryan & Weidenholzer, Simon, 2015. "From imitation to collusion: Long-run learning in a low-information environment," Journal of Economic Theory, Elsevier, vol. 155(C), pages 185-205.
  13. Jaspersen, Johannes G. & Montibeller, Gilberto, 2020. "On the learning patterns and adaptive behavior of terrorist organizations," European Journal of Operational Research, Elsevier, vol. 282(1), pages 221-234.
  14. Fortini, Sandra & Petrone, Sonia & Sporysheva, Polina, 2018. "On a notion of partially conditionally identically distributed sequences," Stochastic Processes and their Applications, Elsevier, vol. 128(3), pages 819-846.
  15. Albert Banal-Estañol & Augusto Rupérez Micola, 2009. "Composition of Electricity Generation Portfolios, Pivotal Dynamics, and Market Prices," Management Science, INFORMS, vol. 55(11), pages 1813-1831, November.
  16. Naoki Funai, 2019. "Convergence results on stochastic adaptive learning," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 68(4), pages 907-934, November.
  17. Oyarzun, Carlos & Sarin, Rajiv, 2013. "Learning and risk aversion," Journal of Economic Theory, Elsevier, vol. 148(1), pages 196-225.
  18. Mele, Antonio & Molnár, Krisztina & Santoro, Sergio, 2020. "On the perils of stabilizing prices when agents are learning," Journal of Monetary Economics, Elsevier, vol. 115(C), pages 339-353.
  19. Michael Foley & Rory Smead & Patrick Forber & Christoph Riedl, 2021. "Avoiding the bullies: The resilience of cooperation among unequals," PLOS Computational Biology, Public Library of Science, vol. 17(4), pages 1-18, April.
  20. Nazaria Solferino & Viviana Solferino & Serena F. Taurino, 2018. "The economics analysis of a Q-learning model of cooperation with punishment and risk taking preferences," Journal of Economic Interaction and Coordination, Springer;Society for Economic Science with Heterogeneous Interacting Agents, vol. 13(3), pages 601-613, October.
  21. Erik Mohlin & Robert Ostling & Joseph Tao-yi Wang, 2014. "Learning by Imitation in Games: Theory, Field, and Laboratory," Economics Series Working Papers 734, University of Oxford, Department of Economics.
  22. Leslie, David S. & Collins, E.J., 2006. "Generalised weakened fictitious play," Games and Economic Behavior, Elsevier, vol. 56(2), pages 285-298, August.
  23. Christoph March, 2019. "The Behavioral Economics of Artificial Intelligence: Lessons from Experiments with Computer Players," CESifo Working Paper Series 7926, CESifo.
  24. Giacomo Aletti & Caterina May & Piercesare Secchi, 2012. "A Functional Equation Whose Unknown is $\mathcal{P}([0,1])$ Valued," Journal of Theoretical Probability, Springer, vol. 25(4), pages 1207-1232, December.
  25. Kuang Xu & Se-Young Yun, 2020. "Reinforcement with Fading Memories," Mathematics of Operations Research, INFORMS, vol. 45(4), pages 1258-1288, November.
  26. Beggs, Alan, 2022. "Reference points and learning," Journal of Mathematical Economics, Elsevier, vol. 100(C).
  27. Maxwell Pak & Bing Xu, 2016. "Generalized reinforcement learning in perfect-information games," International Journal of Game Theory, Springer;Game Theory Society, vol. 45(4), pages 985-1011, November.
  28. Jacques Durieu & Philippe Solal, 2012. "Models of Adaptive Learning in Game Theory," Chapters, in: Richard Arena & Agnès Festré & Nathalie Lazaric (ed.), Handbook of Knowledge and Economics, chapter 11, Edward Elgar Publishing.
  29. Izquierdo, Luis R. & Izquierdo, Segismundo S. & Gotts, Nicholas M. & Polhill, J. Gary, 2007. "Transient and asymptotic dynamics of reinforcement learning in games," Games and Economic Behavior, Elsevier, vol. 61(2), pages 259-276, November.
  30. Schuster, Stephan, 2010. "Network Formation with Adaptive Agents," MPRA Paper 27388, University Library of Munich, Germany.
  31. Hopkins, Ed & Posch, Martin, 2005. "Attainability of boundary points under reinforcement learning," Games and Economic Behavior, Elsevier, vol. 53(1), pages 110-125, October.
  32. Cominetti, Roberto & Melo, Emerson & Sorin, Sylvain, 2010. "A payoff-based learning procedure and its application to traffic games," Games and Economic Behavior, Elsevier, vol. 70(1), pages 71-83, September.
  33. Han, Jungsuk & Sangiorgi, Francesco, 2018. "Searching for information," Journal of Economic Theory, Elsevier, vol. 175(C), pages 342-373.
  34. repec:esx:essedp:715 is not listed on IDEAS
  35. Alanyali, Murat, 2010. "A note on adjusted replicator dynamics in iterated games," Journal of Mathematical Economics, Elsevier, vol. 46(1), pages 86-98, January.
  36. Conor Mayo-Wilson & Kevin Zollman & David Danks, 2013. "Wisdom of crowds versus groupthink: learning in groups and in isolation," International Journal of Game Theory, Springer;Game Theory Society, vol. 42(3), pages 695-723, August.
  37. Oyarzun, Carlos & Ruf, Johannes, 2014. "Convergence in models with bounded expected relative hazard rates," Journal of Economic Theory, Elsevier, vol. 154(C), pages 229-244.
  38. Manxi Wu & Saurabh Amin & Asuman Ozdaglar, 2021. "Multi-agent Bayesian Learning with Best Response Dynamics: Convergence and Stability," Papers 2109.00719, arXiv.org.
  39. Naoki Funai, 2013. "An Adaptive Learning Model in Coordination Games," Games, MDPI, vol. 4(4), pages 1-22, November.
  40. Jonathan Newton, 2018. "Evolutionary Game Theory: A Renaissance," Games, MDPI, vol. 9(2), pages 1-67, May.
  41. March, Christoph, 2021. "Strategic interactions between humans and artificial intelligence: Lessons from experiments with computer players," Journal of Economic Psychology, Elsevier, vol. 87(C).
  42. Ilaria Brunetti & Yezekael Hayel & Eitan Altman, 2018. "State-Policy Dynamics in Evolutionary Games," Dynamic Games and Applications, Springer, vol. 8(1), pages 93-116, March.
  43. Manxi Wu & Saurabh Amin, 2019. "Securing Infrastructure Facilities: When Does Proactive Defense Help?," Dynamic Games and Applications, Springer, vol. 9(4), pages 984-1025, December.
  44. Funai, Naoki, 2022. "Reinforcement learning with foregone payoff information in normal form games," Journal of Economic Behavior & Organization, Elsevier, vol. 200(C), pages 638-660.
  45. Mario Bravo, 2016. "An Adjusted Payoff-Based Procedure for Normal Form Games," Mathematics of Operations Research, INFORMS, vol. 41(4), pages 1469-1483, November.
  46. Georgios Chasparis & Jeff Shamma & Anders Rantzer, 2015. "Nonconvergence to saddle boundary points under perturbed reinforcement learning," International Journal of Game Theory, Springer;Game Theory Society, vol. 44(3), pages 667-699, August.
  47. Panayotis Mertikopoulos & William H. Sandholm, 2016. "Learning in Games via Reinforcement and Regularization," Mathematics of Operations Research, INFORMS, vol. 41(4), pages 1297-1324, November.
  48. Roger Waldeck & Eric Darmon, 2006. "Can boundedly rational sellers learn to play Nash?," Journal of Economic Interaction and Coordination, Springer;Society for Economic Science with Heterogeneous Interacting Agents, vol. 1(2), pages 147-169, November.
  49. Pemantle, Robin & Skyrms, Brian, 2004. "Network formation by reinforcement learning: the long and medium run," Mathematical Social Sciences, Elsevier, vol. 48(3), pages 315-327, November.
  50. Georgios Chasparis & Jeff Shamma, 2012. "Distributed Dynamic Reinforcement of Efficient Outcomes in Multiagent Coordination and Network Formation," Dynamic Games and Applications, Springer, vol. 2(1), pages 18-50, March.
IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.