IDEAS home Printed from https://ideas.repec.org/
MyIDEAS: Log in (now much improved!) to save this article

Transient and asymptotic dynamics of reinforcement learning in games

Listed author(s):
  • Izquierdo, Luis R.
  • Izquierdo, Segismundo S.
  • Gotts, Nicholas M.
  • Polhill, J. Gary

No abstract is available for this item.

If you experience problems downloading a file, check if you have the proper application to view it first. In case of further problems read the IDEAS help page. Note that these files are not on the IDEAS site. Please be patient as the files may be large.

File URL: http://www.sciencedirect.com/science/article/pii/S0899-8256(07)00012-7
Download Restriction: Full text for ScienceDirect subscribers only

As the access to this document is restricted, you may want to look for a different version under "Related research" (further below) or search for a different version of it.

Article provided by Elsevier in its journal Games and Economic Behavior.

Volume (Year): 61 (2007)
Issue (Month): 2 (November)
Pages: 259-276

as
in new window

Handle: RePEc:eee:gamebe:v:61:y:2007:i:2:p:259-276
Contact details of provider: Web page: http://www.elsevier.com/locate/inca/622836

References listed on IDEAS
Please report citation or reference errors to , or , if you are the registered author of the cited work, log in to your RePEc Author Service profile, click on "citations" and make appropriate adjustments.:

as
in new window


  1. Debraj Ray & Dilip Mookherjee & Fernando Vega Redondo & Rajeeva L. Karandikar, 1996. "Evolving aspirations and cooperation," Working Papers. Serie AD 1996-06, Instituto Valenciano de Investigaciones Económicas, S.A. (Ivie).
  2. Palomino, F. & Vega, F., 1996. "Convergence of Aspirations and (Partial) Cooperation in the Prisoners's Dilemma," UFAE and IAE Working Papers 345.96, Unitat de Fonaments de l'Anàlisi Econòmica (UAB) and Institut d'Anàlisi Econòmica (CSIC).
  3. Erev, Ido & Roth, Alvin E, 1998. "Predicting How People Play Games: Reinforcement Learning in Experimental Games with Unique, Mixed Strategy Equilibria," American Economic Review, American Economic Association, vol. 88(4), pages 848-881, September.
  4. Hopkins, Ed & Posch, Martin, 2005. "Attainability of boundary points under reinforcement learning," Games and Economic Behavior, Elsevier, vol. 53(1), pages 110-125, October.
  5. Mookherjee Dilip & Sopher Barry, 1994. "Learning Behavior in an Experimental Matching Pennies Game," Games and Economic Behavior, Elsevier, vol. 7(1), pages 62-91, July.
  6. Borgers, Tilman & Sarin, Rajiv, 2000. "Naive Reinforcement Learning with Endogenous Aspirations," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 41(4), pages 921-950, November.
  7. Martin Posch, 1997. "Cycling in a stochastic learning algorithm for normal form games," Journal of Evolutionary Economics, Springer, vol. 7(2), pages 193-207.
  8. Ed Hopkins, 2000. "Two Competing Models of How People Learn in Games," ESE Discussion Papers 51, Edinburgh School of Economics, University of Edinburgh.
  9. Binmore, K. & Samuelson, L., 1993. "An Economist's Perspective on the Evolution of Norms," Working papers 9323, Wisconsin Madison - Social Systems.
  10. Barry Sopher & Dilip Mookherjee, 2000. "Learning and Decision Costs in Experimental Constant Sum Games," Departmental Working Papers 199625, Rutgers University, Department of Economics.
  11. Roth, Alvin E. & Erev, Ido, 1995. "Learning in extensive-form games: Experimental data and simple dynamic models in the intermediate term," Games and Economic Behavior, Elsevier, vol. 8(1), pages 164-212.
  12. Boylan Richard T., 1995. "Continuous Approximation of Dynamical Systems with Randomly Matched Individuals," Journal of Economic Theory, Elsevier, vol. 66(2), pages 615-625, August.
  13. T. Borgers & R. Sarin, 2010. "Learning Through Reinforcement and Replicator Dynamics," Levine's Working Paper Archive 380, David K. Levine.
  14. Arthur, W Brian, 1991. "Designing Economic Agents that Act Like Human Agents: A Behavioral Approach to Bounded Rationality," American Economic Review, American Economic Association, vol. 81(2), pages 353-359, May.
  15. Ianni, A., 2002. "Reinforcement learning and the power law of practice: some analytical results," Discussion Paper Series In Economics And Econometrics 0203, Economics Division, School of Social Sciences, University of Southampton.
  16. Binmore Kenneth G. & Samuelson Larry & Vaughan Richard, 1995. "Musical Chairs: Modeling Noisy Evolution," Games and Economic Behavior, Elsevier, vol. 11(1), pages 1-35, October.
  17. Yan Chen & Fang-Fang Tang, 1998. "Learning and Incentive-Compatible Mechanisms for Public Goods Provision: An Experimental Study," Journal of Political Economy, University of Chicago Press, vol. 106(3), pages 633-662, June.
  18. Alan Beggs, 2002. "Stochastic evolution with slow learning," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 19(2), pages 379-405.
  19. Laslier, Jean-Francois & Topol, Richard & Walliser, Bernard, 2001. "A Behavioral Learning Process in Games," Games and Economic Behavior, Elsevier, vol. 37(2), pages 340-366, November.
  20. Jean-François Laslier & Bernard Walliser, 2005. "A reinforcement learning process in extensive form games," International Journal of Game Theory, Springer;Game Theory Society, vol. 33(2), pages 219-227, 06.
  21. Rustichini, Aldo, 1999. "Optimal Properties of Stimulus--Response Learning Models," Games and Economic Behavior, Elsevier, vol. 29(1-2), pages 244-273, October.
  22. Beggs, A.W., 2005. "On the convergence of reinforcement learning," Journal of Economic Theory, Elsevier, vol. 122(1), pages 1-36, May.
  23. Duffy, John, 2006. "Agent-Based Models and Human Subject Experiments," Handbook of Computational Economics, in: Leigh Tesfatsion & Kenneth L. Judd (ed.), Handbook of Computational Economics, edition 1, volume 2, chapter 19, pages 949-1011 Elsevier.
  24. Erev, Ido & Bereby-Meyer, Yoella & Roth, Alvin E., 1999. "The effect of adding a constant to all payoffs: experimental investigation, and implications for reinforcement learning models," Journal of Economic Behavior & Organization, Elsevier, vol. 39(1), pages 111-128, May.
  25. John G. Cross, 1973. "A Stochastic Learning Model of Economic Behavior," The Quarterly Journal of Economics, Oxford University Press, vol. 87(2), pages 239-266.
  26. Boylan, Richard T., 1992. "Laws of large numbers for dynamical systems with randomly matched individuals," Journal of Economic Theory, Elsevier, vol. 57(2), pages 473-504, August.
Full references (including those not matched with items on IDEAS)

This item is not listed on Wikipedia, on a reading list or among the top items on IDEAS.

When requesting a correction, please mention this item's handle: RePEc:eee:gamebe:v:61:y:2007:i:2:p:259-276. See general information about how to correct material in RePEc.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: (Dana Niculescu)

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If references are entirely missing, you can add them using this form.

If the full references list an item that is present in RePEc, but the system did not link to it, you can help with this form.

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your profile, as there may be some citations waiting for confirmation.

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

This information is provided to you by IDEAS at the Research Division of the Federal Reserve Bank of St. Louis using RePEc data.