
Transient and asymptotic dynamics of reinforcement learning in games

  • Izquierdo, Luis R.
  • Izquierdo, Segismundo S.
  • Gotts, Nicholas M.
  • Polhill, J. Gary

No abstract is available for this item.


File URL: http://www.sciencedirect.com/science/article/B6WFW-4NF2HGK-2/2/9d24ecb931f1be3727e086c0585957e5
Download Restriction: Full text for ScienceDirect subscribers only


Article provided by Elsevier in its journal Games and Economic Behavior.

Volume (Year): 61 (2007)
Issue (Month): 2 (November)
Pages: 259-276


Handle: RePEc:eee:gamebe:v:61:y:2007:i:2:p:259-276
Contact details of provider: Web page: http://www.elsevier.com/locate/inca/622836

References listed on IDEAS
  1. Mookherjee Dilip & Sopher Barry, 1994. "Learning Behavior in an Experimental Matching Pennies Game," Games and Economic Behavior, Elsevier, vol. 7(1), pages 62-91, July.
  2. Barry Sopher & Dilip Mookherjee, 1997. "Learning and Decision Costs in Experimental Constant Sum Games," Departmental Working Papers 199527, Rutgers University, Department of Economics.
  3. Alan Beggs, 2002. "On the Convergence of Reinforcement Learning," Economics Series Working Papers 96, University of Oxford, Department of Economics.
  4. Borgers, Tilman & Sarin, Rajiv, 2000. "Naive Reinforcement Learning with Endogenous Aspirations," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 41(4), pages 921-50, November.
  5. Tilman Börgers & Rajiv Sarin, . "Learning Through Reinforcement and Replicator Dynamics," ELSE working papers 051, ESRC Centre on Economics Learning and Social Evolution.
  6. John Duffy, 2004. "Agent-Based Models and Human Subject Experiments," Computational Economics 0412001, EconWPA.
  7. Beggs, A., 2000. "Stochastic Evolution with Slow Learning," Economics Series Working Papers 9933, University of Oxford, Department of Economics.
  8. Laslier, J.-F. & Topol, R. & Walliser, B., 1999. "A Behavioral Learning Process in Games," Papers 99-03, Paris X - Nanterre, U.F.R. de Sc. Ec. Gest. Maths Infor..
  9. Ed Hopkins, 2002. "Two Competing Models of How People Learn in Games," Econometrica, Econometric Society, vol. 70(6), pages 2141-2166, November.
  10. Fernando Vega Redondo & Frédéric Palomino, 1996. "Convergence of aspirations and (partial) cooperation in the Prisoner's Dilemma," Working Papers. Serie AD 1996-20, Instituto Valenciano de Investigaciones Económicas, S.A. (Ivie).
  11. Cross, John G, 1973. "A Stochastic Learning Model of Economic Behavior," The Quarterly Journal of Economics, MIT Press, vol. 87(2), pages 239-66, May.
  12. Yan Chen & Fang-Fang Tang, 1998. "Learning and Incentive-Compatible Mechanisms for Public Goods Provision: An Experimental Study," Journal of Political Economy, University of Chicago Press, vol. 106(3), pages 633-662, June.
  13. Karandikar, Rajeeva & Mookherjee, Dilip & Ray, Debraj & Vega-Redondo, Fernando, 1998. "Evolving Aspirations and Cooperation," Journal of Economic Theory, Elsevier, vol. 80(2), pages 292-331, June.
  14. Ianni, A., 2002. "Reinforcement learning and the power law of practice: some analytical results," Discussion Paper Series In Economics And Econometrics 0203, Economics Division, School of Social Sciences, University of Southampton.
  15. Arthur, W Brian, 1991. "Designing Economic Agents that Act Like Human Agents: A Behavioral Approach to Bounded Rationality," American Economic Review, American Economic Association, vol. 81(2), pages 353-59, May.
  16. Binmore Kenneth G. & Samuelson Larry & Vaughan Richard, 1995. "Musical Chairs: Modeling Noisy Evolution," Games and Economic Behavior, Elsevier, vol. 11(1), pages 1-35, October.
  17. Jean-François Laslier & Bernard Walliser, 2005. "A reinforcement learning process in extensive form games," International Journal of Game Theory, Springer, vol. 33(2), pages 219-227, June.
  18. Boylan Richard T., 1995. "Continuous Approximation of Dynamical Systems with Randomly Matched Individuals," Journal of Economic Theory, Elsevier, vol. 66(2), pages 615-625, August.
  19. Roth, Alvin E. & Erev, Ido, 1995. "Learning in extensive-form games: Experimental data and simple dynamic models in the intermediate term," Games and Economic Behavior, Elsevier, vol. 8(1), pages 164-212.
  20. Erev, Ido & Bereby-Meyer, Yoella & Roth, Alvin E., 1999. "The effect of adding a constant to all payoffs: experimental investigation, and implications for reinforcement learning models," Journal of Economic Behavior & Organization, Elsevier, vol. 39(1), pages 111-128, May.
  21. Martin Posch, 1997. "Cycling in a stochastic learning algorithm for normal form games," Journal of Evolutionary Economics, Springer, vol. 7(2), pages 193-207.
  22. Hopkins, Ed & Posch, Martin, 2005. "Attainability of boundary points under reinforcement learning," Games and Economic Behavior, Elsevier, vol. 53(1), pages 110-125, October.
  23. Erev, Ido & Roth, Alvin E, 1998. "Predicting How People Play Games: Reinforcement Learning in Experimental Games with Unique, Mixed Strategy Equilibria," American Economic Review, American Economic Association, vol. 88(4), pages 848-81, September.
  24. Rustichini, Aldo, 1999. "Optimal Properties of Stimulus--Response Learning Models," Games and Economic Behavior, Elsevier, vol. 29(1-2), pages 244-273, October.
  25. Boylan, Richard T., 1992. "Laws of large numbers for dynamical systems with randomly matched individuals," Journal of Economic Theory, Elsevier, vol. 57(2), pages 473-504, August.


This information is provided to you by IDEAS at the Research Division of the Federal Reserve Bank of St. Louis using RePEc data.