Learning Nash Equilibria

Author

Dai, Darong

Abstract

In this paper, we re-investigate the long-run behavior of an adaptive learning process driven by the stochastic replicator dynamics developed by Fudenberg and Harris (1992). We show that the Nash equilibrium is the robust limit of the adaptive learning process, provided that the learning dynamics can reach it in almost surely finite time. Doob's martingale theory and the Girsanov theorem play central roles in establishing this result.
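For readers unfamiliar with the model, the following is a minimal simulation sketch in the spirit of the Fudenberg and Harris (1992) stochastic replicator dynamics. It is not the paper's method. It assumes that the population size r_i of each strategy follows dr_i = r_i (u_i(x) dt + sigma_i dW_i), where x is the vector of population shares and u_i(x) is the payoff to strategy i against x. The function name, the Prisoner's Dilemma payoff matrix, the noise levels, and the step size are illustrative assumptions only.

```python
import math
import random


def simulate_shares(payoff, sigma, x0, dt=0.01, steps=5000, seed=0):
    """Euler scheme on log population sizes; returns the path of strategy shares.

    payoff[i][j] is the (hypothetical) payoff to strategy i against strategy j,
    sigma[i] is the noise intensity of strategy i, x0 are strictly positive
    initial shares.
    """
    rng = random.Random(seed)
    n = len(x0)
    log_r = [math.log(x) for x in x0]  # initial sizes proportional to shares
    path = [list(x0)]
    for _ in range(steps):
        x = path[-1]
        # Payoff of each pure strategy against the current population mix.
        u = [sum(payoff[i][j] * x[j] for j in range(n)) for i in range(n)]
        for i in range(n):
            dw = rng.gauss(0.0, math.sqrt(dt))  # Brownian increment over dt
            # Ito correction -sigma_i^2 / 2 because we step log r_i, not r_i.
            log_r[i] += (u[i] - 0.5 * sigma[i] ** 2) * dt + sigma[i] * dw
        # Convert sizes to shares; subtracting the max avoids overflow in exp.
        m = max(log_r)
        w = [math.exp(v - m) for v in log_r]
        total = sum(w)
        path.append([v / total for v in w])
    return path


if __name__ == "__main__":
    # Prisoner's Dilemma: strategy 0 = cooperate, strategy 1 = defect.
    pd_payoff = [[3.0, 0.0], [5.0, 1.0]]
    shares = simulate_shares(pd_payoff, sigma=[0.2, 0.2], x0=[0.5, 0.5])
    print("final shares (cooperate, defect):", shares[-1])
```

Stepping the logarithm of each population size keeps every share strictly inside the simplex at each step, which a direct Euler step on the shares would not guarantee. In this example defection strictly dominates, so the simulated shares should concentrate on the game's unique Nash equilibrium.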

Suggested Citation

Dai, Darong, 2012. "Learning Nash Equilibria," MPRA Paper 40040, University Library of Munich, Germany.

Handle: RePEc:pra:mprapa:40040

References listed on IDEAS

Alan Beggs, 2002. "Stochastic evolution with slow learning," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 19(2), pages 379-405.
- Alan Beggs, 2000. "Stochastic Evolution with Slow Learning," Economics Series Working Papers 33, University of Oxford, Department of Economics.
- Beggs, A., 2000. "Stochastic Evolution with Slow Learning," Economics Series Working Papers 9933, University of Oxford, Department of Economics.
Fudenberg, D. & Harris, C., 1992. "Evolutionary dynamics with aggregate shocks," Journal of Economic Theory, Elsevier, vol. 57(2), pages 420-441, August.
- Fudenberg, Drew & Harris, Christopher, 1992. "Evolutionary Dynamics with Aggregate Shocks," IDEI Working Papers 13, Institut d'Économie Industrielle (IDEI), Toulouse.
- D. Fudenberg & C. Harris, 2010. "Evolutionary Dynamics with Aggregate Shocks," Levine's Working Paper Archive 496, David K. Levine.
Ken Binmore & Larry Samuelson, "undated". "Evolutionary Drift and Equilibrium Selection," ELSE working papers 011, ESRC Centre on Economics Learning and Social Evolution.
Jordan J. S., 1993. "Three Problems in Learning Mixed-Strategy Nash Equilibria," Games and Economic Behavior, Elsevier, vol. 5(3), pages 368-386, July.
Ellison, Glenn & Fudenberg, Drew, 2000. "Learning Purified Mixed Equilibria," Journal of Economic Theory, Elsevier, vol. 90(1), pages 84-115, January.
- Glenn Ellison & Drew Fudenberg, 1998. "Learning Purified Mixed Equilibria," Harvard Institute of Economic Research Working Papers 1817, Harvard - Institute of Economic Research.
Canning, David, 1992. "Average behavior in learning models," Journal of Economic Theory, Elsevier, vol. 57(2), pages 442-472, August.
- Canning, D., 1990. "Average Behaviour In Learning Models," Papers 156, Cambridge - Risk, Information & Quantity Signals.
- D. Canning, 2010. "Average Behavior in Learning Models," Levine's Working Paper Archive 490, David K. Levine.
Young, H Peyton, 1993. "The Evolution of Conventions," Econometrica, Econometric Society, vol. 61(1), pages 57-84, January.
Hofbauer, Josef & Hopkins, Ed, 2005. "Learning in perturbed asymmetric games," Games and Economic Behavior, Elsevier, vol. 52(1), pages 133-152, July.
- Josef Hofbauer & Ed Hopkins, 2000. "Learning in Perturbed Asymmetric Games," Edinburgh School of Economics Discussion Paper Series 53, Edinburgh School of Economics, University of Edinburgh.
Ken Binmore & Larry Samuelson, 1999. "Evolutionary Drift and Equilibrium Selection," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 66(2), pages 363-393.
Kaniovski Yuri M. & Young H. Peyton, 1995. "Learning Dynamics in Games with Stochastic Perturbations," Games and Economic Behavior, Elsevier, vol. 11(2), pages 330-363, November.
Binmore Kenneth G. & Samuelson Larry & Vaughan Richard, 1995. "Musical Chairs: Modeling Noisy Evolution," Games and Economic Behavior, Elsevier, vol. 11(1), pages 1-35, October.
Ken Binmore & Larry Samuelson, "undated". "Evolutionary Drift And Equilibrium Selection," ELSE working papers 049, ESRC Centre on Economics Learning and Social Evolution.
Borgers, Tilman & Sarin, Rajiv, 1997. "Learning Through Reinforcement and Replicator Dynamics," Journal of Economic Theory, Elsevier, vol. 77(1), pages 1-14, November.
- Tilman Börgers & Rajiv Sarin, "undated". "Learning Through Reinforcement and Replicator Dynamics," ELSE working papers 051, ESRC Centre on Economics Learning and Social Evolution.
- T. Borgers & R. Sarin, 2010. "Learning Through Reinforcement and Replicator Dynamics," Levine's Working Paper Archive 380, David K. Levine.
Cabrales, Antonio, 2000. "Stochastic Replicator Dynamics," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 41(2), pages 451-481, May.
- Antonio Cabrales, 1993. "Stochastic replicator dynamics," Economics Working Papers 54, Department of Economics and Business, Universitat Pompeu Fabra.
- A. Cabrales, 2010. "Stochastic Replicator Dynamics," Levine's Working Paper Archive 489, David K. Levine.
Gale, John & Binmore, Kenneth G. & Samuelson, Larry, 1995. "Learning to be imperfect: The ultimatum game," Games and Economic Behavior, Elsevier, vol. 8(1), pages 56-90.
Benaim, Michel & Hirsch, Morris W., 1999. "Mixed Equilibria and Dynamical Systems Arising from Fictitious Play in Perturbed Games," Games and Economic Behavior, Elsevier, vol. 29(1-2), pages 36-72, October.
Gaunersdorfer Andrea & Hofbauer Josef, 1995. "Fictitious Play, Shapley Polygons, and the Replicator Equation," Games and Economic Behavior, Elsevier, vol. 11(2), pages 279-303, November.
- A. Gaunersdorfer & J. Hofbauer, 2010. "Fictitious Play, Shapley Polygons and the Replicator Equation," Levine's Working Paper Archive 438, David K. Levine.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Dai, Darong, 2012. "On the Existence and Stability of Pareto Optimal Endogenous Matching with Fairness," MPRA Paper 40560, University Library of Munich, Germany.
Sandholm, William H., 2003. "Evolution and equilibrium under inexact information," Games and Economic Behavior, Elsevier, vol. 44(2), pages 343-378, August.
Ed Hopkins, 2002. "Two Competing Models of How People Learn in Games," Econometrica, Econometric Society, vol. 70(6), pages 2141-2166, November.
- Ed Hopkins, 1999. "Two Competing Models of How People Learn in Games," Edinburgh School of Economics Discussion Paper Series 42, Edinburgh School of Economics, University of Edinburgh, revised Dec 2000.
- Ed Hopkins, 2001. "Two Competing Models of How People Learn in Games," NajEcon Working Paper Reviews 625018000000000226, www.najecon.org.
- Ed Hopkins, 2000. "Two Competing Models of How People Learn in Games," Edinburgh School of Economics Discussion Paper Series 51, Edinburgh School of Economics, University of Edinburgh, revised Dec 2000.
- Ed Hopkins, 2001. "Two Competing Models of How People Learn in Games," Levine's Working Paper Archive 625018000000000226, David K. Levine.
Sandholm,W.H., 1999. "Markov evolution with inexact information," Working papers 15, Wisconsin Madison - Social Systems.
Dai, Darong, 2012. "On the existence and stability of Pareto optimal endogenous matching with fairness," MPRA Paper 40457, University Library of Munich, Germany.
Ianni, A., 2002. "Reinforcement learning and the power law of practice: some analytical results," Discussion Paper Series In Economics And Econometrics 203, Economics Division, School of Social Sciences, University of Southampton.
Hofbauer, Josef & Hopkins, Ed, 2005. "Learning in perturbed asymmetric games," Games and Economic Behavior, Elsevier, vol. 52(1), pages 133-152, July.
- Josef Hofbauer & Ed Hopkins, 2000. "Learning in Perturbed Asymmetric Games," Edinburgh School of Economics Discussion Paper Series 53, Edinburgh School of Economics, University of Edinburgh.
Jonathan Newton, 2018. "Evolutionary Game Theory: A Renaissance," Games, MDPI, vol. 9(2), pages 1-67, May.
Ponti, Giovanni, 2000. "Continuous-time evolutionary dynamics: theory and practice," Research in Economics, Elsevier, vol. 54(2), pages 187-214, June.
- Giovanni Ponti, 1999. "Continuous-Time Evolutionary Dynamics: Theory And Practice," Working Papers. Serie AD 1999-31, Instituto Valenciano de Investigaciones Económicas, S.A. (Ivie).
N. Williams, 2002. "Stability and Long Run Equilibrium in Stochastic Fictitious Play," Princeton Economic Theory Working Papers cbeeeb49cc8afc83f125df5a8, David K. Levine.
Uriarte, Jose Ramon, 2007. "A behavioural foundation for models of evolutionary drift," Journal of Economic Behavior & Organization, Elsevier, vol. 63(3), pages 497-513, July.
Uriarte Ayo, José Ramón, 2005. "A Behavioral Foundation for Models of Evolutionary Drift," IKERLANAK 2005-19, Universidad del País Vasco - Departamento de Fundamentos del Análisis Económico I.
Dai, Darong, 2012. "On the Existence of Pareto Optimal Endogenous Matching," MPRA Paper 43125, University Library of Munich, Germany.
Simon P. Anderson & Jacob K. Goeree & Charles A. Holt, 1999. "Stochastic Game Theory: Adjustment to Equilibrium Under Noisy Directional Learning," Virginia Economics Online Papers 327, University of Virginia, Department of Economics.
Hofbauer,J. & Sandholm,W.H., 2001. "Evolution and learning in games with randomly disturbed payoffs," Working papers 5, Wisconsin Madison - Social Systems.
- Josef Hofbauer & William H. Sandholm, 2001. "Evolution and Learning in Games with Randomly Disturbed Payoffs," Vienna Economics Papers vie0205, University of Vienna, Department of Economics.
Williams, Noah, 2022. "Learning and equilibrium transitions: Stochastic stability in discounted stochastic fictitious play," Journal of Economic Dynamics and Control, Elsevier, vol. 145(C).
Hofbauer, Josef & Sandholm, William H., 2007. "Evolution in games with randomly disturbed payoffs," Journal of Economic Theory, Elsevier, vol. 132(1), pages 47-69, January.
- Hofbauer,J. & Sandholm,W.H., 2003. "Evolution in games with randomly disturbed payoffs," Working papers 20, Wisconsin Madison - Social Systems.
Sandholm, William H., 2015. "Population Games and Deterministic Evolutionary Dynamics," Handbook of Game Theory with Economic Applications, Elsevier.
Heller, Yuval & Kuzmics, Christoph, 2020. "Communication, Renegotiation and Coordination with Private Values (Extended Version)," MPRA Paper 102926, University Library of Munich, Germany, revised 26 Jul 2021.

More about this item

JEL classification:

C72 - Mathematical and Quantitative Methods - - Game Theory and Bargaining Theory - - - Noncooperative Games
C73 - Mathematical and Quantitative Methods - - Game Theory and Bargaining Theory - - - Stochastic and Dynamic Games; Evolutionary Games

NEP fields

This paper has been announced in the following NEP Reports:

NEP-GTH-2012-07-23 (Game Theory)
NEP-MIC-2012-07-23 (Microeconomics)
