Deep Fictitious Play for Finding Markovian Nash Equilibrium in Multi-Agent Games

My bibliography Save this paper

Deep Fictitious Play for Finding Markovian Nash Equilibrium in Multi-Agent Games

Author

Listed:

Jiequn Han
Ruimeng Hu

Registered:

Abstract

We propose a deep neural network-based algorithm to identify the Markovian Nash equilibrium of general large $N$-player stochastic differential games. Following the idea of fictitious play, we recast the $N$-player game into $N$ decoupled decision problems (one for each player) and solve them iteratively. The individual decision problem is characterized by a semilinear Hamilton-Jacobi-Bellman equation, to solve which we employ the recently developed deep BSDE method. The resulted algorithm can solve large $N$-player games for which conventional numerical methods would suffer from the curse of dimensionality. Multiple numerical examples involving identical or heterogeneous agents, with risk-neutral or risk-sensitive objectives, are tested to validate the accuracy of the proposed algorithm in large group games. Even for a fifty-player game with the presence of common noise, the proposed algorithm still finds the approximate Nash equilibrium accurately, which, to our best knowledge, is difficult to achieve by other numerical algorithms.

Suggested Citation

Jiequn Han & Ruimeng Hu, 2019. "Deep Fictitious Play for Finding Markovian Nash Equilibrium in Multi-Agent Games," Papers 1912.01809, arXiv.org, revised Jun 2020.

Handle: RePEc:arx:papers:1912.01809

Download full text from publisher

References listed on IDEAS

Milgrom, Paul & Roberts, John, 1991. "Adaptive and sophisticated learning in normal form games," Games and Economic Behavior, Elsevier, vol. 3(1), pages 82-100, February.
Berger, Ulrich, 2005. "Fictitious play in 2 x n games," Journal of Economic Theory, Elsevier, vol. 120(2), pages 139-154, February.
Jordan J. S., 1993. "Three Problems in Learning Mixed-Strategy Nash Equilibria," Games and Economic Behavior, Elsevier, vol. 5(3), pages 368-386, July.
Yves Achdou & Jiequn Han & Jean-Michel Lasry & Pierre-Louis Lions & Benjamin Moll, 2017. "Income and Wealth Distribution in Macroeconomics: A Continuous-Time Approach," NBER Working Papers 23732, National Bureau of Economic Research, Inc.
A. Prasad & S. P. Sethi, 2004. "Competitive Advertising Under Uncertainty: A Stochastic Differential Game Approach," Journal of Optimization Theory and Applications, Springer, vol. 123(1), pages 163-185, October.
Pierre Cardaliaguet & Charles-Albert Lehalle, 2016. "Mean Field Game of Controls and An Application To Trade Crowding," Papers 1610.09904, arXiv.org, revised Sep 2017.
Xun Gao & Lu-Ming Duan, 2017. "Efficient representation of quantum many-body states with deep neural networks," Nature Communications, Nature, vol. 8(1), pages 1-6, December.
Vijay Krishna & Tomas Sjöström, 1998. "On the Convergence of Fictitious Play," Mathematics of Operations Research, INFORMS, vol. 23(2), pages 479-511, May.
- Vijay Krishna & Tomas Sjostrom, 1995. "On the Convergence of Fictitious Play," Harvard Institute of Economic Research Working Papers 1717, Harvard - Institute of Economic Research.
- Vijay Krishna & Tomas Sjostrom, 1995. "On the Convergence of Fictitious Play," Game Theory and Information 9503003, University Library of Munich, Germany.
- Sjostrom, T. & Krishna, V., 1995. "On the Convergence of Ficticious Play," Papers 04-95-07, Pennsylvania State - Department of Economics.
- Vijay Krishna & T. Sjostrom, 2010. "On the Convergence of Fictitious Play," Levine's Working Paper Archive 417, David K. Levine.
Josef Hofbauer & William H. Sandholm, 2002. "On the Global Convergence of Stochastic Fictitious Play," Econometrica, Econometric Society, vol. 70(6), pages 2265-2294, November.
Ngo Long, 2011. "Dynamic Games in the Economics of Natural Resources: A Survey," Dynamic Games and Applications, Springer, vol. 1(1), pages 115-148, March.
Dockner,Engelbert J. & Jorgensen,Steffen & Long,Ngo Van & Sorger,Gerhard, 2000. "Differential Games in Economics and Management Science," Cambridge Books, Cambridge University Press, number 9780521637329.
Darryl A. Seale & John E. Burnett, 2006. "Solving Large Games With Simulated Fictitious Play," International Game Theory Review (IGTR), World Scientific Publishing Co. Pte. Ltd., vol. 8(03), pages 437-467.
Monderer, Dov & Sela, Aner, 1996. "A2 x 2Game without the Fictitious Play Property," Games and Economic Behavior, Elsevier, vol. 14(1), pages 144-148, May.
Monderer, Dov & Shapley, Lloyd S., 1996. "Fictitious Play Property for Games with Identical Interests," Journal of Economic Theory, Elsevier, vol. 68(1), pages 258-265, January.

Full references (including those not matched with items on IDEAS)

Citations

Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.

Cited by:

Xiangdong Liu & Yu Gu, 2023. "Study of Pricing of High-Dimensional Financial Derivatives Based on Deep Learning," Mathematics, MDPI, vol. 11(12), pages 1-16, June.
Ming Min & Ruimeng Hu, 2021. "Signatured Deep Fictitious Play for Mean Field Games with Common Noise," Papers 2106.03272, arXiv.org.
Steven Campbell & Yichao Chen & Arvind Shrivats & Sebastian Jaimungal, 2021. "Deep Learning for Principal-Agent Mean Field Games," Papers 2110.01127, arXiv.org.
Han, Jiequn & Hu, Ruimeng & Long, Jihao, 2023. "A class of dimension-free metrics for the convergence of empirical measures," Stochastic Processes and their Applications, Elsevier, vol. 164(C), pages 242-287.
Jiequn Han & Yucheng Yang & Weinan E, 2021. "DeepHAM: A Global Solution Method for Heterogeneous Agent Models with Aggregate Shocks," Papers 2112.14377, arXiv.org, revised Feb 2022.
Sebastian Jaimungal, 2022. "Reinforcement learning and stochastic optimisation," Finance and Stochastics, Springer, vol. 26(1), pages 103-129, January.
Jiequn Han & Ruimeng Hu, 2021. "Recurrent Neural Networks for Stochastic Control Problems with Delay," Papers 2101.01385, arXiv.org, revised Jun 2021.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Jiequn Han & Ruimeng Hu & Jihao Long, 2020. "Convergence of Deep Fictitious Play for Stochastic Differential Games," Papers 2008.05519, arXiv.org, revised Mar 2021.
Ulrich Berger, 2004. "Two More Classes of Games with the Fictitious Play Property," Game Theory and Information 0408003, University Library of Munich, Germany.
Ulrich Berger, 2004. "Some Notes on Learning in Games with Strategic Complementarities," Game Theory and Information 0409001, University Library of Munich, Germany.
Ewerhart, Christian & Valkanova, Kremena, 2020. "Fictitious play in networks," Games and Economic Behavior, Elsevier, vol. 123(C), pages 182-206.
- Christian Ewerhart & Kremena Valkanova, 2016. "Fictitious play in networks," ECON - Working Papers 239, Department of Economics - University of Zurich, revised Jun 2019.
Berger, Ulrich, 2005. "Fictitious play in 2 x n games," Journal of Economic Theory, Elsevier, vol. 120(2), pages 139-154, February.
Andriy Zapechelnyuk, 2009. "Limit Behavior of No-regret Dynamics," Discussion Papers 21, Kyiv School of Economics.
Sela, Aner, 2000. "Fictitious Play in 2 x 3 Games," Games and Economic Behavior, Elsevier, vol. 31(1), pages 152-162, April.
Berger, Ulrich, 2008. "Learning in games with strategic complementarities revisited," Journal of Economic Theory, Elsevier, vol. 143(1), pages 292-301, November.
van Strien, Sebastian & Sparrow, Colin, 2011. "Fictitious play in 3x3 games: Chaos and dithering behaviour," Games and Economic Behavior, Elsevier, vol. 73(1), pages 262-286, September.
Berger, Ulrich, 2007. "Two more classes of games with the continuous-time fictitious play property," Games and Economic Behavior, Elsevier, vol. 60(2), pages 247-261, August.
Berger, Ulrich, 2007. "Brown's original fictitious play," Journal of Economic Theory, Elsevier, vol. 135(1), pages 572-578, July.
- Ulrich Berger, 2005. "Brown's Original Fictitious Play," Game Theory and Information 0503008, University Library of Munich, Germany.
Candogan, Ozan & Ozdaglar, Asuman & Parrilo, Pablo A., 2013. "Dynamics in near-potential games," Games and Economic Behavior, Elsevier, vol. 82(C), pages 66-90.
Benaïm, Michel & Hofbauer, Josef & Hopkins, Ed, 2009. "Learning in games with unstable equilibria," Journal of Economic Theory, Elsevier, vol. 144(4), pages 1694-1709, July.
- Ed Hopkins & Josef Hofbauer & Michel Benaim, 2005. "Learning in Games with Unstable Equilibria," Edinburgh School of Economics Discussion Paper Series 135, Edinburgh School of Economics, University of Edinburgh.
- Michel Benaim & Josef Hofbauer & Ed Hopkins, 2006. "Learning in Games with Unstable Equilibria," Levine's Bibliography 321307000000000547, UCLA Department of Economics.
- Michel Benaim & Josef Hofbauer & Ed Hopkins, 2005. "Learning in Games with Unstable Equilibria," Levine's Bibliography 784828000000000609, UCLA Department of Economics.
Hopkins, Ed, 1999. "Learning, Matching, and Aggregation," Games and Economic Behavior, Elsevier, vol. 26(1), pages 79-110, January.
- Ed Hopkins, "undated". "Learning, Matching and Aggregation," Discussion Papers 1996-2, Edinburgh School of Economics, University of Edinburgh.
- Hopkins, E., 1995. "Learning, Matching and Aggregation," G.R.E.Q.A.M. 95a20, Universite Aix-Marseille III.
- Ed Hopkins, 1995. "Learning, Matching and Aggregation," Edinburgh School of Economics Discussion Paper Series 2, Edinburgh School of Economics, University of Edinburgh.
- Ed Hopkins, "undated". "Learning, Matching and Aggregation," ELSE working papers 033, ESRC Centre on Economics Learning and Social Evolution.
- Ed Hopkins, 1995. "Learning, Matching and Aggregation," Game Theory and Information 9512001, University Library of Munich, Germany.
- Ed Hopkins, "undated". "Learning, Matching and Aggregation," Department of Economics 1996 : II, Edinburgh School of Economics, University of Edinburgh.
Hofbauer, Josef & Hopkins, Ed, 2005. "Learning in perturbed asymmetric games," Games and Economic Behavior, Elsevier, vol. 52(1), pages 133-152, July.
- Josef Hofbauer & Ed Hopkins, 2000. "Learning in Perturbed Asymmetric Games," Edinburgh School of Economics Discussion Paper Series 53, Edinburgh School of Economics, University of Edinburgh.
Sobel, Joel, 2000. "Economists' Models of Learning," Journal of Economic Theory, Elsevier, vol. 94(2), pages 241-261, October.
Leslie, David S. & Collins, E.J., 2006. "Generalised weakened fictitious play," Games and Economic Behavior, Elsevier, vol. 56(2), pages 285-298, August.
Monderer, Dov & Samet, Dov & Sela, Aner, 1997. "Belief Affirming in Learning Processes," Journal of Economic Theory, Elsevier, vol. 73(2), pages 438-452, April.
- Dov Monderer & Dov Samet & Aner Sela, 1994. "Belief Affirming in Learning Processes," Game Theory and Information 9408002, University Library of Munich, Germany, revised 11 Aug 1994.
- Dov Monderer & Dov Samet & Aner Sela, 2010. "Belief Affirming in Learning Processes," Levine's Working Paper Archive 420, David K. Levine.
Hofbauer,J. & Sandholm,W.H., 2001. "Evolution and learning in games with randomly disturbed payoffs," Working papers 5, Wisconsin Madison - Social Systems.
- Josef Hofbauer & William H. Sandholm, 2001. "Evolution and Learning in Games with Randomly Disturbed Payoffs," Vienna Economics Papers vie0205, University of Vienna, Department of Economics.
Ozan Candogan & Ishai Menache & Asuman Ozdaglar & Pablo A. Parrilo, 2011. "Flows and Decompositions of Games: Harmonic and Potential Games," Mathematics of Operations Research, INFORMS, vol. 36(3), pages 474-503, August.

More about this item

NEP fields

This paper has been announced in the following NEP Reports:

NEP-CMP-2020-01-06 (Computational Economics)
NEP-GTH-2020-01-06 (Game Theory)
NEP-ORE-2020-01-06 (Operations Research)

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:1912.01809. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Deep Fictitious Play for Finding Markovian Nash Equilibrium in Multi-Agent Games

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Citations

Most related items

More about this item

NEP fields

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data