Convergence of Deep Fictitious Play for Stochastic Differential Games

My bibliography Save this paper

Convergence of Deep Fictitious Play for Stochastic Differential Games

Author

Listed:

Jiequn Han
Ruimeng Hu
Jihao Long

Registered:

Abstract

Stochastic differential games have been used extensively to model agents' competitions in Finance, for instance, in P2P lending platforms from the Fintech industry, the banking system for systemic risk, and insurance markets. The recently proposed machine learning algorithm, deep fictitious play, provides a novel efficient tool for finding Markovian Nash equilibrium of large $N$-player asymmetric stochastic differential games [J. Han and R. Hu, Mathematical and Scientific Machine Learning Conference, pages 221-245, PMLR, 2020]. By incorporating the idea of fictitious play, the algorithm decouples the game into $N$ sub-optimization problems, and identifies each player's optimal strategy with the deep backward stochastic differential equation (BSDE) method parallelly and repeatedly. In this paper, we prove the convergence of deep fictitious play (DFP) to the true Nash equilibrium. We can also show that the strategy based on DFP forms an $\eps$-Nash equilibrium. We generalize the algorithm by proposing a new approach to decouple the games, and present numerical results of large population games showing the empirical convergence of the algorithm beyond the technical assumptions in the theorems.

Suggested Citation

Jiequn Han & Ruimeng Hu & Jihao Long, 2020. "Convergence of Deep Fictitious Play for Stochastic Differential Games," Papers 2008.05519, arXiv.org, revised Mar 2021.

Handle: RePEc:arx:papers:2008.05519

Download full text from publisher

References listed on IDEAS

Milgrom, Paul & Roberts, John, 1991. "Adaptive and sophisticated learning in normal form games," Games and Economic Behavior, Elsevier, vol. 3(1), pages 82-100, February.
Berger, Ulrich, 2005. "Fictitious play in 2 x n games," Journal of Economic Theory, Elsevier, vol. 120(2), pages 139-154, February.
Bing Yu & Xiaojing Xing & Agus Sudjianto, 2019. "Deep-learning based numerical BSDE method for barrier options," Papers 1904.05921, arXiv.org.
A. Prasad & S. P. Sethi, 2004. "Competitive Advertising Under Uncertainty: A Stochastic Differential Game Approach," Journal of Optimization Theory and Applications, Springer, vol. 123(1), pages 163-185, October.
Philippe Casgrain & Brian Ning & Sebastian Jaimungal, 2019. "Deep Q-Learning for Nash Equilibria: Nash-DQN," Papers 1904.10554, arXiv.org, revised Oct 2022.
Zaiyan Wei & Mingfeng Lin, 2017. "Market Mechanisms in Online Peer-to-Peer Lending," Management Science, INFORMS, vol. 63(12), pages 4236-4257, December.
Ruimeng Hu, 2020. "Deep learning for ranking response surfaces with applications to optimal stopping problems," Quantitative Finance, Taylor & Francis Journals, vol. 20(9), pages 1567-1581, September.
N. El Karoui & S. Peng & M. C. Quenez, 1997. "Backward Stochastic Differential Equations in Finance," Mathematical Finance, Wiley Blackwell, vol. 7(1), pages 1-71, January.
Vijay Krishna & Tomas Sjöström, 1998. "On the Convergence of Fictitious Play," Mathematics of Operations Research, INFORMS, vol. 23(2), pages 479-511, May.
- Vijay Krishna & Tomas Sjostrom, 1995. "On the Convergence of Fictitious Play," Harvard Institute of Economic Research Working Papers 1717, Harvard - Institute of Economic Research.
- Vijay Krishna & Tomas Sjostrom, 1995. "On the Convergence of Fictitious Play," Game Theory and Information 9503003, University Library of Munich, Germany.
- Sjostrom, T. & Krishna, V., 1995. "On the Convergence of Ficticious Play," Papers 04-95-07, Pennsylvania State - Department of Economics.
- Vijay Krishna & T. Sjostrom, 2010. "On the Convergence of Fictitious Play," Levine's Working Paper Archive 417, David K. Levine.
Josef Hofbauer & William H. Sandholm, 2002. "On the Global Convergence of Stochastic Fictitious Play," Econometrica, Econometric Society, vol. 70(6), pages 2265-2294, November.
Ruimeng Hu, 2019. "Deep Learning for Ranking Response Surfaces with Applications to Optimal Stopping Problems," Papers 1901.03478, arXiv.org, revised Mar 2020.
Ngo Long, 2011. "Dynamic Games in the Economics of Natural Resources: A Survey," Dynamic Games and Applications, Springer, vol. 1(1), pages 115-148, March.
Dockner,Engelbert J. & Jorgensen,Steffen & Long,Ngo Van & Sorger,Gerhard, 2000. "Differential Games in Economics and Management Science," Cambridge Books, Cambridge University Press, number 9780521637329.
Justin Sirignano & Konstantinos Spiliopoulos, 2017. "DGM: A deep learning algorithm for solving partial differential equations," Papers 1708.07469, arXiv.org, revised Sep 2018.
Liu, He & Qiao, Han & Wang, Shouyang & Li, Yuze, 2019. "Platform Competition in Peer-to-Peer Lending Considering Risk Control Ability," European Journal of Operational Research, Elsevier, vol. 274(1), pages 280-290.
Monderer, Dov & Shapley, Lloyd S., 1996. "Fictitious Play Property for Games with Identical Interests," Journal of Economic Theory, Elsevier, vol. 68(1), pages 258-265, January.
Chen, Shumin & Yang, Hailiang & Zeng, Yan, 2018. "Stochastic Differential Games Between Two Insurers With Generalized Mean-Variance Premium Principle," ASTIN Bulletin, Cambridge University Press, vol. 48(1), pages 413-434, January.

Full references (including those not matched with items on IDEAS)

Citations

Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.

Cited by:

Robert Balkin & Hector D. Ceniceros & Ruimeng Hu, 2023. "Stochastic Delay Differential Games: Financial Modeling and Machine Learning Algorithms," Papers 2307.06450, arXiv.org.
Ming Min & Ruimeng Hu, 2021. "Signatured Deep Fictitious Play for Mean Field Games with Common Noise," Papers 2106.03272, arXiv.org.
Han, Jiequn & Hu, Ruimeng & Long, Jihao, 2023. "A class of dimension-free metrics for the convergence of empirical measures," Stochastic Processes and their Applications, Elsevier, vol. 164(C), pages 242-287.
Jiequn Han & Yucheng Yang & Weinan E, 2021. "DeepHAM: A Global Solution Method for Heterogeneous Agent Models with Aggregate Shocks," Papers 2112.14377, arXiv.org, revised Feb 2022.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Jiequn Han & Ruimeng Hu, 2019. "Deep Fictitious Play for Finding Markovian Nash Equilibrium in Multi-Agent Games," Papers 1912.01809, arXiv.org, revised Jun 2020.
Ulrich Berger, 2004. "Two More Classes of Games with the Fictitious Play Property," Game Theory and Information 0408003, University Library of Munich, Germany.
Ulrich Berger, 2004. "Some Notes on Learning in Games with Strategic Complementarities," Game Theory and Information 0409001, University Library of Munich, Germany.
Andriy Zapechelnyuk, 2009. "Limit Behavior of No-regret Dynamics," Discussion Papers 21, Kyiv School of Economics.
Berger, Ulrich, 2008. "Learning in games with strategic complementarities revisited," Journal of Economic Theory, Elsevier, vol. 143(1), pages 292-301, November.
Benaïm, Michel & Hofbauer, Josef & Hopkins, Ed, 2009. "Learning in games with unstable equilibria," Journal of Economic Theory, Elsevier, vol. 144(4), pages 1694-1709, July.
- Ed Hopkins & Josef Hofbauer & Michel Benaim, 2005. "Learning in Games with Unstable Equilibria," Edinburgh School of Economics Discussion Paper Series 135, Edinburgh School of Economics, University of Edinburgh.
- Michel Benaim & Josef Hofbauer & Ed Hopkins, 2006. "Learning in Games with Unstable Equilibria," Levine's Bibliography 321307000000000547, UCLA Department of Economics.
- Michel Benaim & Josef Hofbauer & Ed Hopkins, 2005. "Learning in Games with Unstable Equilibria," Levine's Bibliography 784828000000000609, UCLA Department of Economics.
Ricardo Josa-Fombellida & Juan Rincón-Zapatero, 2015. "Euler–Lagrange equations of stochastic differential games: application to a game of a productive asset," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 59(1), pages 61-108, May.
Hofbauer, Josef & Hopkins, Ed, 2005. "Learning in perturbed asymmetric games," Games and Economic Behavior, Elsevier, vol. 52(1), pages 133-152, July.
- Josef Hofbauer & Ed Hopkins, 2000. "Learning in Perturbed Asymmetric Games," Edinburgh School of Economics Discussion Paper Series 53, Edinburgh School of Economics, University of Edinburgh.
Ewerhart, Christian & Valkanova, Kremena, 2020. "Fictitious play in networks," Games and Economic Behavior, Elsevier, vol. 123(C), pages 182-206.
- Christian Ewerhart & Kremena Valkanova, 2016. "Fictitious play in networks," ECON - Working Papers 239, Department of Economics - University of Zurich, revised Jun 2019.
Leslie, David S. & Collins, E.J., 2006. "Generalised weakened fictitious play," Games and Economic Behavior, Elsevier, vol. 56(2), pages 285-298, August.
Berger, Ulrich, 2005. "Fictitious play in 2 x n games," Journal of Economic Theory, Elsevier, vol. 120(2), pages 139-154, February.
Hofbauer,J. & Sandholm,W.H., 2001. "Evolution and learning in games with randomly disturbed payoffs," Working papers 5, Wisconsin Madison - Social Systems.
- Josef Hofbauer & William H. Sandholm, 2001. "Evolution and Learning in Games with Randomly Disturbed Payoffs," Vienna Economics Papers vie0205, University of Vienna, Department of Economics.
Hofbauer,J. & Sandholm,W.H., 2001. "Evolution and learning in games with randomly disturbed payoffs," Working papers 5, Wisconsin Madison - Social Systems.
- Josef Hofbauer & William H. Sandholm, 2001. "Evolution and Learning in Games with Randomly Disturbed Payoffs," Vienna Economics Papers 0205, University of Vienna, Department of Economics.
Ulrich Berger, 2012. "Non-algebraic Convergence Proofs for Continuous-Time Fictitious Play," Dynamic Games and Applications, Springer, vol. 2(1), pages 4-17, March.
Francesco Caruso & Maria Carmela Ceparano & Jacqueline Morgan, 2020. "Best response algorithms in ratio-bounded games: convergence of affine relaxations to Nash equilibria," CSEF Working Papers 593, Centre for Studies in Economics and Finance (CSEF), University of Naples, Italy.
Sela, Aner, 2000. "Fictitious Play in 2 x 3 Games," Games and Economic Behavior, Elsevier, vol. 31(1), pages 152-162, April.
van Strien, Sebastian & Sparrow, Colin, 2011. "Fictitious play in 3x3 games: Chaos and dithering behaviour," Games and Economic Behavior, Elsevier, vol. 73(1), pages 262-286, September.
Berger, Ulrich, 2007. "Two more classes of games with the continuous-time fictitious play property," Games and Economic Behavior, Elsevier, vol. 60(2), pages 247-261, August.
Sparrow, Colin & van Strien, Sebastian & Harris, Christopher, 2008. "Fictitious play in 3x3 games: The transition between periodic and chaotic behaviour," Games and Economic Behavior, Elsevier, vol. 63(1), pages 259-291, May.
Berger, Ulrich, 2007. "Brown's original fictitious play," Journal of Economic Theory, Elsevier, vol. 135(1), pages 572-578, July.
- Ulrich Berger, 2005. "Brown's Original Fictitious Play," Game Theory and Information 0503008, University Library of Munich, Germany.

More about this item

NEP fields

This paper has been announced in the following NEP Reports:

NEP-BIG-2020-09-14 (Big Data)
NEP-CMP-2020-09-14 (Computational Economics)
NEP-GTH-2020-09-14 (Game Theory)

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2008.05519. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Convergence of Deep Fictitious Play for Stochastic Differential Games

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Citations

Most related items

More about this item

NEP fields

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data