IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2008.05519.html
   My bibliography  Save this paper

Convergence of Deep Fictitious Play for Stochastic Differential Games

Author

Listed:
  • Jiequn Han
  • Ruimeng Hu
  • Jihao Long

Abstract

Stochastic differential games have been used extensively to model agents' competitions in Finance, for instance, in P2P lending platforms from the Fintech industry, the banking system for systemic risk, and insurance markets. The recently proposed machine learning algorithm, deep fictitious play, provides a novel efficient tool for finding Markovian Nash equilibrium of large $N$-player asymmetric stochastic differential games [J. Han and R. Hu, Mathematical and Scientific Machine Learning Conference, pages 221-245, PMLR, 2020]. By incorporating the idea of fictitious play, the algorithm decouples the game into $N$ sub-optimization problems, and identifies each player's optimal strategy with the deep backward stochastic differential equation (BSDE) method parallelly and repeatedly. In this paper, we prove the convergence of deep fictitious play (DFP) to the true Nash equilibrium. We can also show that the strategy based on DFP forms an $\eps$-Nash equilibrium. We generalize the algorithm by proposing a new approach to decouple the games, and present numerical results of large population games showing the empirical convergence of the algorithm beyond the technical assumptions in the theorems.

Suggested Citation

  • Jiequn Han & Ruimeng Hu & Jihao Long, 2020. "Convergence of Deep Fictitious Play for Stochastic Differential Games," Papers 2008.05519, arXiv.org, revised Mar 2021.
  • Handle: RePEc:arx:papers:2008.05519
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2008.05519
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Milgrom, Paul & Roberts, John, 1991. "Adaptive and sophisticated learning in normal form games," Games and Economic Behavior, Elsevier, vol. 3(1), pages 82-100, February.
    2. Berger, Ulrich, 2005. "Fictitious play in 2 x n games," Journal of Economic Theory, Elsevier, vol. 120(2), pages 139-154, February.
    3. Bing Yu & Xiaojing Xing & Agus Sudjianto, 2019. "Deep-learning based numerical BSDE method for barrier options," Papers 1904.05921, arXiv.org.
    4. A. Prasad & S. P. Sethi, 2004. "Competitive Advertising Under Uncertainty: A Stochastic Differential Game Approach," Journal of Optimization Theory and Applications, Springer, vol. 123(1), pages 163-185, October.
    5. Philippe Casgrain & Brian Ning & Sebastian Jaimungal, 2019. "Deep Q-Learning for Nash Equilibria: Nash-DQN," Papers 1904.10554, arXiv.org, revised Oct 2022.
    6. Zaiyan Wei & Mingfeng Lin, 2017. "Market Mechanisms in Online Peer-to-Peer Lending," Management Science, INFORMS, vol. 63(12), pages 4236-4257, December.
    7. Ruimeng Hu, 2020. "Deep learning for ranking response surfaces with applications to optimal stopping problems," Quantitative Finance, Taylor & Francis Journals, vol. 20(9), pages 1567-1581, September.
    8. N. El Karoui & S. Peng & M. C. Quenez, 1997. "Backward Stochastic Differential Equations in Finance," Mathematical Finance, Wiley Blackwell, vol. 7(1), pages 1-71, January.
    9. Vijay Krishna & Tomas Sjöström, 1998. "On the Convergence of Fictitious Play," Mathematics of Operations Research, INFORMS, vol. 23(2), pages 479-511, May.
    10. Josef Hofbauer & William H. Sandholm, 2002. "On the Global Convergence of Stochastic Fictitious Play," Econometrica, Econometric Society, vol. 70(6), pages 2265-2294, November.
    11. Ruimeng Hu, 2019. "Deep Learning for Ranking Response Surfaces with Applications to Optimal Stopping Problems," Papers 1901.03478, arXiv.org, revised Mar 2020.
    12. Ngo Long, 2011. "Dynamic Games in the Economics of Natural Resources: A Survey," Dynamic Games and Applications, Springer, vol. 1(1), pages 115-148, March.
    13. Dockner,Engelbert J. & Jorgensen,Steffen & Long,Ngo Van & Sorger,Gerhard, 2000. "Differential Games in Economics and Management Science," Cambridge Books, Cambridge University Press, number 9780521637329.
    14. Justin Sirignano & Konstantinos Spiliopoulos, 2017. "DGM: A deep learning algorithm for solving partial differential equations," Papers 1708.07469, arXiv.org, revised Sep 2018.
    15. Liu, He & Qiao, Han & Wang, Shouyang & Li, Yuze, 2019. "Platform Competition in Peer-to-Peer Lending Considering Risk Control Ability," European Journal of Operational Research, Elsevier, vol. 274(1), pages 280-290.
    16. Monderer, Dov & Shapley, Lloyd S., 1996. "Fictitious Play Property for Games with Identical Interests," Journal of Economic Theory, Elsevier, vol. 68(1), pages 258-265, January.
    17. Chen, Shumin & Yang, Hailiang & Zeng, Yan, 2018. "Stochastic Differential Games Between Two Insurers With Generalized Mean-Variance Premium Principle," ASTIN Bulletin, Cambridge University Press, vol. 48(1), pages 413-434, January.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Robert Balkin & Hector D. Ceniceros & Ruimeng Hu, 2023. "Stochastic Delay Differential Games: Financial Modeling and Machine Learning Algorithms," Papers 2307.06450, arXiv.org.
    2. Ming Min & Ruimeng Hu, 2021. "Signatured Deep Fictitious Play for Mean Field Games with Common Noise," Papers 2106.03272, arXiv.org.
    3. Han, Jiequn & Hu, Ruimeng & Long, Jihao, 2023. "A class of dimension-free metrics for the convergence of empirical measures," Stochastic Processes and their Applications, Elsevier, vol. 164(C), pages 242-287.
    4. Jiequn Han & Yucheng Yang & Weinan E, 2021. "DeepHAM: A Global Solution Method for Heterogeneous Agent Models with Aggregate Shocks," Papers 2112.14377, arXiv.org, revised Feb 2022.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jiequn Han & Ruimeng Hu, 2019. "Deep Fictitious Play for Finding Markovian Nash Equilibrium in Multi-Agent Games," Papers 1912.01809, arXiv.org, revised Jun 2020.
    2. Ulrich Berger, 2004. "Two More Classes of Games with the Fictitious Play Property," Game Theory and Information 0408003, University Library of Munich, Germany.
    3. Ulrich Berger, 2004. "Some Notes on Learning in Games with Strategic Complementarities," Game Theory and Information 0409001, University Library of Munich, Germany.
    4. Andriy Zapechelnyuk, 2009. "Limit Behavior of No-regret Dynamics," Discussion Papers 21, Kyiv School of Economics.
    5. Berger, Ulrich, 2008. "Learning in games with strategic complementarities revisited," Journal of Economic Theory, Elsevier, vol. 143(1), pages 292-301, November.
    6. Benaïm, Michel & Hofbauer, Josef & Hopkins, Ed, 2009. "Learning in games with unstable equilibria," Journal of Economic Theory, Elsevier, vol. 144(4), pages 1694-1709, July.
    7. Ricardo Josa-Fombellida & Juan Rincón-Zapatero, 2015. "Euler–Lagrange equations of stochastic differential games: application to a game of a productive asset," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 59(1), pages 61-108, May.
    8. Hofbauer, Josef & Hopkins, Ed, 2005. "Learning in perturbed asymmetric games," Games and Economic Behavior, Elsevier, vol. 52(1), pages 133-152, July.
    9. Ewerhart, Christian & Valkanova, Kremena, 2020. "Fictitious play in networks," Games and Economic Behavior, Elsevier, vol. 123(C), pages 182-206.
    10. Leslie, David S. & Collins, E.J., 2006. "Generalised weakened fictitious play," Games and Economic Behavior, Elsevier, vol. 56(2), pages 285-298, August.
    11. Berger, Ulrich, 2005. "Fictitious play in 2 x n games," Journal of Economic Theory, Elsevier, vol. 120(2), pages 139-154, February.
    12. Hofbauer,J. & Sandholm,W.H., 2001. "Evolution and learning in games with randomly disturbed payoffs," Working papers 5, Wisconsin Madison - Social Systems.
    13. Hofbauer,J. & Sandholm,W.H., 2001. "Evolution and learning in games with randomly disturbed payoffs," Working papers 5, Wisconsin Madison - Social Systems.
    14. Ulrich Berger, 2012. "Non-algebraic Convergence Proofs for Continuous-Time Fictitious Play," Dynamic Games and Applications, Springer, vol. 2(1), pages 4-17, March.
    15. Francesco Caruso & Maria Carmela Ceparano & Jacqueline Morgan, 2020. "Best response algorithms in ratio-bounded games: convergence of affine relaxations to Nash equilibria," CSEF Working Papers 593, Centre for Studies in Economics and Finance (CSEF), University of Naples, Italy.
    16. Sela, Aner, 2000. "Fictitious Play in 2 x 3 Games," Games and Economic Behavior, Elsevier, vol. 31(1), pages 152-162, April.
    17. van Strien, Sebastian & Sparrow, Colin, 2011. "Fictitious play in 3x3 games: Chaos and dithering behaviour," Games and Economic Behavior, Elsevier, vol. 73(1), pages 262-286, September.
    18. Berger, Ulrich, 2007. "Two more classes of games with the continuous-time fictitious play property," Games and Economic Behavior, Elsevier, vol. 60(2), pages 247-261, August.
    19. Sparrow, Colin & van Strien, Sebastian & Harris, Christopher, 2008. "Fictitious play in 3x3 games: The transition between periodic and chaotic behaviour," Games and Economic Behavior, Elsevier, vol. 63(1), pages 259-291, May.
    20. Berger, Ulrich, 2007. "Brown's original fictitious play," Journal of Economic Theory, Elsevier, vol. 135(1), pages 572-578, July.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2008.05519. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.