IDEAS home Printed from https://ideas.repec.org/p/arx/papers/1912.01809.html
   My bibliography  Save this paper

Deep Fictitious Play for Finding Markovian Nash Equilibrium in Multi-Agent Games

Author

Listed:
  • Jiequn Han
  • Ruimeng Hu

Abstract

We propose a deep neural network-based algorithm to identify the Markovian Nash equilibrium of general large $N$-player stochastic differential games. Following the idea of fictitious play, we recast the $N$-player game into $N$ decoupled decision problems (one for each player) and solve them iteratively. The individual decision problem is characterized by a semilinear Hamilton-Jacobi-Bellman equation, to solve which we employ the recently developed deep BSDE method. The resulted algorithm can solve large $N$-player games for which conventional numerical methods would suffer from the curse of dimensionality. Multiple numerical examples involving identical or heterogeneous agents, with risk-neutral or risk-sensitive objectives, are tested to validate the accuracy of the proposed algorithm in large group games. Even for a fifty-player game with the presence of common noise, the proposed algorithm still finds the approximate Nash equilibrium accurately, which, to our best knowledge, is difficult to achieve by other numerical algorithms.

Suggested Citation

  • Jiequn Han & Ruimeng Hu, 2019. "Deep Fictitious Play for Finding Markovian Nash Equilibrium in Multi-Agent Games," Papers 1912.01809, arXiv.org, revised Jun 2020.
  • Handle: RePEc:arx:papers:1912.01809
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/1912.01809
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Milgrom, Paul & Roberts, John, 1991. "Adaptive and sophisticated learning in normal form games," Games and Economic Behavior, Elsevier, vol. 3(1), pages 82-100, February.
    2. Berger, Ulrich, 2005. "Fictitious play in 2 x n games," Journal of Economic Theory, Elsevier, vol. 120(2), pages 139-154, February.
    3. Jordan J. S., 1993. "Three Problems in Learning Mixed-Strategy Nash Equilibria," Games and Economic Behavior, Elsevier, vol. 5(3), pages 368-386, July.
    4. Yves Achdou & Jiequn Han & Jean-Michel Lasry & Pierre-Louis Lions & Benjamin Moll, 2017. "Income and Wealth Distribution in Macroeconomics: A Continuous-Time Approach," NBER Working Papers 23732, National Bureau of Economic Research, Inc.
    5. A. Prasad & S. P. Sethi, 2004. "Competitive Advertising Under Uncertainty: A Stochastic Differential Game Approach," Journal of Optimization Theory and Applications, Springer, vol. 123(1), pages 163-185, October.
    6. Pierre Cardaliaguet & Charles-Albert Lehalle, 2016. "Mean Field Game of Controls and An Application To Trade Crowding," Papers 1610.09904, arXiv.org, revised Sep 2017.
    7. Xun Gao & Lu-Ming Duan, 2017. "Efficient representation of quantum many-body states with deep neural networks," Nature Communications, Nature, vol. 8(1), pages 1-6, December.
    8. Vijay Krishna & Tomas Sjöström, 1998. "On the Convergence of Fictitious Play," Mathematics of Operations Research, INFORMS, vol. 23(2), pages 479-511, May.
    9. Josef Hofbauer & William H. Sandholm, 2002. "On the Global Convergence of Stochastic Fictitious Play," Econometrica, Econometric Society, vol. 70(6), pages 2265-2294, November.
    10. Ngo Long, 2011. "Dynamic Games in the Economics of Natural Resources: A Survey," Dynamic Games and Applications, Springer, vol. 1(1), pages 115-148, March.
    11. Dockner,Engelbert J. & Jorgensen,Steffen & Long,Ngo Van & Sorger,Gerhard, 2000. "Differential Games in Economics and Management Science," Cambridge Books, Cambridge University Press, number 9780521637329.
    12. Darryl A. Seale & John E. Burnett, 2006. "Solving Large Games With Simulated Fictitious Play," International Game Theory Review (IGTR), World Scientific Publishing Co. Pte. Ltd., vol. 8(03), pages 437-467.
    13. Monderer, Dov & Sela, Aner, 1996. "A2 x 2Game without the Fictitious Play Property," Games and Economic Behavior, Elsevier, vol. 14(1), pages 144-148, May.
    14. Monderer, Dov & Shapley, Lloyd S., 1996. "Fictitious Play Property for Games with Identical Interests," Journal of Economic Theory, Elsevier, vol. 68(1), pages 258-265, January.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Xiangdong Liu & Yu Gu, 2023. "Study of Pricing of High-Dimensional Financial Derivatives Based on Deep Learning," Mathematics, MDPI, vol. 11(12), pages 1-16, June.
    2. Ming Min & Ruimeng Hu, 2021. "Signatured Deep Fictitious Play for Mean Field Games with Common Noise," Papers 2106.03272, arXiv.org.
    3. Steven Campbell & Yichao Chen & Arvind Shrivats & Sebastian Jaimungal, 2021. "Deep Learning for Principal-Agent Mean Field Games," Papers 2110.01127, arXiv.org.
    4. Han, Jiequn & Hu, Ruimeng & Long, Jihao, 2023. "A class of dimension-free metrics for the convergence of empirical measures," Stochastic Processes and their Applications, Elsevier, vol. 164(C), pages 242-287.
    5. Jiequn Han & Yucheng Yang & Weinan E, 2021. "DeepHAM: A Global Solution Method for Heterogeneous Agent Models with Aggregate Shocks," Papers 2112.14377, arXiv.org, revised Feb 2022.
    6. Sebastian Jaimungal, 2022. "Reinforcement learning and stochastic optimisation," Finance and Stochastics, Springer, vol. 26(1), pages 103-129, January.
    7. Jiequn Han & Ruimeng Hu, 2021. "Recurrent Neural Networks for Stochastic Control Problems with Delay," Papers 2101.01385, arXiv.org, revised Jun 2021.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jiequn Han & Ruimeng Hu & Jihao Long, 2020. "Convergence of Deep Fictitious Play for Stochastic Differential Games," Papers 2008.05519, arXiv.org, revised Mar 2021.
    2. Ulrich Berger, 2004. "Two More Classes of Games with the Fictitious Play Property," Game Theory and Information 0408003, University Library of Munich, Germany.
    3. Ulrich Berger, 2004. "Some Notes on Learning in Games with Strategic Complementarities," Game Theory and Information 0409001, University Library of Munich, Germany.
    4. Ewerhart, Christian & Valkanova, Kremena, 2020. "Fictitious play in networks," Games and Economic Behavior, Elsevier, vol. 123(C), pages 182-206.
    5. Berger, Ulrich, 2005. "Fictitious play in 2 x n games," Journal of Economic Theory, Elsevier, vol. 120(2), pages 139-154, February.
    6. Andriy Zapechelnyuk, 2009. "Limit Behavior of No-regret Dynamics," Discussion Papers 21, Kyiv School of Economics.
    7. Sela, Aner, 2000. "Fictitious Play in 2 x 3 Games," Games and Economic Behavior, Elsevier, vol. 31(1), pages 152-162, April.
    8. Berger, Ulrich, 2008. "Learning in games with strategic complementarities revisited," Journal of Economic Theory, Elsevier, vol. 143(1), pages 292-301, November.
    9. van Strien, Sebastian & Sparrow, Colin, 2011. "Fictitious play in 3x3 games: Chaos and dithering behaviour," Games and Economic Behavior, Elsevier, vol. 73(1), pages 262-286, September.
    10. Berger, Ulrich, 2007. "Two more classes of games with the continuous-time fictitious play property," Games and Economic Behavior, Elsevier, vol. 60(2), pages 247-261, August.
    11. Berger, Ulrich, 2007. "Brown's original fictitious play," Journal of Economic Theory, Elsevier, vol. 135(1), pages 572-578, July.
    12. Candogan, Ozan & Ozdaglar, Asuman & Parrilo, Pablo A., 2013. "Dynamics in near-potential games," Games and Economic Behavior, Elsevier, vol. 82(C), pages 66-90.
    13. Benaïm, Michel & Hofbauer, Josef & Hopkins, Ed, 2009. "Learning in games with unstable equilibria," Journal of Economic Theory, Elsevier, vol. 144(4), pages 1694-1709, July.
    14. Hopkins, Ed, 1999. "Learning, Matching, and Aggregation," Games and Economic Behavior, Elsevier, vol. 26(1), pages 79-110, January.
    15. Hofbauer, Josef & Hopkins, Ed, 2005. "Learning in perturbed asymmetric games," Games and Economic Behavior, Elsevier, vol. 52(1), pages 133-152, July.
    16. Sobel, Joel, 2000. "Economists' Models of Learning," Journal of Economic Theory, Elsevier, vol. 94(2), pages 241-261, October.
    17. Leslie, David S. & Collins, E.J., 2006. "Generalised weakened fictitious play," Games and Economic Behavior, Elsevier, vol. 56(2), pages 285-298, August.
    18. Monderer, Dov & Samet, Dov & Sela, Aner, 1997. "Belief Affirming in Learning Processes," Journal of Economic Theory, Elsevier, vol. 73(2), pages 438-452, April.
    19. Hofbauer,J. & Sandholm,W.H., 2001. "Evolution and learning in games with randomly disturbed payoffs," Working papers 5, Wisconsin Madison - Social Systems.
    20. Ozan Candogan & Ishai Menache & Asuman Ozdaglar & Pablo A. Parrilo, 2011. "Flows and Decompositions of Games: Harmonic and Potential Games," Mathematics of Operations Research, INFORMS, vol. 36(3), pages 474-503, August.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:1912.01809. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.