IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2104.09368.html

Deep Reinforcement Learning in a Monetary Model

Author

Listed:
  • Mingli Chen
  • Andreas Joseph
  • Michael Kumhof
  • Xinlei Pan
  • Xuan Zhou

Abstract

We propose using deep reinforcement learning to solve dynamic stochastic general equilibrium models. Agents are represented by deep artificial neural networks and learn to solve their dynamic optimisation problem by interacting with the model environment, of which they have no a priori knowledge. Deep reinforcement learning offers a flexible yet principled way to model bounded rationality within this general class of models. We apply our proposed approach to a classical model from the adaptive learning literature in macroeconomics which looks at the interaction of monetary and fiscal policy. We find that, contrary to adaptive learning, the artificially intelligent household can solve the model in all policy regimes.

Suggested Citation

  • Mingli Chen & Andreas Joseph & Michael Kumhof & Xinlei Pan & Xuan Zhou, 2021. "Deep Reinforcement Learning in a Monetary Model," Papers 2104.09368, arXiv.org, revised Jan 2023.
  • Handle: RePEc:arx:papers:2104.09368
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2104.09368
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. George William Evans, 2001. "Expectations in Macroeconomics Adaptive versus Eductive Learning," Revue économique, Presses de Sciences-Po, vol. 52(3), pages 573-582.
    2. Sims, Christopher A., 2010. "Rational Inattention and Monetary Economics," Handbook of Monetary Economics, in: Benjamin M. Friedman & Michael Woodford (ed.), Handbook of Monetary Economics, edition 1, volume 3, chapter 4, pages 155-181, Elsevier.
    3. Blanchard, Olivier Jean & Kahn, Charles M, 1980. "The Solution of Linear Difference Models under Rational Expectations," Econometrica, Econometric Society, vol. 48(5), pages 1305-1311, July.
    4. Stefano Eusepi & Bruce Preston, 2018. "The Science of Monetary Policy: An Imperfect Knowledge Perspective," Journal of Economic Literature, American Economic Association, vol. 56(1), pages 3-59, March.
    5. Douglas Heaven, 2019. "Why deep-learning AIs are so easy to fool," Nature, Nature, vol. 574(7777), pages 163-166, October.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Rui & Shi, 2021. "Can an AI agent hit a moving target?," Papers 2110.02474, arXiv.org, revised Oct 2022.
    2. Heyang Ma & Qirui Mi & Qipeng Yang & Zijun Fan & Bo Li & Haifeng Zhang, 2025. "Think, Speak, Decide: Language-Augmented Multi-Agent Reinforcement Learning for Economic Decision-Making," Papers 2511.12876, arXiv.org, revised Mar 2026.
    3. Artem Kuriksha, 2021. "An Economy of Neural Networks: Learning from Heterogeneous Experiences," Papers 2110.11582, arXiv.org.
    4. Rui (Aruhan) Shi, 2021. "Learning from Zero: How to Make Consumption-Saving Decisions in a Stochastic Environment with an AI Algorithm," CESifo Working Paper Series 9255, CESifo.
    5. Yucheng Yang & Chiyuan Wang & Andreas Schaab & Benjamin Moll, 2025. "Structural Reinforcement Learning for Heterogeneous Agent Macroeconomics," Papers 2512.18892, arXiv.org.
    6. Jesús Fernández-Villaverde & Galo Nuño & Jesse Perla, 2024. "Taming the Curse of Dimensionality: Quantitative Economics with Deep Learning," NBER Working Papers 33117, National Bureau of Economic Research, Inc.
    7. Benjamin Moll, 2025. "The Trouble with Rational Expectations in Heterogeneous Agent Models: A Challenge for Macroeconomics," Papers 2508.20571, arXiv.org.
    8. Qirui Mi & Zhiyu Zhao & Chengdong Ma & Siyu Xia & Yan Song & Mengyue Yang & Jun Wang & Haifeng Zhang, 2024. "Learning Macroeconomic Policies through Dynamic Stackelberg Mean-Field Games," Papers 2403.12093, arXiv.org, revised Jun 2025.
    9. Qirui Mi & Qipeng Yang & Zijun Fan & Wentian Fan & Heyang Ma & Chengdong Ma & Siyu Xia & Bo An & Jun Wang & Haifeng Zhang, 2025. "EconGym: A Scalable AI Testbed with Diverse Economic Tasks," Papers 2506.12110, arXiv.org.
    10. Rui & Shi, 2021. "Learning from zero: how to make consumption-saving decisions in a stochastic environment with an AI algorithm," Papers 2105.10099, arXiv.org, revised Feb 2022.
    11. Hinterlang, Natascha & Tänzer, Alina, 2021. "Optimal monetary policy using reinforcement learning," Discussion Papers 51/2021, Deutsche Bundesbank.
    12. Michael Curry & Alexander Trott & Soham Phade & Yu Bai & Stephan Zheng, 2022. "Analyzing Micro-Founded General Equilibrium Models with Many Agents using Deep Reinforcement Learning," Papers 2201.01163, arXiv.org, revised Feb 2022.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jean-Paul L’Huillier & Sanjay R Singh & Donghoon Yoo, 2024. "Incorporating Diagnostic Expectations into the New Keynesian Framework," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 91(5), pages 3013-3046.
    2. Anufriev, Mikhail & Duffy, John & Panchenko, Valentyn, 2022. "Learning in two-dimensional beauty contest games: Theory and experimental evidence," Journal of Economic Theory, Elsevier, vol. 201(C).
    3. Poledna, Sebastian & Miess, Michael Gregor & Hommes, Cars & Rabitsch, Katrin, 2023. "Economic forecasting with an agent-based model," European Economic Review, Elsevier, vol. 151(C).
    4. Grimaud, Alex & Salle, Isabelle & Vermandel, Gauthier, 2025. "A Dynare toolbox for social learning expectations," Journal of Economic Dynamics and Control, Elsevier, vol. 172(C).
    5. Warne, Anders, 2023. "DSGE model forecasting: rational expectations vs. adaptive learning," Working Paper Series 2768, European Central Bank.
    6. Chatelain, Jean-Bernard & Ralf, Kirsten, 2021. "Hopf Bifurcation From New-Keynesian Taylor Rule To Ramsey Optimal Policy," Macroeconomic Dynamics, Cambridge University Press, vol. 25(8), pages 2204-2236, December.
    7. Cole, Stephen J., 2020. "The influence of learning and price-level targeting on central bank forward guidance," Journal of Macroeconomics, Elsevier, vol. 65(C).
    8. Pfajfar, Damjan & Santoro, Emiliano, 2014. "Credit Market Distortions, Asset Prices And Monetary Policy," Macroeconomic Dynamics, Cambridge University Press, vol. 18(3), pages 631-650, April.
    9. Mitsuhiro Okano, 2026. "A Unified Framework for Equilibrium Selection in DSGE Models," Papers 2601.19329, arXiv.org.
    10. Goy, Gavin & Hommes, Cars & Mavromatis, Kostas, 2022. "Forward guidance and the role of central bank credibility under heterogeneous beliefs," Journal of Economic Behavior & Organization, Elsevier, vol. 200(C), pages 1240-1274.
    11. Ashwin, Julian & Beaudry, Paul & Ellison, Martin, 2025. "Neural network learning for nonlinear economies," Journal of Monetary Economics, Elsevier, vol. 149(C).
    12. García-Cicco, Javier, 2022. "Alternative monetary-policy instruments and limited credibility: An exploration," Latin American Journal of Central Banking (previously Monetaria), Elsevier, vol. 3(1).
    13. André Marine Charlotte & Dai Meixing, 2018. "Learning, robust monetary policy and the merit of precaution," The B.E. Journal of Macroeconomics, De Gruyter, vol. 18(2), pages 1-20, June.
    14. Elton Beqiraj & Giovanni Di Bartolomeo & Marco Di Pietro & Carolina Serpieri, 2020. "Bounded rationality and heterogeneous expectations: Euler versus anticipated-utility approach," Journal of Economics, Springer, vol. 130(3), pages 249-273, August.
    15. Gauti B. Eggertsson & Sergei K. Egiev, 2025. "Liquidity Traps: A Unified Theory of the Great Depression and the Great Recession," Journal of Economic Literature, American Economic Association, vol. 63(4), pages 1424-1551, December.
    16. Marine Charlotte André & Meixing Dai, 2017. "Can inflation contract discipline central bankers when agents are learning?," Working Papers of BETA 2017-25, Bureau d'Economie Théorique et Appliquée, UDS, Strasbourg.
    17. Lindé, J. & Smets, F. & Wouters, R., 2016. "Challenges for Central Banks’ Macro Models," Handbook of Macroeconomics, in: J. B. Taylor & Harald Uhlig (ed.), Handbook of Macroeconomics, edition 1, volume 2, chapter 0, pages 2185-2262, Elsevier.
    18. Evans, David & Evans, George W. & McGough, Bruce, 2022. "The RPEs of RBCs and other DSGEs," Journal of Economic Dynamics and Control, Elsevier, vol. 143(C).
    19. Duffy, John & Shin, Michael, 2024. "Heterogeneous experience and constant-gain learning," Journal of Economic Dynamics and Control, Elsevier, vol. 164(C).
    20. James Bullard & Eric Schaling, 2009. "Monetary Policy, Determinacy, and Learnability in a Two-Block World Economy," Journal of Money, Credit and Banking, Blackwell Publishing, vol. 41(8), pages 1585-1612, December.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2104.09368. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.