IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2512.18892.html

Structural Reinforcement Learning for Heterogeneous Agent Macroeconomics

Author

Listed:
  • Yucheng Yang
  • Chiyuan Wang
  • Andreas Schaab
  • Benjamin Moll

Abstract

We present a new approach to formulating and solving heterogeneous agent models with aggregate risk. We replace the cross-sectional distribution with low-dimensional prices as state variables and let agents learn equilibrium price dynamics directly from simulated paths. To do so, we introduce a structural reinforcement learning (SRL) method which treats prices via simulation while exploiting agents' structural knowledge of their own individual dynamics. Our SRL method yields a general and highly efficient global solution method for heterogeneous agent models that sidesteps the Master equation and handles problems traditional methods struggle with, in particular nontrivial market-clearing conditions. We illustrate the approach in the Krusell-Smith model, the Huggett model with aggregate shocks, and a HANK model with a forward-looking Phillips curve, all of which we solve globally within minutes.

Suggested Citation

  • Yucheng Yang & Chiyuan Wang & Andreas Schaab & Benjamin Moll, 2025. "Structural Reinforcement Learning for Heterogeneous Agent Macroeconomics," Papers 2512.18892, arXiv.org.
  • Handle: RePEc:arx:papers:2512.18892
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2512.18892
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Fernández-Villaverde, Jesús & Ebrahimi Kahou, Mahdi & Perla, Jesse & Sood, Arnav, 2021. "Exploiting Symmetry in High-Dimensional Dynamic Programming," CEPR Discussion Papers 16285, C.E.P.R. Discussion Papers.
    2. Roth, Alvin E. & Erev, Ido, 1995. "Learning in extensive-form games: Experimental data and simple dynamic models in the intermediate term," Games and Economic Behavior, Elsevier, vol. 8(1), pages 164-212.
    3. Young, Eric R., 2010. "Solving the incomplete markets model with aggregate uncertainty using the Krusell-Smith algorithm and non-stochastic simulations," Journal of Economic Dynamics and Control, Elsevier, vol. 34(1), pages 36-41, January.
    4. George William Evans, 2001. "Expectations in Macroeconomics Adaptive versus Eductive Learning," Revue économique, Presses de Sciences-Po, vol. 52(3), pages 573-582.
    5. Giusto, Andrea, 2014. "Adaptive learning and distributional dynamics in an incomplete markets model," Journal of Economic Dynamics and Control, Elsevier, vol. 40(C), pages 317-333.
    6. Victor Duarte & Diogo Duarte & Dejanir H Silva, 2024. "Machine Learning for Continuous-Time Finance," The Review of Financial Studies, Society for Financial Studies, vol. 37(11), pages 3217-3271.
    7. Sargent, Thomas J., 1991. "Equilibrium with signal extraction from endogenous variables," Journal of Economic Dynamics and Control, Elsevier, vol. 15(2), pages 245-273, April.
    8. Marcet, Albert & Sargent, Thomas J., 1989. "Convergence of least squares learning mechanisms in self-referential linear stochastic models," Journal of Economic Theory, Elsevier, vol. 48(2), pages 337-368, August.
    9. Federico Gabriele & Aldo Glielmo & Marco Taboga, 2025. "Heterogeneous RBCs via Deep Multi-Agent Reinforcement Learning," Papers 2510.12272, arXiv.org, revised Feb 2026.
    10. Kase, Hanno & Melosi, Leonardo & Rottner, Matthias, 2022. "Estimating Nonlinear Heterogeneous Agents Models with Neural Networks," CEPR Discussion Papers 17391, C.E.P.R. Discussion Papers.
    11. Jésus Fernández-Villaverde & Galo Nuño & Jesse Perla & Jesús Fernández-Villaverde, 2024. "Taming the Curse of Dimensionality: Quantitative Economics with Deep Learning," CESifo Working Paper Series 11448, CESifo.
    12. Jonathan Heathcote & Kjetil Storesletten & Giovanni L. Violante, 2009. "Quantitative Macroeconomics with Heterogeneous Households," Annual Review of Economics, Annual Reviews, vol. 1(1), pages 319-354, May.
    13. Greg Kaplan & Kurt Mitman & Giovanni L. Violante, 2020. "The Housing Boom and Bust: Model Meets Evidence," Journal of Political Economy, University of Chicago Press, vol. 128(9), pages 3285-3345.
    14. David Silver & Aja Huang & Chris J. Maddison & Arthur Guez & Laurent Sifre & George van den Driessche & Julian Schrittwieser & Ioannis Antonoglou & Veda Panneershelvam & Marc Lanctot & Sander Dieleman, 2016. "Mastering the game of Go with deep neural networks and tree search," Nature, Nature, vol. 529(7587), pages 484-489, January.
    15. SeHyoun Ahn & Greg Kaplan & Benjamin Moll & Thomas Winberry & Christian Wolf, 2018. "When Inequality Matters for Macro and Macro Matters for Inequality," NBER Macroeconomics Annual, University of Chicago Press, vol. 32(1), pages 1-75.
    16. Donghoon Lee & Kenneth I. Wolpin, 2006. "Intersectoral Labor Mobility and the Growth of the Service Sector," Econometrica, Econometric Society, vol. 74(1), pages 1-46, January.
    17. David Silver & Julian Schrittwieser & Karen Simonyan & Ioannis Antonoglou & Aja Huang & Arthur Guez & Thomas Hubert & Lucas Baker & Matthew Lai & Adrian Bolton & Yutian Chen & Timothy Lillicrap & Fan , 2017. "Mastering the game of Go without human knowledge," Nature, Nature, vol. 550(7676), pages 354-359, October.
    18. Nicholas C. Barberis & Lawrence J. Jin, 2023. "Model-free and Model-based Learning as Joint Drivers of Investor Behavior," NBER Working Papers 31081, National Bureau of Economic Research, Inc.
    19. Victor Duarte & Diogo Duarte & Dejanir H. Silva, 2024. "Machine Learning for Continuous-Time Finance," CESifo Working Paper Series 10909, CESifo.
    20. Adrien Auclert & Bence Bardóczy & Matthew Rognlie & Ludwig Straub, 2021. "Using the Sequence‐Space Jacobian to Solve and Estimate Heterogeneous‐Agent Models," Econometrica, Econometric Society, vol. 89(5), pages 2375-2408, September.
    21. Mingli Chen & Andreas Joseph & Michael Kumhof & Xinlei Pan & Xuan Zhou, 2021. "Deep Reinforcement Learning in a Monetary Model," Papers 2104.09368, arXiv.org, revised Jan 2023.
    22. Benjamin Moll, 2025. "The Trouble with Rational Expectations in Heterogeneous Agent Models: A Challenge for Macroeconomics," Papers 2508.20571, arXiv.org.
    23. Marlon Azinovic-Yang & Jan Zemlicka, 2025. "Deep Learning in the Sequence Space," CERGE-EI Working Papers wp802, The Center for Economic Research and Graduate Education - Economics Institute, Prague.
    24. Andrew Caplin & Mark Dean, 2008. "Dopamine, Reward Prediction Error, and Economics," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 123(2), pages 663-701.
    25. S. Rao Aiyagari, 1994. "Uninsured Idiosyncratic Risk and Aggregate Saving," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 109(3), pages 659-684.
    26. Francisco Gomes & Alexander Michaelides, 2008. "Asset Pricing with Limited Risk Sharing and Heterogeneous Agents," The Review of Financial Studies, Society for Financial Studies, vol. 21(1), pages 415-448, January.
    27. Zhouzhou Gu & Mathieu Lauri`ere & Sebastian Merkel & Jonathan Payne, 2024. "Global Solutions to Master Equations for Continuous Time Heterogeneous Agent Macroeconomic Models," Papers 2406.13726, arXiv.org.
    28. Ulrike Malmendier & Stefan Nagel, 2011. "Depression Babies: Do Macroeconomic Experiences Affect Risk Taking?," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 126(1), pages 373-416.
    29. Adrien Bilal, 2023. "Solving Heterogeneous Agent Models with the Master Equation," NBER Working Papers 31103, National Bureau of Economic Research, Inc.
    30. Kjetil Storesletten & Chris Telmer & Amir Yaron, 2007. "Asset Pricing with Idiosyncratic Risk and Overlapping Generations," Review of Economic Dynamics, Elsevier for the Society for Economic Dynamics, vol. 10(4), pages 519-548, October.
    31. Huggett, Mark, 1993. "The risk-free rate in heterogeneous-agent incomplete-insurance economies," Journal of Economic Dynamics and Control, Elsevier, vol. 17(5-6), pages 953-969.
    32. Marlon Azinovic-Yang & Jan v{Z}emliv{c}ka, 2025. "Deep Learning in the Sequence Space," Papers 2509.13623, arXiv.org, revised Mar 2026.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Benjamin Moll, 2025. "The Trouble with Rational Expectations in Heterogeneous Agent Models: A Challenge for Macroeconomics," Papers 2508.20571, arXiv.org.
    2. Jesús Fernández-Villaverde & Galo Nuño & Jesse Perla, 2024. "Taming the Curse of Dimensionality: Quantitative Economics with Deep Learning," NBER Working Papers 33117, National Bureau of Economic Research, Inc.
    3. Li, Mingzhe, 2025. "A Theory of Portfolio Choice for Heterogeneous Investors," MPRA Paper 126642, University Library of Munich, Germany, revised 29 Oct 2025.
    4. Hanno Kase & Matthias Rottner & Fabio Stohler, 2025. "Generative economic modeling," BIS Working Papers 1312, Bank for International Settlements.
    5. Zhouzhou Gu & Mathieu Lauri`ere & Sebastian Merkel & Jonathan Payne, 2024. "Global Solutions to Master Equations for Continuous Time Heterogeneous Agent Macroeconomic Models," Papers 2406.13726, arXiv.org.
    6. Krueger, D. & Mitman, K. & Perri, F., 2016. "Macroeconomics and Household Heterogeneity," Handbook of Macroeconomics, in: J. B. Taylor & Harald Uhlig (ed.), Handbook of Macroeconomics, edition 1, volume 2, chapter 0, pages 843-921, Elsevier.
    7. Chipeniuk, Karsten O. & Katz, Nets Hawk & Walker, Todd B., 2022. "Households, auctioneers, and aggregation," European Economic Review, Elsevier, vol. 141(C).
    8. Schesch, Constantin, 2024. "Pseudospectral methods for continuous-time heterogeneous-agent models," Journal of Economic Dynamics and Control, Elsevier, vol. 163(C).
    9. Jesús Fernández‐Villaverde & Samuel Hurtado & Galo Nuño, 2023. "Financial Frictions and the Wealth Distribution," Econometrica, Econometric Society, vol. 91(3), pages 869-901, May.
    10. Fernández-Villaverde, Jesús & Marbet, Joël & Nuño, Galo & Rachedi, Omar, 2025. "Inequality and the zero lower bound," Journal of Econometrics, Elsevier, vol. 249(PC).
    11. Ivo Bakota, 2023. "Market Clearing and Krusell-Smith Algorithm in an Economy with Multiple Assets," Computational Economics, Springer;Society for Computational Economics, vol. 62(3), pages 1007-1045, October.
    12. Evans, David & Li, Jungang & McGough, Bruce, 2023. "Local rationality," Journal of Economic Behavior & Organization, Elsevier, vol. 205(C), pages 216-236.
    13. Darougheh, Saman & Faccini, Renato & Melosi, Leonardo & Villa, Alessandro T., 2024. "On-the-Job Search and Inflation under the Microscope," The Warwick Economics Research Paper Series (TWERPS) 1536, University of Warwick, Department of Economics.
    14. Darracq Pariès, Matthieu & Notarpietro, Alessandro & Kilponen, Juha & Papadopoulou, Niki & Zimic, Srečko & Aldama, Pierre & Langenus, Geert & Alvarez, Luis Julian & Lemoine, Matthieu & Angelini, Elena, 2021. "Review of macroeconomic modelling in the Eurosystem: current practices and scope for improvement," Occasional Paper Series 267, European Central Bank.
    15. Grimaud, Alex, 2021. "Precautionary saving and un-anchored expectations," MPRA Paper 108931, University Library of Munich, Germany.
    16. François Le Grand & Xavier Ragot, 2022. "Managing Inequality Over Business Cycles: Optimal Policies With Heterogeneous Agents And Aggregate Shocks," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 63(1), pages 511-540, February.
    17. Grimaud, Alex, 2021. "Precautionary saving and un-anchored expectations," ECON WPS - Working Papers in Economic Theory and Policy 08/2021, TU Wien, Institute of Statistics and Mathematical Methods in Economics, Economics Research Unit.
    18. Karsten O. Chipeniuk & Nets Hawk Katz & Todd Bruce Walker, 2022. "Households, Auctioneers, and Aggregation," CAEPR Working Papers 2022-005 Classification-E, Center for Applied Economics and Policy Research, Department of Economics, Indiana University Bloomington.
    19. Kase, Hanno & Melosi, Leonardo & Rottner, Matthias, 2022. "Estimating Nonlinear Heterogeneous Agents Models with Neural Networks," CEPR Discussion Papers 17391, C.E.P.R. Discussion Papers.
    20. Marlon Azinovic-Yang & Jan v{Z}emliv{c}ka, 2025. "Deep Learning in the Sequence Space," Papers 2509.13623, arXiv.org, revised Mar 2026.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2512.18892. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.