IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2507.18229.html
   My bibliography  Save this paper

From Individual Learning to Market Equilibrium: Correcting Structural and Parametric Biases in RL Simulations of Economic Models

Author

Listed:
  • Zeqiang Zhang
  • Ruxin Chen

Abstract

The application of Reinforcement Learning (RL) to economic modeling reveals a fundamental conflict between the assumptions of equilibrium theory and the emergent behavior of learning agents. While canonical economic models assume atomistic agents act as `takers' of aggregate market conditions, a naive single-agent RL simulation incentivizes the agent to become a `manipulator' of its environment. This paper first demonstrates this discrepancy within a search-and-matching model with concave production, showing that a standard RL agent learns a non-equilibrium, monopsonistic policy. Additionally, we identify a parametric bias arising from the mismatch between economic discounting and RL's treatment of intertemporal costs. To address both issues, we propose a calibrated Mean-Field Reinforcement Learning framework that embeds a representative agent in a fixed macroeconomic field and adjusts the cost function to reflect economic opportunity costs. Our iterative algorithm converges to a self-consistent fixed point where the agent's policy aligns with the competitive equilibrium. This approach provides a tractable and theoretically sound methodology for modeling learning agents in economic systems within the broader domain of computational social science.

Suggested Citation

  • Zeqiang Zhang & Ruxin Chen, 2025. "From Individual Learning to Market Equilibrium: Correcting Structural and Parametric Biases in RL Simulations of Economic Models," Papers 2507.18229, arXiv.org.
  • Handle: RePEc:arx:papers:2507.18229
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2507.18229
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Diogo Gomes & João Saúde, 2014. "Mean Field Games Models—A Brief Survey," Dynamic Games and Applications, Springer, vol. 4(2), pages 110-154, June.
    2. Eric Smith, 1999. "Search, Concave Production, and Optimal Firm Size," Review of Economic Dynamics, Elsevier for the Society for Economic Dynamics, vol. 2(2), pages 456-471, April.
    3. Michael Curry & Alexander Trott & Soham Phade & Yu Bai & Stephan Zheng, 2022. "Analyzing Micro-Founded General Equilibrium Models with Many Agents using Deep Reinforcement Learning," Papers 2201.01163, arXiv.org, revised Feb 2022.
    4. Tohid Atashbar & Rui Aruhan Shi, 2023. "AI and Macroeconomic Modeling: Deep Reinforcement Learning in an RBC model," IMF Working Papers 2023/040, International Monetary Fund.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Kshama Dwarakanath & Svitlana Vyetrenko & Tucker Balch, 2024. "Empirical Equilibria in Agent-based Economic systems with Learning agents," Papers 2408.12038, arXiv.org.
    2. Qirui Mi & Zhiyu Zhao & Chengdong Ma & Siyu Xia & Yan Song & Mengyue Yang & Jun Wang & Haifeng Zhang, 2024. "Learning Macroeconomic Policies through Dynamic Stackelberg Mean-Field Games," Papers 2403.12093, arXiv.org, revised Jun 2025.
    3. Kshama Dwarakanath & Jialin Dong & Svitlana Vyetrenko, 2024. "Tax Credits and Household Behavior: The Roles of Myopic Decision-Making and Liquidity in a Simulated Economy," Papers 2408.10391, arXiv.org, revised Oct 2024.
    4. Paulo B. Brito, 2022. "The dynamics of growth and distribution in a spatially heterogeneous world," Portuguese Economic Journal, Springer;Instituto Superior de Economia e Gestao, vol. 21(3), pages 311-350, September.
    5. Yujing Xu, 2022. "Unobservable investments, trade efficiency and search frictions," Canadian Journal of Economics/Revue canadienne d'économique, John Wiley & Sons, vol. 55(2), pages 764-799, May.
    6. repec:spo:wpmain:info:hdl:2441/3aom2mve1k829p8sp4h3vrpgkg is not listed on IDEAS
    7. William Hawkins & Daron Acemoglu, 2007. "Equilibrium Unemployment in a Generalized Search Model," 2007 Meeting Papers 384, Society for Economic Dynamics.
    8. Nicolas Roys, 2016. "Persistence of Shocks and the Reallocation of Labor," Review of Economic Dynamics, Elsevier for the Society for Economic Dynamics, vol. 22, pages 109-130, October.
    9. Leo Kaas & Philipp Kircher, 2015. "Efficient Firm Dynamics in a Frictional Labor Market," American Economic Review, American Economic Association, vol. 105(10), pages 3030-3060, October.
    10. Noritaka Kudoh & Masaru Sasaki, 2010. "Precautionary Demand For Labour And Firm Size," Bulletin of Economic Research, Wiley Blackwell, vol. 62(2), pages 133-153, April.
    11. Michael U. Krause & Thomas A. Lubik, 2013. "Does Intra-Firm Bargaining Matter for Business Cycle Dynamics?," Economic Quarterly, Federal Reserve Bank of Richmond, issue 3Q, pages 229-250.
    12. Noha Almulla & Rita Ferreira & Diogo Gomes, 2017. "Two Numerical Approaches to Stationary Mean-Field Games," Dynamic Games and Applications, Springer, vol. 7(4), pages 657-682, December.
    13. Dobbelaere, Sabien & Luttens, Roland Iwan, 2016. "Gradual collective wage bargaining," Labour Economics, Elsevier, vol. 40(C), pages 37-42.
    14. Bent Christensen & Jesper Bagger, 2014. "Wage and Productivity Dispersion: The Roles of Rent Sharing, Labor Quality and Capital Intensity," 2014 Meeting Papers 473, Society for Economic Dynamics.
    15. V. N. Kolokoltsov & O. A. Malafeyev, 2018. "Corruption and botnet defense: a mean field game approach," International Journal of Game Theory, Springer;Game Theory Society, vol. 47(3), pages 977-999, September.
    16. Bastgen, A. & Holzner, C.L., 2017. "Employment protection and the market for innovations," Labour Economics, Elsevier, vol. 46(C), pages 77-93.
    17. André Kurmann, 2009. "Holdups and Overinvestment in Physical Capital Markets," Cahiers de recherche 0904, CIRPEE.
    18. Monique Ebell & Christian Haefke, 2009. "Product Market Deregulation and the U.S. Employment Miracle," Review of Economic Dynamics, Elsevier for the Society for Economic Dynamics, vol. 12(3), pages 479-504, July.
    19. Noritaka Kudoh & Masaru Sasaki, 2007. "Precautionary Demand for Labor in Search Equilibrium," Discussion Papers in Economics and Business 07-34, Osaka University, Graduate School of Economics.
    20. Monique Ebell & Christian Haefke, 2002. "Product Market Deregulation and Labor Market Outcomes," Working Papers 02.08, Swiss National Bank, Study Center Gerzensee.
    21. Bauducco, Sofía & Janiak, Alexandre, 2018. "The macroeconomic consequences of raising the minimum wage: Capital accumulation, employment and the wage distribution," European Economic Review, Elsevier, vol. 101(C), pages 57-76.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2507.18229. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.