From Individual Learning to Market Equilibrium: Correcting Structural and Parametric Biases in RL Simulations of Economic Models

Author

Listed:

Ruxin Chen
Zeqiang Zhang

Registered:

Zeqiang Zhang

Abstract

The application of Reinforcement Learning (RL) to economic modeling reveals a fundamental conflict between the assumptions of equilibrium theory and the emergent behavior of learning agents. While canonical economic models assume atomistic agents act as 'takers' of aggregate market conditions, a naive single-agent RL simulation incentivizes the agent to become a 'manipulator' of its environment. This paper first demonstrates this discrepancy within a search-and-matching model with concave production, showing that a standard RL agent learns a non-equilibrium, monopsonistic policy. Additionally, we identify a parametric bias arising from the mismatch between economic discounting and RL's treatment of intertemporal costs. To address both issues, we propose a calibrated Mean-Field Reinforcement Learning framework that embeds a representative agent in a fixed macroeconomic field and adjusts the cost function to reflect economic opportunity costs. Our iterative algorithm converges to a self-consistent fixed point where the agent's policy aligns with the competitive equilibrium. This approach provides a tractable and theoretically sound methodology for modeling learning agents in economic systems within the broader domain of computational social science.
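
The contrast between a 'taker' and a 'manipulator', and the idea of iterating to a self-consistent fixed point, can be illustrated with a toy example. The sketch below is not the paper's search-and-matching model or its algorithm: it assumes a static labor market with a hypothetical linear wage schedule w(N) = a + b*N, a concave production function n**alpha, and a closed-form best response standing in for the RL training step. All function names and parameter values are illustrative assumptions.

```python
# Toy illustration only: a static stand-in for the taker/manipulator contrast.
# The wage schedule, production function, and parameters are assumptions,
# and the closed-form best response replaces any actual RL training.

def wage(aggregate_n, a=0.5, b=0.5):
    """Hypothetical market wage as a function of aggregate employment."""
    return a + b * aggregate_n


def taker_best_response(w, alpha=0.5):
    """Employment maximizing n**alpha - w*n when the wage w is taken as given.

    First-order condition: alpha * n**(alpha - 1) = w.
    """
    return (alpha / w) ** (1.0 / (1.0 - alpha))


def mean_field_fixed_point(n0=1.0, damping=0.5, tol=1e-10, max_iter=10_000):
    """Damped iteration: hold the aggregate field fixed, best-respond, update."""
    n_field = n0
    for _ in range(max_iter):
        n_best = taker_best_response(wage(n_field))
        n_next = (1.0 - damping) * n_field + damping * n_best
        if abs(n_next - n_field) < tol:
            return n_next
        n_field = n_next
    raise RuntimeError("mean-field iteration did not converge")


def manipulator_choice(alpha=0.5, a=0.5, b=0.5, lo=1e-6, hi=10.0, iters=200):
    """Employment of a single agent that internalizes its effect on the wage.

    It maximizes n**alpha - (a + b*n)*n, so its first-order condition is
    alpha * n**(alpha - 1) = a + 2*b*n, solved here by bisection.
    """
    def foc(n):
        return alpha * n ** (alpha - 1.0) - (a + 2.0 * b * n)

    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if foc(mid) > 0.0:
            lo = mid  # marginal product still exceeds marginal cost
        else:
            hi = mid
    return 0.5 * (lo + hi)


n_competitive = mean_field_fixed_point()
n_manipulator = manipulator_choice()
```

In this toy setting the manipulator hires less than the fixed-point level (n_manipulator < n_competitive), which mirrors the monopsonistic distortion described above, while the damped iteration settles where the agent's best response reproduces the field it was given.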

Suggested Citation

Ruxin Chen & Zeqiang Zhang, 2025. "From Individual Learning to Market Equilibrium: Correcting Structural and Parametric Biases in RL Simulations of Economic Models," Papers 2507.18229, arXiv.org, revised Oct 2025.

Handle: RePEc:arx:papers:2507.18229

References listed on IDEAS

Eric Smith, 1999. "Search, Concave Production, and Optimal Firm Size," Review of Economic Dynamics, Elsevier for the Society for Economic Dynamics, vol. 2(2), pages 456-471, April.
- Smith, Eric, 1994. "Search, Concave Production, and Optimal Firm Size," CEPR Discussion Papers 882, C.E.P.R. Discussion Papers.
Diogo Gomes & João Saúde, 2014. "Mean Field Games Models—A Brief Survey," Dynamic Games and Applications, Springer, vol. 4(2), pages 110-154, June.
Michael Curry & Alexander Trott & Soham Phade & Yu Bai & Stephan Zheng, 2022. "Analyzing Micro-Founded General Equilibrium Models with Many Agents using Deep Reinforcement Learning," Papers 2201.01163, arXiv.org, revised Feb 2022.
Tohid Atashbar & Rui Aruhan Shi, 2023. "AI and Macroeconomic Modeling: Deep Reinforcement Learning in an RBC model," IMF Working Papers 2023/040, International Monetary Fund.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Kshama Dwarakanath & Svitlana Vyetrenko & Tucker Balch, 2024. "Empirical Equilibria in Agent-based Economic systems with Learning agents," Papers 2408.12038, arXiv.org.
Qirui Mi & Zhiyu Zhao & Chengdong Ma & Siyu Xia & Yan Song & Mengyue Yang & Jun Wang & Haifeng Zhang, 2024. "Learning Macroeconomic Policies through Dynamic Stackelberg Mean-Field Games," Papers 2403.12093, arXiv.org, revised Jun 2025.
Kshama Dwarakanath & Jialin Dong & Svitlana Vyetrenko, 2024. "Tax Credits and Household Behavior: The Roles of Myopic Decision-Making and Liquidity in a Simulated Economy," Papers 2408.10391, arXiv.org, revised Oct 2024.
Paulo B. Brito, 2022. "The dynamics of growth and distribution in a spatially heterogeneous world," Portuguese Economic Journal, Springer;Instituto Superior de Economia e Gestao, vol. 21(3), pages 311-350, September.
- Paulo Brito, 2004. "The Dynamics of Growth and Distribution in a Spatially Heterogeneous World," Working Papers Department of Economics 2004/14, ISEG - Lisbon School of Economics and Management, Department of Economics, Universidade de Lisboa.
Yujing Xu, 2022. "Unobservable investments, trade efficiency and search frictions," Canadian Journal of Economics/Revue canadienne d'économique, John Wiley & Sons, vol. 55(2), pages 764-799, May.
William Hawkins & Daron Acemoglu, 2007. "Equilibrium Unemployment in a Generalized Search Model," 2007 Meeting Papers 384, Society for Economic Dynamics.
- William Hawkins & Daron Acemoglu, 2010. "Equilibrium Unemployment in a Generalized Search Model," 2010 Meeting Papers 1040, Society for Economic Dynamics.
Nicolas Roys, 2016. "Persistence of Shocks and the Reallocation of Labor," Review of Economic Dynamics, Elsevier for the Society for Economic Dynamics, vol. 22, pages 109-130, October.
- Nicolas Roys, 2016. "Persistence of Shocks and the Reallocation of Labor," Working Papers 2016-14, Federal Reserve Bank of St. Louis.
Leo Kaas & Philipp Kircher, 2015. "Efficient Firm Dynamics in a Frictional Labor Market," American Economic Review, American Economic Association, vol. 105(10), pages 3030-3060, October.
- Philipp Kircher & Leo Kaas, 2010. "Efficient Firm Dynamics in a Frictional Labor Market," 2010 Meeting Papers 89, Society for Economic Dynamics.
- Leo Kaas & Philipp Kircher, 2015. "Efficient Firm Dynamics in a Frictional Labor Market," Working Paper Series of the Department of Economics, University of Konstanz 2015-09, Department of Economics, University of Konstanz.
- Philipp Kircher & Leo Kaas, 2013. "Efficient firm dynamics in a frictional labor market," 2013 Meeting Papers 160, Society for Economic Dynamics.
- Kaas, Leo & Kircher, Philipp, 2011. "Efficient Firm Dynamics in a Frictional Labor Market," IZA Discussion Papers 5452, Institute of Labor Economics (IZA).
- Leo Kaas & Philipp Kircher, 2011. "Efficient Firm Dynamics in a Frictional Labor Market," CESifo Working Paper Series 3336, CESifo.
- Leo Kaas & Philipp Kircher, 2011. "Efficient Firm Dynamics in a Frictional Labor Market," Working Paper Series of the Department of Economics, University of Konstanz 2011-01, Department of Economics, University of Konstanz.
Noritaka Kudoh & Masaru Sasaki, 2010. "Precautionary Demand For Labour And Firm Size," Bulletin of Economic Research, Wiley Blackwell, vol. 62(2), pages 133-153, April.
Michael U. Krause & Thomas A. Lubik, 2013. "Does Intra-Firm Bargaining Matter for Business Cycle Dynamics?," Economic Quarterly, Federal Reserve Bank of Richmond, issue 3Q, pages 229-250.
- Krause, Michael & Lubik, Thomas A., 2007. "Does intra-firm bargaining matter for business cycle dynamics?," Discussion Paper Series 1: Economic Studies 2007,17, Deutsche Bundesbank.
Noha Almulla & Rita Ferreira & Diogo Gomes, 2017. "Two Numerical Approaches to Stationary Mean-Field Games," Dynamic Games and Applications, Springer, vol. 7(4), pages 657-682, December.
Dobbelaere, Sabien & Luttens, Roland Iwan, 2016. "Gradual collective wage bargaining," Labour Economics, Elsevier, vol. 40(C), pages 37-42.
- Dobbelaere, Sabien & Luttens, Roland Iwan, 2016. "Gradual Collective Wage Bargaining," IZA Discussion Papers 9691, Institute of Labor Economics (IZA).
- Sabien Dobbelaere & Roland Iwan Luttens, 2016. "Gradual Collective Wage Bargaining," Tinbergen Institute Discussion Papers 16-004/V, Tinbergen Institute.
Bent Christensen & Jesper Bagger, 2014. "Wage and Productivity Dispersion: The Roles of Rent Sharing, Labor Quality and Capital Intensity," 2014 Meeting Papers 473, Society for Economic Dynamics.
V. N. Kolokoltsov & O. A. Malafeyev, 2018. "Corruption and botnet defense: a mean field game approach," International Journal of Game Theory, Springer;Game Theory Society, vol. 47(3), pages 977-999, September.
Bastgen, A. & Holzner, C.L., 2017. "Employment protection and the market for innovations," Labour Economics, Elsevier, vol. 46(C), pages 77-93.
- Andreas Bastgen & Christian Holzner, 2015. "Employment Protection and the Market for Innovations," CESifo Working Paper Series 5275, CESifo.
André Kurmann, 2009. "Holdups and Overinvestment in Physical Capital Markets," Cahiers de recherche 0904, CIRPEE.
Monique Ebell & Christian Haefke, 2009. "Product Market Deregulation and the U.S. Employment Miracle," Review of Economic Dynamics, Elsevier for the Society for Economic Dynamics, vol. 12(3), pages 479-504, July.
- Monique Ebell & Christian Haefke, 2002. "Product market deregulation and the U.S. employment miracle," Economics Working Papers 930, Department of Economics and Business, Universitat Pompeu Fabra, revised Jan 2006.
- Monique Ebell & Christian Haefke, 2008. "Product Market Deregulation and the U.S. Employment Miracle," CEP Discussion Papers dp0874, Centre for Economic Performance, LSE.
- Ebell, Monique & Haefke, Christian, 2008. "Product market deregulation and the U.S. employment miracle," LSE Research Online Documents on Economics 19569, London School of Economics and Political Science, LSE Library.
- Monique Ebell & Christian Haefke, 2015. "Product Market Deregulation and the U.S. Employment Miracle," Working Papers 250, Barcelona School of Economics.
- Ebell, Monique & Haefke, Christian, 2006. "Product Market Deregulation and the U.S. Employment Miracle," IZA Discussion Papers 1946, Institute of Labor Economics (IZA).
- Ebell, Monique & Haefke, Christian, 2008. "Product Market Deregulation and the U.S. Employment Miracle," Economics Series 223, Institute for Advanced Studies.
Noritaka Kudoh & Masaru Sasaki, 2007. "Precautionary Demand for Labor in Search Equilibrium," Discussion Papers in Economics and Business 07-34, Osaka University, Graduate School of Economics.
Monique Ebell & Christian Haefke, 2002. "Product Market Deregulation and Labor Market Outcomes," Working Papers 02.08, Swiss National Bank, Study Center Gerzensee.
- Ebell, Monique & Haefke, Christian, 2003. "Product Market Deregulation and Labor Market Outcomes," IZA Discussion Papers 957, Institute of Labor Economics (IZA).
- Monique Ebell & Christian Haefke, 2002. "Product market deregulation and labor market outcomes," Economics Working Papers 726, Department of Economics and Business, Universitat Pompeu Fabra, revised Dec 2003.
Bauducco, Sofía & Janiak, Alexandre, 2018. "The macroeconomic consequences of raising the minimum wage: Capital accumulation, employment and the wage distribution," European Economic Review, Elsevier, vol. 101(C), pages 57-76.
- Alexandre Janiak & Sofía Bauducco, 2017. "The Macroeconomic Consequences of Raising the Minimum Wage: Capital Accumulation, Employment and the Wage Distribution," Documentos de Trabajo 481, Instituto de Economia. Pontificia Universidad Católica de Chile..

More about this item

NEP fields

This paper has been announced in the following NEP Reports:

NEP-CMP-2025-08-11 (Computational Economics)
NEP-HPE-2025-08-11 (History and Philosophy of Economics)

