IDEAS home Printed from https://ideas.repec.org/a/kap/compec/v61y2023i4d10.1007_s10614-022-10249-3.html
   My bibliography  Save this article

Stock Price Formation: Precepts from a Multi-Agent Reinforcement Learning Model

Author

Listed:
  • Johann Lussange

    (École Normale Supérieure)

  • Stefano Vrizzi

    (École Normale Supérieure
    École Normale Supérieure)

  • Sacha Bourgeois-Gironde

    (Institut Jean-Nicod
    Université Paris II Panthéon-Assas)

  • Stefano Palminteri

    (École Normale Supérieure
    NU University Higher School of Economics)

  • Boris Gutkin

    (École Normale Supérieure
    NU University Higher School of Economics)

Abstract

In the past, the bottom-up study of financial stock markets relied on first-generation multi-agent systems (MAS) , which employed zero-intelligence agents and often required the additional implementation of so-called noise traders to emulate price formation processes. Nowadays, thanks to the tools developed in cognitive science and machine learning, MAS can quantitatively gauge agent learning, a pivotal element for information and stock price estimation in finance. In our previous work, we therefore devised a new generation MAS stock market simulator , which implements two key features: firstly, each agent autonomously learns to perform price forecasting and stock trading via model-free reinforcement learning ; secondly, all agents ’ trading decisions feed a centralised double-auction limit order book, emulating price and volume microstructures. Here, we study which trading strategies (represented as reinforcement learning policies) the agents learn and the time-dependency of their heterogeneity. Our central result is that there are more ways to succeed in trading than to fail. More specifically, we find that : i- better-performing agents learn in time more diverse trading strategies than worse-performing ones, ii- they tend to employ a fundamentalist, rather than chartist, approach to asset price valuation, and iii- their transaction orders are less stringent (i.e. larger bids or lower asks).

Suggested Citation

  • Johann Lussange & Stefano Vrizzi & Sacha Bourgeois-Gironde & Stefano Palminteri & Boris Gutkin, 2023. "Stock Price Formation: Precepts from a Multi-Agent Reinforcement Learning Model," Computational Economics, Springer;Society for Computational Economics, vol. 61(4), pages 1523-1544, April.
  • Handle: RePEc:kap:compec:v:61:y:2023:i:4:d:10.1007_s10614-022-10249-3
    DOI: 10.1007/s10614-022-10249-3
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10614-022-10249-3
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10614-022-10249-3?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Stephen J. Hardiman & Nicolas Bercot & Jean-Philippe Bouchaud, 2013. "Critical reflexivity in financial markets: a Hawkes process analysis," Papers 1302.1405, arXiv.org, revised Jun 2013.
    2. Sandrine Jacob Leal & Mauro Napoletano & Andrea Roventini & Giorgio Fagiolo, 2016. "Rock around the clock: An agent-based model of low- and high-frequency trading," Journal of Evolutionary Economics, Springer, vol. 26(1), pages 49-76, March.
    3. T. T. Chen & B. Zheng & Y. Li & X. F. Jiang, 2017. "New approaches in agent-based modeling of complex financial systems," Papers 1703.06840, arXiv.org.
    4. Stefano Palminteri & Mehdi Khamassi & Mateus Joffily & Giorgio Coricelli, 2015. "Contextual modulation of value signals in reward and punishment learning," Nature Communications, Nature, vol. 6(1), pages 1-14, November.
    5. Grossman, Sanford J & Stiglitz, Joseph E, 1980. "On the Impossibility of Informationally Efficient Markets," American Economic Review, American Economic Association, vol. 70(3), pages 393-408, June.
    6. Michael Benzaquen & Jean-Philippe Bouchaud, 2018. "A fractional reaction–diffusion description of supply and demand," The European Physical Journal B: Condensed Matter and Complex Systems, Springer;EDP Sciences, vol. 91(2), pages 1-7, February.
    7. Jan A. Lipski & Ryszard Kutner, 2013. "Agent-Based Stock Market Model with Endogenous Agents' Impact," Papers 1310.0762, arXiv.org, revised Dec 2013.
    8. Thomas Lux & Michele Marchesi, 1999. "Scaling and criticality in a stochastic multi-agent model of a financial market," Nature, Nature, vol. 397(6719), pages 498-500, February.
    9. D. Sornette, 2014. "Physics and Financial Economics (1776-2014): Puzzles, Ising and Agent-Based models," Papers 1404.0243, arXiv.org.
    10. Westerhoff Frank H., 2008. "The Use of Agent-Based Financial Market Models to Test the Effectiveness of Regulatory Policies," Journal of Economics and Statistics (Jahrbuecher fuer Nationaloekonomie und Statistik), De Gruyter, vol. 228(2-3), pages 195-227, April.
    11. Platt, Donovan & Gebbie, Tim, 2018. "Can agent-based models probe market microstructure?," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 503(C), pages 1092-1106.
    12. Gualdi, Stanislao & Tarzia, Marco & Zamponi, Francesco & Bouchaud, Jean-Philippe, 2015. "Tipping points in macroeconomic agent-based models," Journal of Economic Dynamics and Control, Elsevier, vol. 50(C), pages 29-61.
    13. Gode, Dhananjay K & Sunder, Shyam, 1993. "Allocative Efficiency of Markets with Zero-Intelligence Traders: Market as a Partial Substitute for Individual Rationality," Journal of Political Economy, University of Chicago Press, vol. 101(1), pages 119-137, February.
    14. Anna Dodonova & Yuri Khoroshilov, 2018. "Private information in futures markets: An experimental study," Managerial and Decision Economics, John Wiley & Sons, Ltd., vol. 39(1), pages 65-70, January.
    15. Roberto Mota Navarro & Hernán Larralde, 2017. "A detailed heterogeneous agent model for a single asset financial market with trading via an order book," PLOS ONE, Public Library of Science, vol. 12(2), pages 1-27, February.
    16. Fama, Eugene F, 1970. "Efficient Capital Markets: A Review of Theory and Empirical Work," Journal of Finance, American Finance Association, vol. 25(2), pages 383-417, May.
    17. Michael Benzaquen & Jean-Philippe Bouchaud, 2018. "A fractional reaction–diffusion description of supply and demand," Post-Print hal-02323544, HAL.
    18. Thomas Spooner & John Fearnley & Rahul Savani & Andreas Koukorinis, 2018. "Market Making via Reinforcement Learning," Papers 1804.04216, arXiv.org.
    19. Sylvain Barde, 2015. "A Practical, Universal, Information Criterion over Nth Order Markov Processes," Studies in Economics 1504, School of Economics, University of Kent.
    20. Marc Potters & Jean-Philippe Bouchaud, 2001. "More stylized facts of financial markets: leverage effect and downside correlations," Science & Finance (CFM) working paper archive 29960, Science & Finance, Capital Fund Management.
    21. I. Momennejad & E. M. Russek & J. H. Cheong & M. M. Botvinick & N. D. Daw & S. J. Gershman, 2017. "The successor representation in human reinforcement learning," Nature Human Behaviour, Nature, vol. 1(9), pages 680-692, September.
    22. Chiarella, Carl & Iori, Giulia, 2009. "The impact of heterogeneous trading rules on the limit order book and order flows," Journal of Economic Dynamics and Control, Elsevier, vol. 33(3), pages 525-537.
    23. Bouchaud, Jean-Philippe & Potters, Marc, 2001. "More stylized facts of financial markets: leverage effect and downside correlations," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 299(1), pages 60-70.
    24. Justin Sirignano & Rama Cont, 2019. "Universal features of price formation in financial markets: perspectives from deep learning," Quantitative Finance, Taylor & Francis Journals, vol. 19(9), pages 1449-1459, September.
    25. Alessio Emanuele Biondo, 2019. "Order book modeling and financial stability," Journal of Economic Interaction and Coordination, Springer;Society for Economic Science with Heterogeneous Interacting Agents, vol. 14(3), pages 469-489, September.
    26. Ross, Stephen A, 1973. "The Economic Theory of Agency: The Principal's Problem," American Economic Review, American Economic Association, vol. 63(2), pages 134-139, May.
    27. J. Doyne Farmer & Duncan Foley, 2009. "The economy needs agent-based modelling," Nature, Nature, vol. 460(7256), pages 685-686, August.
    28. Marco Bartolozzi, 2010. "A Multi Agent Model for the Limit Order Book Dynamics," Papers 1005.0182, arXiv.org, revised Oct 2010.
    29. Stefano Palminteri & Mehdi Khamassi & Mateus Joffily & Giorgio Coricelli, 2015. "Contextual modulation of value signals in reward and punishment learning," Post-Print halshs-01236045, HAL.
    30. Damien Challet & Robin Stinchcombe, 2003. "Non-constant rates and over-diffusive prices in a simple model of limit order markets," Quantitative Finance, Taylor & Francis Journals, vol. 3(3), pages 155-162.
    31. Weibing Huang & Charles-Albert Lehalle & Mathieu Rosenbaum, 2015. "Simulating and Analyzing Order Book Data: The Queue-Reactive Model," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(509), pages 107-122, March.
    32. Pramod Kumar Naik & Rangan Gupta & Puja Padhi, 2018. "The Relationship Between Stock Market Volatility And Trading Volume: Evidence From South Africa," Journal of Developing Areas, Tennessee State University, College of Business, vol. 52(1), pages 99-114, January-M.
    33. Erev, I. & Roth, Alvin E., 2014. "Maximization, learning, and economic behavior," Scholarly Articles 30831199, Harvard University Department of Economics.
    34. Stephen Hardiman & Nicolas Bercot & Jean-Philippe Bouchaud, 2013. "Critical reflexivity in financial markets: a Hawkes process analysis," The European Physical Journal B: Condensed Matter and Complex Systems, Springer;EDP Sciences, vol. 86(10), pages 1-9, October.
    35. M. Bartolozzi, 2010. "A multi agent model for the limit order book dynamics," The European Physical Journal B: Condensed Matter and Complex Systems, Springer;EDP Sciences, vol. 78(2), pages 265-273, November.
    36. R. Cont, 2001. "Empirical properties of asset returns: stylized facts and statistical issues," Quantitative Finance, Taylor & Francis Journals, vol. 1(2), pages 223-236.
    37. Franke, Reiner & Westerhoff, Frank, 2012. "Structural stochastic volatility in asset pricing dynamics: Estimation and model contest," Journal of Economic Dynamics and Control, Elsevier, vol. 36(8), pages 1193-1211.
    38. Maslov, Sergei, 2000. "Simple model of a limit order-driven market," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 278(3), pages 571-578.
    39. Sumitra Ganesh & Nelson Vadori & Mengda Xu & Hua Zheng & Prashant Reddy & Manuela Veloso, 2019. "Reinforcement Learning for Market Making in a Multi-agent Dealer Market," Papers 1911.05892, arXiv.org.
    40. Germain Lefebvre & Maël Lebreton & Florent Meyniel & Sacha Bourgeois-Gironde & Stefano Palminteri, 2017. "Behavioural and neural characterization of optimistic reinforcement learning," Nature Human Behaviour, Nature, vol. 1(4), pages 1-9, April.
    41. Jean-Philippe Bouchaud, 2019. "Econophysics: Still fringe after 30 years?," Papers 1901.03691, arXiv.org.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Johann Lussange & Ivan Lazarevich & Sacha Bourgeois-Gironde & Stefano Palminteri & Boris Gutkin, 2021. "Modelling Stock Markets by Multi-agent Reinforcement Learning," Computational Economics, Springer;Society for Computational Economics, vol. 57(1), pages 113-147, January.
    2. Johann Lussange & Boris Gutkin, 2023. "Order book regulatory impact on stock market quality: a multi-agent reinforcement learning perspective," Papers 2302.04184, arXiv.org.
    3. Johann Lussange & Stefano Vrizzi & Stefano Palminteri & Boris Gutkin, 2024. "Modelling crypto markets by multi-agent reinforcement learning," Papers 2402.10803, arXiv.org.
    4. Sandrine Jacob Leal & Mauro Napoletano & Andrea Roventini & Giorgio Fagiolo, 2016. "Rock around the clock: An agent-based model of low- and high-frequency trading," Journal of Evolutionary Economics, Springer, vol. 26(1), pages 49-76, March.
    5. Roberto Mota Navarro & Hern'an Larralde Ridaura, 2016. "A detailed heterogeneous agent model for a single asset financial market with trading via an order book," Papers 1601.00229, arXiv.org, revised Jul 2016.
    6. repec:hal:spmain:info:hdl:2441/f6h8764enu2lskk9p4oq9ig8k is not listed on IDEAS
    7. Leal, Sandrine Jacob & Napoletano, Mauro, 2019. "Market stability vs. market resilience: Regulatory policies experiments in an agent-based model with low- and high-frequency trading," Journal of Economic Behavior & Organization, Elsevier, vol. 157(C), pages 15-41.
    8. repec:hal:spmain:info:hdl:2441/6ummnc8nko827b2luohnctekk7 is not listed on IDEAS
    9. repec:hal:spmain:info:hdl:2441/3utlh0ehcn860pus6p2p683ade is not listed on IDEAS
    10. Torsten Trimborn & Philipp Otte & Simon Cramer & Maximilian Beikirch & Emma Pabich & Martin Frank, 2020. "SABCEMM: A Simulator for Agent-Based Computational Economic Market Models," Computational Economics, Springer;Society for Computational Economics, vol. 55(2), pages 707-744, February.
    11. Jovanovic, Franck & Schinckus, Christophe, 2017. "Econophysics and Financial Economics: An Emerging Dialogue," OUP Catalogue, Oxford University Press, number 9780190205034.
    12. Gao-Feng Gu & Xiong Xiong & Hai-Chuan Xu & Wei Zhang & Yongjie Zhang & Wei Chen & Wei-Xing Zhou, 2021. "An empirical behavioral order-driven model with price limit rules," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 7(1), pages 1-24, December.
    13. Roberto Mota Navarro & Hernán Larralde, 2017. "A detailed heterogeneous agent model for a single asset financial market with trading via an order book," PLOS ONE, Public Library of Science, vol. 12(2), pages 1-27, February.
    14. Alessio Emanuele Biondo, 2019. "Order book modeling and financial stability," Journal of Economic Interaction and Coordination, Springer;Society for Economic Science with Heterogeneous Interacting Agents, vol. 14(3), pages 469-489, September.
    15. Martin D. Gould & Mason A. Porter & Stacy Williams & Mark McDonald & Daniel J. Fenn & Sam D. Howison, 2010. "Limit Order Books," Papers 1012.0349, arXiv.org, revised Apr 2013.
    16. Simon Cramer & Torsten Trimborn, 2019. "Stylized Facts and Agent-Based Modeling," Papers 1912.02684, arXiv.org.
    17. Torsten Trimborn & Philipp Otte & Simon Cramer & Max Beikirch & Emma Pabich & Martin Frank, 2018. "SABCEMM-A Simulator for Agent-Based Computational Economic Market Models," Papers 1801.01811, arXiv.org, revised Oct 2018.
    18. Sandrine Jacob Leal & Mauro Napoletano, 2017. "Market Stability vs. Market Resilience: Regulatory Policies Experiments in an Agent-Based Model with Low- and High-Frequency Trading," Post-Print hal-01768876, HAL.
    19. Pankaj Kumar, 2021. "Deep Hawkes Process for High-Frequency Market Making," Papers 2109.15110, arXiv.org.
    20. Kei Katahira & Yu Chen & Gaku Hashimoto & Hiroshi Okuda, 2019. "Development of an agent-based speculation game for higher reproducibility of financial stylized facts," Papers 1902.02040, arXiv.org.
    21. Kononovicius, Aleksejus & Ruseckas, Julius, 2019. "Order book model with herd behavior exhibiting long-range memory," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 525(C), pages 171-191.
    22. Luis Goncalves de Faria, 2022. "An Agent-Based Model With Realistic Financial Time Series: A Method for Agent-Based Models Validation," Papers 2206.09772, arXiv.org.
    23. Aleksejus Kononovicius & Julius Ruseckas, 2018. "Order book model with herd behavior exhibiting long-range memory," Papers 1809.02772, arXiv.org, revised Apr 2019.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:kap:compec:v:61:y:2023:i:4:d:10.1007_s10614-022-10249-3. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.