IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2512.10913.html

Reinforcement Learning in Financial Decision Making: A Systematic Review of Performance, Challenges, and Implementation Strategies

Author

Listed:
  • Mohammad Rezoanul Hoque
  • Md Meftahul Ferdaus
  • M. Kabir Hassan

Abstract

Reinforcement learning (RL) is an innovative approach to financial decision making, offering specialized solutions to complex investment problems where traditional methods fail. This review analyzes 167 articles from 2017--2025, focusing on market making, portfolio optimization, and algorithmic trading. It identifies key performance issues and challenges in RL for finance. Generally, RL offers advantages over traditional methods, particularly in market making. This study proposes a unified framework to address common concerns such as explainability, robustness, and deployment feasibility. Empirical evidence with synthetic data suggests that implementation quality and domain knowledge often outweigh algorithmic complexity. The study highlights the need for interpretable RL architectures for regulatory compliance, enhanced robustness in nonstationary environments, and standardized benchmarking protocols. Organizations should focus less on algorithm sophistication and more on market microstructure, regulatory constraints, and risk management in decision-making.

Suggested Citation

  • Mohammad Rezoanul Hoque & Md Meftahul Ferdaus & M. Kabir Hassan, 2025. "Reinforcement Learning in Financial Decision Making: A Systematic Review of Performance, Challenges, and Implementation Strategies," Papers 2512.10913, arXiv.org.
  • Handle: RePEc:arx:papers:2512.10913
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2512.10913
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Edson Bastos Santos & Rama Cont, 2010. "The Brazilian Interbank Network Structure and Systemic Risk," Working Papers Series 219, Central Bank of Brazil, Research Department.
    2. Johann Lussange & Ivan Lazarevich & Sacha Bourgeois-Gironde & Stefano Palminteri & Boris Gutkin, 2021. "Modelling Stock Markets by Multi-agent Reinforcement Learning," Computational Economics, Springer;Society for Computational Economics, vol. 57(1), pages 113-147, January.
    3. Ben Hambly & Renyuan Xu & Huining Yang, 2021. "Recent Advances in Reinforcement Learning in Finance," Papers 2112.04553, arXiv.org, revised Feb 2023.
    4. Peter Gomber & Jascha-Alexander Koch & Michael Siering, 2017. "Digital Finance and FinTech: current research and future research directions," Journal of Business Economics, Springer, vol. 87(5), pages 537-580, July.
    5. James B. Heaton & Nicholas Polson & Jan H. Witte, 2017. "Rejoinder to ‘Deep learning for finance: deep portfolios’," Applied Stochastic Models in Business and Industry, John Wiley & Sons, vol. 33(1), pages 19-21, January.
    6. Fuli Feng & Xiangnan He & Xiang Wang & Cheng Luo & Yiqun Liu & Tat-Seng Chua, 2018. "Temporal Relational Ranking for Stock Prediction," Papers 1809.09441, arXiv.org, revised Jan 2019.
    7. Bertsimas, Dimitris & Lo, Andrew W., 1998. "Optimal control of execution costs," Journal of Financial Markets, Elsevier, vol. 1(1), pages 1-50, April.
    8. Alexander J. McNeil & Rüdiger Frey & Paul Embrechts, 2015. "Quantitative Risk Management: Concepts, Techniques and Tools Revised edition," Economics Books, Princeton University Press, edition 2, number 10496, December.
    9. Robert Almgren, 2003. "Optimal execution with nonlinear impact functions and trading-enhanced risk," Applied Mathematical Finance, Taylor & Francis Journals, vol. 10(1), pages 1-18.
    10. Kian‐Ping Lim & Robert Brooks, 2011. "The Evolution Of Stock Market Efficiency Over Time: A Survey Of The Empirical Literature," Journal of Economic Surveys, Wiley Blackwell, vol. 25(1), pages 69-108, February.
    11. Thomas Spooner & John Fearnley & Rahul Savani & Andreas Koukorinis, 2018. "Market Making via Reinforcement Learning," Papers 1804.04216, arXiv.org.
    12. Zhengyao Jiang & Dixing Xu & Jinjun Liang, 2017. "A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem," Papers 1706.10059, arXiv.org, revised Jul 2017.
    13. R. Cont, 2001. "Empirical properties of asset returns: stylized facts and statistical issues," Quantitative Finance, Taylor & Francis Journals, vol. 1(2), pages 223-236.
    14. Ben Hambly & Renyuan Xu & Huining Yang, 2023. "Recent advances in reinforcement learning in finance," Mathematical Finance, Wiley Blackwell, vol. 33(3), pages 437-503, July.
    15. Roberto Arturo Agudelo Aguirre & Alberto Antonio Agudelo Aguirre, 2024. "Behavioral finance: Evolution from the classical theory and remarks," Journal of Economic Surveys, Wiley Blackwell, vol. 38(2), pages 452-475, April.
    16. J. B. Heaton & N. G. Polson & J. H. Witte, 2017. "Deep learning for finance: deep portfolios," Applied Stochastic Models in Business and Industry, John Wiley & Sons, vol. 33(1), pages 3-12, January.
    17. Fischer, Thomas & Krauss, Christopher, 2018. "Deep learning with long short-term memory networks for financial market predictions," European Journal of Operational Research, Elsevier, vol. 270(2), pages 654-669.
    18. Brown, Stephen J, et al, 1992. "Survivorship Bias in Performance Studies," The Review of Financial Studies, Society for Financial Studies, vol. 5(4), pages 553-580.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Arasteh, Abdollah, 2025. "A data-driven prediction method for multi-period portfolio optimization using the real options approach," Finance Research Letters, Elsevier, vol. 80(C).
    2. Abraham Itzhak Weinberg, 2025. "Hybrid Quantum-Classical Ensemble Learning for S\&P 500 Directional Prediction," Papers 2512.15738, arXiv.org.
    3. Shuo Sun & Rundong Wang & Bo An, 2021. "Reinforcement Learning for Quantitative Trading," Papers 2109.13851, arXiv.org.
    4. Vitor Azevedo & Christopher Hoegner, 2023. "Enhancing stock market anomalies with machine learning," Review of Quantitative Finance and Accounting, Springer, vol. 60(1), pages 195-230, January.
    5. Flori, Andrea & Regoli, Daniele, 2021. "Revealing Pairs-trading opportunities with long short-term memory networks," European Journal of Operational Research, Elsevier, vol. 295(2), pages 772-791.
    6. Ben Hambly & Renyuan Xu & Huining Yang, 2021. "Recent Advances in Reinforcement Learning in Finance," Papers 2112.04553, arXiv.org, revised Feb 2023.
    7. Rubesam, Alexandre, 2022. "Machine learning portfolios with equal risk contributions: Evidence from the Brazilian market," Emerging Markets Review, Elsevier, vol. 51(PB).
    8. Junjie Guo, 2025. "Integration of Wavelet Transform Convolution and Channel Attention with LSTM for Stock Price Prediction based Portfolio Allocation," Papers 2507.01973, arXiv.org, revised Jul 2025.
    9. Haoren Zhu & Pengfei Zhao & Wilfred Siu Hung NG & Dik Lun Lee, 2024. "Financial Assets Dependency Prediction Utilizing Spatiotemporal Patterns," Papers 2406.11886, arXiv.org.
    10. Jiang, Yifu & Olmo, Jose & Atwi, Majed, 2025. "High-dimensional multi-period portfolio allocation using deep reinforcement learning," International Review of Economics & Finance, Elsevier, vol. 98(C).
    11. Kim, A. & Yang, Y. & Lessmann, S. & Ma, T. & Sung, M.-C. & Johnson, J.E.V., 2020. "Can deep learning predict risky retail investors? A case study in financial risk behavior forecasting," European Journal of Operational Research, Elsevier, vol. 283(1), pages 217-234.
    12. S. C. P. Yam & W. Zhou, 2017. "Optimal Liquidation of Child Limit Orders," Mathematics of Operations Research, INFORMS, vol. 42(2), pages 517-545, May.
    13. Werner Kristjanpoller & Kevin Michell & Cristian Llanos & Marcel C. Minutolo, 2025. "Incorporating causal notions to forecasting time series: a case study," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 11(1), pages 1-22, December.
    14. Iwao Maeda & David deGraw & Michiharu Kitano & Hiroyasu Matsushima & Hiroki Sakaji & Kiyoshi Izumi & Atsuo Kato, 2020. "Deep Reinforcement Learning in Agent Based Financial Market Simulation," JRFM, MDPI, vol. 13(4), pages 1-17, April.
    15. Daniil Karzanov & Rub'en Garz'on & Mikhail Terekhov & Caglar Gulcehre & Thomas Raffinot & Marcin Detyniecki, 2025. "Regret-Optimized Portfolio Enhancement through Deep Reinforcement Learning and Future Looking Rewards," Papers 2502.02619, arXiv.org.
    16. Xiangyu Cui & Xun Li & Yun Shi & Si Zhao, 2023. "Discrete-Time Mean-Variance Strategy Based on Reinforcement Learning," Papers 2312.15385, arXiv.org.
    17. Nikolay A. Andreev, 2015. "Worst-Case Approach To Strategic Optimal Portfolio Selection Under Transaction Costs And Trading Limits," HSE Working papers WP BRP 45/FE/2015, National Research University Higher School of Economics.
    18. Arif Pathan, 2025. "Transformers Beyond Order: A Chaos-Markov-Gaussian Framework for Short-Term Sentiment Forecasting of Any Financial OHLC timeseries Data," Papers 2506.17244, arXiv.org.
    19. Shanyu Han & Yang Liu & Xiang Yu, 2025. "Risk-sensitive Reinforcement Learning Based on Convex Scoring Functions," Papers 2505.04553, arXiv.org, revised May 2025.
    20. Jinyang Li, 2024. "A Deep Reinforcement Learning Framework For Financial Portfolio Management," Papers 2409.08426, arXiv.org.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2512.10913. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.