IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2509.17964.html
   My bibliography  Save this paper

FinFlowRL: An Imitation-Reinforcement Learning Framework for Adaptive Stochastic Control in Finance

Author

Listed:
  • Yang Li
  • Zhi Chen
  • Steve Y. Yang
  • Ruixun Zhang

Abstract

Traditional stochastic control methods in finance rely on simplifying assumptions that often fail in real world markets. While these methods work well in specific, well defined scenarios, they underperform when market conditions change. We introduce FinFlowRL, a novel framework for financial stochastic control that combines imitation learning with reinforcement learning. The framework first pretrains an adaptive meta policy by learning from multiple expert strategies, then finetunes it through reinforcement learning in the noise space to optimize the generation process. By employing action chunking, that is generating sequences of actions rather than single decisions, it addresses the non Markovian nature of financial markets. FinFlowRL consistently outperforms individually optimized experts across diverse market conditions.

Suggested Citation

  • Yang Li & Zhi Chen & Steve Y. Yang & Ruixun Zhang, 2025. "FinFlowRL: An Imitation-Reinforcement Learning Framework for Adaptive Stochastic Control in Finance," Papers 2509.17964, arXiv.org.
  • Handle: RePEc:arx:papers:2509.17964
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2509.17964
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Olivier Gu'eant & Charles-Albert Lehalle & Joaquin Fernandez Tapia, 2011. "Dealing with the Inventory Risk. A solution to the market making problem," Papers 1105.3115, arXiv.org, revised Aug 2012.
    2. Emmanuel Bacry & Iacopo Mastromatteo & Jean-Franc{c}ois Muzy, 2015. "Hawkes processes in finance," Papers 1502.04592, arXiv.org, revised May 2015.
    3. Merton, Robert C., 1976. "Option pricing when underlying stock returns are discontinuous," Journal of Financial Economics, Elsevier, vol. 3(1-2), pages 125-144.
    4. Black, Fischer & Scholes, Myron S, 1973. "The Pricing of Options and Corporate Liabilities," Journal of Political Economy, University of Chicago Press, vol. 81(3), pages 637-654, May-June.
    5. Merton, Robert C, 1969. "Lifetime Portfolio Selection under Uncertainty: The Continuous-Time Case," The Review of Economics and Statistics, MIT Press, vol. 51(3), pages 247-257, August.
    6. Marco Avellaneda & Sasha Stoikov, 2008. "High-frequency trading in a limit order book," Quantitative Finance, Taylor & Francis Journals, vol. 8(3), pages 217-224.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Yang Li & Zhi Chen, 2025. "FinFlowRL: An Imitation-Reinforcement Learning Framework for Adaptive Stochastic Control in Finance," Papers 2510.15883, arXiv.org.
    2. Jian'an Zhang, 2025. "Risk-Sensitive Option Market Making with Arbitrage-Free eSSVI Surfaces: A Constrained RL and Stochastic Control Bridge," Papers 2510.04569, arXiv.org.
    3. Bjork, Tomas, 2009. "Arbitrage Theory in Continuous Time," OUP Catalogue, Oxford University Press, edition 3, number 9780199574742.
    4. Christensen, Bent Jesper & Parra-Alvarez, Juan Carlos & Serrano, Rafael, 2021. "Optimal control of investment, premium and deductible for a non-life insurance company," Insurance: Mathematics and Economics, Elsevier, vol. 101(PB), pages 384-405.
    5. Suresh M. Sundaresan, 2000. "Continuous‐Time Methods in Finance: A Review and an Assessment," Journal of Finance, American Finance Association, vol. 55(4), pages 1569-1622, August.
    6. Shuang Li & Yanli Zhou & Yonghong Wu & Xiangyu Ge, 2017. "Equilibrium approach of asset and option pricing under Lévy process and stochastic volatility," Australian Journal of Management, Australian School of Business, vol. 42(2), pages 276-295, May.
    7. Bilel Jarraya & Abdelfettah Bouri, 2013. "A Theoretical Assessment on Optimal Asset Allocations in Insurance Industry," International Journal of Finance & Banking Studies, Center for the Strategic Studies in Business and Finance, vol. 2(4), pages 30-44, October.
    8. Olivier Guéant, 2016. "The Financial Mathematics of Market Liquidity: From Optimal Execution to Market Making," Post-Print hal-01393136, HAL.
    9. Wojakowski, Rafal M. & Ebrahim, M. Shahid & Shackleton, Mark B., 2016. "Reducing the impact of real estate foreclosures with Amortizing Participation Mortgages," Journal of Banking & Finance, Elsevier, vol. 71(C), pages 62-74.
    10. Jaimungal, Sebastian & Young, Virginia R., 2005. "Pricing equity-linked pure endowments with risky assets that follow Lévy processes," Insurance: Mathematics and Economics, Elsevier, vol. 36(3), pages 329-346, June.
    11. Amit K. Sinha, 2021. "The reliability of geometric Brownian motion forecasts of S&P500 index values," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 40(8), pages 1444-1462, December.
    12. Grechuk, Bogdan & Zabarankin, Michael, 2018. "Direct data-based decision making under uncertainty," European Journal of Operational Research, Elsevier, vol. 267(1), pages 200-211.
    13. Bruno Gav{s}perov & Zvonko Kostanjv{c}ar, 2022. "Deep Reinforcement Learning for Market Making Under a Hawkes Process-Based Limit Order Book Model," Papers 2207.09951, arXiv.org.
    14. Robert A. Jarrow, 1999. "In Honor of the Nobel Laureates Robert C. Merton and Myron S. Scholes: A Partial Differential Equation That Changed the World," Journal of Economic Perspectives, American Economic Association, vol. 13(4), pages 229-248, Fall.
    15. Ziyi Wang & Carmine Ventre & Maria Polukarov, 2025. "ARL-Based Multi-Action Market Making with Hawkes Processes and Variable Volatility," Papers 2508.16589, arXiv.org.
    16. Stylianos Perrakis, 2022. "From innovation to obfuscation: continuous time finance fifty years later," Financial Markets and Portfolio Management, Springer;Swiss Society for Financial Market Research, vol. 36(3), pages 369-401, September.
    17. Pankaj Kumar, 2021. "Deep Hawkes Process for High-Frequency Market Making," Papers 2109.15110, arXiv.org.
    18. Cui, Zhenyu & Lars Kirkby, J. & Nguyen, Duy, 2019. "A general framework for time-changed Markov processes and applications," European Journal of Operational Research, Elsevier, vol. 273(2), pages 785-800.
    19. Munk, Claus, 2015. "Financial Asset Pricing Theory," OUP Catalogue, Oxford University Press, number 9780198716457.
    20. Torben G. Andersen & Tim Bollerslev & Peter F. Christoffersen & Francis X. Diebold, 2005. "Volatility Forecasting," PIER Working Paper Archive 05-011, Penn Institute for Economic Research, Department of Economics, University of Pennsylvania.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2509.17964. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.