Adaptive learning for financial markets mixing model-based and model-free RL for volatility targeting

Author

Eric Benhamou
David Saltiel
Serge Tabachnik
Sui Kai Wong
François Chareyron

Abstract

Model-free reinforcement learning (RL) has achieved meaningful results in stable environments but, to this day, remains problematic in regime-changing environments such as financial markets. In contrast, model-based RL is able to capture some fundamental and dynamical concepts of the environment but suffers from cognitive bias. In this work, we propose to combine the best of the two techniques by using model-free deep reinforcement learning to select among various model-based approaches. Beyond past performance and volatility, we include additional contextual information, such as macro and risk appetite signals, to account for implicit regime changes. We also adapt traditional RL methods to real-life situations by considering only past data for the training sets; hence we cannot use future information in our training data set, as K-fold cross validation would imply. Building on traditional statistical methods, we use the traditional "walk-forward analysis", defined by successive training and testing over expanding periods, to assess the robustness of the resulting agent. Finally, we present the concept of statistical significance of the difference, based on a two-tailed t-test, to highlight the ways in which our models differ from more traditional ones. Our experimental results show that our approach outperforms traditional financial baseline portfolio models such as the Markowitz model in almost all evaluation metrics commonly used in financial mathematics, namely net performance, Sharpe and Sortino ratios, maximum drawdown, and maximum drawdown over volatility.
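
The walk-forward scheme mentioned in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function name `walk_forward_splits` and the parameters `initial_train` and `test_size` are hypothetical, and the abstract does not state the period lengths actually used. The sketch only shows the general idea of an expanding training window followed by a test block that lies strictly in the future.

```python
from typing import Iterator, List, Tuple


def walk_forward_splits(
    n_samples: int, initial_train: int, test_size: int
) -> Iterator[Tuple[List[int], List[int]]]:
    """Yield (train_indices, test_indices) pairs with an expanding training window.

    Each training set covers every observation before its test block, so no
    future data leaks into training (unlike shuffled K-fold cross validation).
    The parameter names are illustrative assumptions, not taken from the paper.
    """
    if initial_train <= 0 or test_size <= 0:
        raise ValueError("initial_train and test_size must be positive")
    train_end = initial_train
    while train_end < n_samples:
        test_end = min(train_end + test_size, n_samples)
        yield list(range(train_end)), list(range(train_end, test_end))
        # Expand the training window to include the block that was just tested.
        train_end = test_end


# Toy example: 10 observations, 4 in the first training set, test blocks of 3.
splits = list(walk_forward_splits(n_samples=10, initial_train=4, test_size=3))
for train_idx, test_idx in splits:
    print(f"train 0..{train_idx[-1]}  test {test_idx[0]}..{test_idx[-1]}")
```

In this toy run the first model would be trained on observations 0 to 3 and tested on 4 to 6, and the second trained on 0 to 6 and tested on 7 to 9, so every test block is evaluated only by a model that has never seen it.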

Suggested Citation

Eric Benhamou & David Saltiel & Serge Tabachnik & Sui Kai Wong & François Chareyron, 2021. "Adaptive learning for financial markets mixing model-based and model-free RL for volatility targeting," Papers 2104.10483, arXiv.org, revised Apr 2021.

Handle: RePEc:arx:papers:2104.10483

References listed on IDEAS

Eric Benhamou & David Saltiel & Beatrice Guez & Nicolas Paris, 2019. "Testing Sharpe ratio: luck or skill?," Papers 1905.08042, arXiv.org, revised May 2019.
- Eric Benhamou & David Saltiel & Beatrice Guez & Nicolas Paris, 2020. "Testing Sharpe ratio: luck or skill?," Working Papers hal-02886500, HAL.
Xinyi Li & Yinchuan Li & Yuancheng Zhan & Xiao-Yang Liu, 2019. "Optimistic Bull or Pessimistic Bear: Adaptive Deep Reinforcement Learning for Stock Portfolio Allocation," Papers 1907.01503, arXiv.org.
Eric Benhamou, 2018. "Connecting Sharpe ratio and Student t-statistic, and beyond," Papers 1808.04233, arXiv.org, revised May 2019.
- Eric Benhamou, 2019. "Connecting Sharpe ratio and Student t-statistic, and beyond," Working Papers hal-02012448, HAL.
Eric Benhamou & Beatrice Guez & Nicolas Paris, 2019. "Omega and Sharpe ratio," Papers 1911.10254, arXiv.org.
- Eric Benhamou & Beatrice Guez & Nicolas Paris, 2020. "Omega and Sharpe ratio," Working Papers hal-02886481, HAL.
Eric Benhamou, 2018. "Trend without hiccups: a Kalman filter approach," Papers 1808.03297, arXiv.org.
Haoran Wang & Xun Yu Zhou, 2019. "Continuous-Time Mean-Variance Portfolio Selection: A Reinforcement Learning Framework," Papers 1904.11392, arXiv.org, revised May 2019.
Yunan Ye & Hengzhi Pei & Boxin Wang & Pin-Yu Chen & Yada Zhu & Jun Xiao & Bo Li, 2020. "Reinforcement-Learning based Portfolio Management with Augmented Asset Movement Prediction States," Papers 2002.05780, arXiv.org.
Glosten, Lawrence R & Jagannathan, Ravi & Runkle, David E, 1993. "On the Relation between the Expected Value and the Volatility of the Nominal Excess Return on Stocks," Journal of Finance, American Finance Association, vol. 48(5), pages 1779-1801, December.
- Lawrence R. Glosten & Ravi Jagannathan & David E. Runkle, 1993. "On the relation between the expected value and the volatility of the nominal excess return on stocks," Staff Report 157, Federal Reserve Bank of Minneapolis.
Eric Benhamou & Beatrice Guez, 2018. "Incremental Sharpe and other performance ratios," Journal of Statistical and Econometric Methods, SCIENPRESS Ltd, vol. 7(4), pages 1-2.
- Eric Benhamou & Beatrice Guez, 2018. "Incremental Sharpe and other performance ratios," Papers 1807.09864, arXiv.org, revised Dec 2018.
- Eric Benhamou & Beatrice Guez, 2018. "Incremental Sharpe and other performance ratios," Post-Print hal-02012443, HAL.
Eric Benhamou & David Saltiel & Sandrine Ungari & Abhishek Mukhopadhyay, 2020. "Time your hedge with Deep Reinforcement Learning," Papers 2009.14136, arXiv.org, revised Nov 2020.
James B. Heaton & Nicholas Polson & Jan H. Witte, 2017. "Rejoinder to ‘Deep learning for finance: deep portfolios’," Applied Stochastic Models in Business and Industry, John Wiley & Sons, vol. 33(1), pages 19-21, January.
Dias, José G. & Vermunt, Jeroen K. & Ramos, Sofia, 2015. "Clustering financial time series: New insights from an extended hidden Markov model," European Journal of Operational Research, Elsevier, vol. 243(3), pages 852-864.
Eric Benhamou & David Saltiel & Sandrine Ungari & Abhishek Mukhopadhyay, 2020. "Bridging the gap between Markowitz planning and deep reinforcement learning," Papers 2010.09108, arXiv.org.
Volodymyr Mnih & Koray Kavukcuoglu & David Silver & Andrei A. Rusu & Joel Veness & Marc G. Bellemare & Alex Graves & Martin Riedmiller & Andreas K. Fidjeland & Georg Ostrovski & Stig Petersen & Charle, 2015. "Human-level control through deep reinforcement learning," Nature, Nature, vol. 518(7540), pages 529-533, February.
Zhipeng Liang & Hao Chen & Junhao Zhu & Kangkang Jiang & Yanran Li, 2018. "Adversarial Deep Reinforcement Learning in Portfolio Management," Papers 1808.09940, arXiv.org, revised Nov 2018.
J. B. Heaton & N. G. Polson & J. H. Witte, 2017. "Deep learning for finance: deep portfolios," Applied Stochastic Models in Business and Industry, John Wiley & Sons, vol. 33(1), pages 3-12, January.
Diaa Noureldin & Neil Shephard & Kevin Sheppard, 2012. "Multivariate high‐frequency‐based volatility (HEAVY) models," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 27(6), pages 907-933, September.
- Diaa Noureldin & Neil Shephard & Kevin Sheppard, 2011. "Multivariate High-Frequency-Based Volatility (HEAVY) Models," Economics Papers 2011-W01, Economics Group, Nuffield College, University of Oxford.
- Diaa Noureldin & Neil Shephard & Kevin Sheppard, 2011. "Multivariate High-Frequency-Based Volatility (HEAVY) Models," Economics Series Working Papers 533, University of Oxford, Department of Economics.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Eric Benhamou & David Saltiel & Serge Tabachnik & Sui Kai Wong & François Chareyron, 2021. "Distinguish the indistinguishable: a Deep Reinforcement Learning approach for volatility targeting models," Working Papers hal-03202431, HAL.
Eric Benhamou & David Saltiel & Sandrine Ungari & Abhishek Mukhopadhyay & Jamal Atif, 2020. "AAMDRL: Augmented Asset Management with Deep Reinforcement Learning," Papers 2010.08497, arXiv.org.
Eric Benhamou & David Saltiel & Sandrine Ungari & Abhishek Mukhopadhyay, 2020. "Time your hedge with Deep Reinforcement Learning," Papers 2009.14136, arXiv.org, revised Nov 2020.
Eric Benhamou & David Saltiel & Sandrine Ungari & Abhishek Mukhopadhyay, 2020. "Bridging the gap between Markowitz planning and deep reinforcement learning," Papers 2010.09108, arXiv.org.
Amirhosein Mosavi & Yaser Faghan & Pedram Ghamisi & Puhong Duan & Sina Faizollahzadeh Ardabili & Ely Salwana & Shahab S. Band, 2020. "Comprehensive Review of Deep Reinforcement Learning Methods and Applications in Economics," Mathematics, MDPI, vol. 8(10), pages 1-42, September.
Shuo Sun & Rundong Wang & Bo An, 2021. "Reinforcement Learning for Quantitative Trading," Papers 2109.13851, arXiv.org.
Kumar Yashaswi, 2021. "Deep Reinforcement Learning for Portfolio Optimization using Latent Feature State Space (LFSS) Module," Papers 2102.06233, arXiv.org.
Zhengyong Jiang & Jeyan Thiayagalingam & Jionglong Su & Jinjun Liang, 2023. "CAD: Clustering And Deep Reinforcement Learning Based Multi-Period Portfolio Management Strategy," Papers 2310.01319, arXiv.org.
Eric Benhamou & Beatrice Guez, 2021. "Computation of the marginal contribution of Sharpe ratio and other performance ratios," Working Papers hal-03189299, HAL.
Yu, Pengrui & Liu, Siya & Jin, Chengneng & Gu, Runsheng & Gong, Xiaomin, 2025. "Optimization-based spectral end-to-end deep reinforcement learning for equity portfolio management," Pacific-Basin Finance Journal, Elsevier, vol. 91(C).
Ruixun Zhang, 2025. "Toward interpretable machine learning: evaluating models of heterogeneous predictions," Annals of Operations Research, Springer, vol. 347(2), pages 867-887, April.
Han, Chulwoo & Park, Frank C., 2022. "A geometric framework for covariance dynamics," Journal of Banking & Finance, Elsevier, vol. 134(C).
Karanasos, Menelaos & Xu, Yongdeng & Yfanti, Stavroula, 2017. "Constrained QML Estimation for Multivariate Asymmetric MEM with Spillovers: The Practicality of Matrix Inequalities," Cardiff Economics Working Papers E2017/14, Cardiff University, Cardiff Business School, Economics Section.
Catania, Leopoldo & Proietti, Tommaso, 2020. "Forecasting volatility with time-varying leverage and volatility of volatility effects," International Journal of Forecasting, Elsevier, vol. 36(4), pages 1301-1317.
- Leopoldo Catania & Tommaso Proietti, 2019. "Forecasting Volatility with Time-Varying Leverage and Volatility of Volatility Effects," CEIS Research Paper 450, Tor Vergata University, CEIS, revised 06 Feb 2019.
Benjamin Coriat & Eric Benhamou, 2025. "HARLF: Hierarchical Reinforcement Learning and Lightweight LLM-Driven Sentiment Integration for Financial Portfolio Optimization," Papers 2507.18560, arXiv.org.
Amir Mosavi & Pedram Ghamisi & Yaser Faghan & Puhong Duan, 2020. "Comprehensive Review of Deep Reinforcement Learning Methods and Applications in Economics," Papers 2004.01509, arXiv.org.
Dinghai Xu, 2021. "A study on volatility spurious almost integration effect: A threshold realized GARCH approach," International Journal of Finance & Economics, John Wiley & Sons, Ltd., vol. 26(3), pages 4104-4126, July.
- Dinghai Xu, 2019. "A Study on Volatility Spurious Almost Integration Effect: A Threshold Realized GARCH Approach," Working Papers 1903, University of Waterloo, Department of Economics, revised Dec 2019.
Vitor Azevedo & Christopher Hoegner, 2023. "Enhancing stock market anomalies with machine learning," Review of Quantitative Finance and Accounting, Springer, vol. 60(1), pages 195-230, January.
Tang, Xuli & Li, Xin & Ding, Ying & Song, Min & Bu, Yi, 2020. "The pace of artificial intelligence innovations: Speed, talent, and trial-and-error," Journal of Informetrics, Elsevier, vol. 14(4).
Mei-Li Shen & Cheng-Feng Lee & Hsiou-Hsiang Liu & Po-Yin Chang & Cheng-Hong Yang, 2021. "An Effective Hybrid Approach for Forecasting Currency Exchange Rates," Sustainability, MDPI, vol. 13(5), pages 1-29, March.

More about this item

NEP fields

This paper has been announced in the following NEP Reports:

NEP-BIG-2021-04-26 (Big Data)
NEP-CWA-2021-04-26 (Central and Western Asia)
NEP-ECM-2021-04-26 (Econometrics)
NEP-RMG-2021-04-26 (Risk Management)

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.
