Reinforcement Learning and Deep Stochastic Optimal Control for Final Quadratic Hedging

My bibliography Save this paper

Reinforcement Learning and Deep Stochastic Optimal Control for Final Quadratic Hedging

Author

Listed:

Bernhard Hientzsch

Registered:

Abstract

We consider two data driven approaches, Reinforcement Learning (RL) and Deep Trajectory-based Stochastic Optimal Control (DTSOC) for hedging a European call option without and with transaction cost according to a quadratic hedging P&L objective at maturity ("variance-optimal hedging" or "final quadratic hedging"). We study the performance of the two approaches under various market environments (modeled via the Black-Scholes and/or the log-normal SABR model) to understand their advantages and limitations. Without transaction costs and in the Black-Scholes model, both approaches match the performance of the variance-optimal Delta hedge. In the log-normal SABR model without transaction costs, they match the performance of the variance-optimal Barlett's Delta hedge. Agents trained on Black-Scholes trajectories with matching initial volatility but used on SABR trajectories match the performance of Bartlett's Delta hedge in average cost, but show substantially wider variance. To apply RL approaches to these problems, P&L at maturity is written as sum of step-wise contributions and variants of RL algorithms are implemented and used that minimize expectation of second moments of such sums.

Suggested Citation

Bernhard Hientzsch, 2023. "Reinforcement Learning and Deep Stochastic Optimal Control for Final Quadratic Hedging," Papers 2401.08600, arXiv.org.

Handle: RePEc:arx:papers:2401.08600

Download full text from publisher

References listed on IDEAS

Jian Liang & Zhe Xu & Peter Li, 2021. "Deep learning-based least squares forward-backward stochastic differential equation solver for high-dimensional derivative pricing," Quantitative Finance, Taylor & Francis Journals, vol. 21(8), pages 1309-1323, August.
Nicolas Boursin & Carl Remlinger & Joseph Mikael & Carol Anne Hargreaves, 2022. "Deep Generators on Commodity Markets; application to Deep Hedging," Papers 2205.13942, arXiv.org.
Bernhard Hientzsch, 2019. "Introduction to Solving Quant Finance Problems with Time-Stepped FBSDE and Deep Learning," Papers 1911.12231, arXiv.org.
Nicolas Boursin & Carl Remlinger & Joseph Mikael, 2022. "Deep Generators on Commodity Markets Application to Deep Hedging," Risks, MDPI, vol. 11(1), pages 1-18, December.
Arun Kumar Polala & Bernhard Hientzsch, 2023. "Parametric Differential Machine Learning for Pricing and Calibration," Papers 2302.06682, arXiv.org, revised Feb 2023.
Martin Keller-Ressel, 2022. "Bartlett's Delta revisited: Variance-optimal hedging in the lognormal SABR and in the rough Bergomi model," Papers 2207.13573, arXiv.org.
Yajie Yu & Narayan Ganesan & Bernhard Hientzsch, 2023. "Backward Deep BSDE Methods and Applications to Nonlinear Problems," Risks, MDPI, vol. 11(3), pages 1-16, March.
Narayan Ganesan & Yajie Yu & Bernhard Hientzsch, 2020. "Pricing Barrier Options with DeepBSDEs," Papers 2005.10966, arXiv.org, revised Sep 2024.
Patrick S. Hagan & Andrew Lesniewski, 2017. "Bartlett's delta in the SABR model," Papers 1704.03110, arXiv.org, revised May 2020.
A. Max Reppen & H. Mete Soner & Valentin Tissot-Daguette, 2022. "Deep Stochastic Optimization in Finance," Papers 2205.04604, arXiv.org.
Samuel N. Cohen & Christoph Reisinger & Sheng Wang, 2021. "Arbitrage-free neural-SDE market models," Papers 2105.11053, arXiv.org, revised Aug 2021.
Jay Cao & Jacky Chen & John Hull & Zissis Poulos, 2021. "Deep Hedging of Derivatives Using Reinforcement Learning," Papers 2103.16409, arXiv.org.

Full references (including those not matched with items on IDEAS)

Citations

Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.

Cited by:

Hardik Routray & Bernhard Hientzsch, 2024. "Enforcing asymptotic behavior with DNNs for approximation and regression in finance," Papers 2411.05257, arXiv.org, revised Jul 2025.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Ali Fathi & Bernhard Hientzsch, 2023. "A Comparison of Reinforcement Learning and Deep Trajectory Based Stochastic Control Agents for Stepwise Mean-Variance Hedging," Papers 2302.07996, arXiv.org, revised Nov 2023.
Hardik Routray & Bernhard Hientzsch, 2024. "Enforcing asymptotic behavior with DNNs for approximation and regression in finance," Papers 2411.05257, arXiv.org, revised Jul 2025.
Nacira Agram & Bernt Øksendal & Jan Rems, 2024. "Deep learning for quadratic hedging in incomplete jump market," Digital Finance, Springer, vol. 6(3), pages 463-499, September.
Yajie Yu & Bernhard Hientzsch & Narayan Ganesan, 2020. "Backward Deep BSDE Methods and Applications to Nonlinear Problems," Papers 2006.07635, arXiv.org.
Nacira Agram & Bernt {O}ksendal & Jan Rems, 2024. "Deep learning for quadratic hedging in incomplete jump market," Papers 2407.13688, arXiv.org.
Vedant Choudhary & Sebastian Jaimungal & Maxime Bergeron, 2023. "FuNVol: A Multi-Asset Implied Volatility Market Simulator using Functional Principal Components and Neural SDEs," Papers 2303.00859, arXiv.org, revised Dec 2023.
Alexandre Carbonneau & Fr'ed'eric Godin, 2021. "Deep equal risk pricing of financial derivatives with non-translation invariant risk measures," Papers 2107.11340, arXiv.org.
Federico Giorgi & Stefano Herzel & Paolo Pigato, 2023. "A Reinforcement Learning Algorithm for Trading Commodities," CEIS Research Paper 552, Tor Vergata University, CEIS, revised 18 Feb 2023.
Emmanuel Gnabeyeu & Omar Karkar & Imad Idboufous, 2024. "Solving The Dynamic Volatility Fitting Problem: A Deep Reinforcement Learning Approach," Papers 2410.11789, arXiv.org.
Jay Cao & Jacky Chen & Soroush Farghadani & John Hull & Zissis Poulos & Zeyu Wang & Jun Yuan, 2022. "Gamma and Vega Hedging Using Deep Distributional Reinforcement Learning," Papers 2205.05614, arXiv.org, revised Jan 2023.
Rong Du & Duy-Minh Dang, 2023. "Fourier Neural Network Approximation of Transition Densities in Finance," Papers 2309.03966, arXiv.org, revised Sep 2024.
Ben Hambly & Renyuan Xu & Huining Yang, 2021. "Recent Advances in Reinforcement Learning in Finance," Papers 2112.04553, arXiv.org, revised Feb 2023.
Lorenc Kapllani & Long Teng, 2024. "A backward differential deep learning-based algorithm for solving high-dimensional nonlinear backward stochastic differential equations," Papers 2404.08456, arXiv.org.
Pierre Renucci, 2023. "Optimal Linear Signal: An Unsupervised Machine Learning Framework to Optimize PnL with Linear Signals," Papers 2401.05337, arXiv.org.
Aghapour, Ahmad & Arian, Hamid & Seco, Luis, 2025. "Deep-time neural networks: An efficient approach for solving high-dimensional PDEs," Applied Mathematics and Computation, Elsevier, vol. 488(C).
Yannick Limmer & Blanka Horvath, 2023. "Robust Hedging GANs," Papers 2307.02310, arXiv.org.
Josef Teichmann & Hanna Wutte, 2023. "Machine Learning-powered Pricing of the Multidimensional Passport Option," Papers 2307.14887, arXiv.org.
Parvin Malekzadeh & Zissis Poulos & Jacky Chen & Zeyu Wang & Konstantinos N. Plataniotis, 2024. "EX-DRL: Hedging Against Heavy Losses with EXtreme Distributional Reinforcement Learning," Papers 2408.12446, arXiv.org, revised Aug 2024.
Christa Cuchiero & Philipp Schmocker & Josef Teichmann, 2023. "Global universal approximation of functional input maps on weighted spaces," Papers 2306.03303, arXiv.org, revised Feb 2025.
Beatrice Acciaio & Anastasis Kratsios & Gudmund Pammer, 2022. "Designing Universal Causal Deep Learning Models: The Geometric (Hyper)Transformer," Papers 2201.13094, arXiv.org, revised Mar 2023.

More about this item

NEP fields

This paper has been announced in the following NEP Reports:

NEP-BIG-2024-03-11 (Big Data)
NEP-CMP-2024-03-11 (Computational Economics)
NEP-RMG-2024-03-11 (Risk Management)

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2401.08600. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Reinforcement Learning and Deep Stochastic Optimal Control for Final Quadratic Hedging

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Citations

Most related items

More about this item

NEP fields

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data