A Comparison of Reinforcement Learning and Deep Trajectory Based Stochastic Control Agents for Stepwise Mean-Variance Hedging

My bibliography Save this paper

A Comparison of Reinforcement Learning and Deep Trajectory Based Stochastic Control Agents for Stepwise Mean-Variance Hedging

Author

Listed:

Ali Fathi
Bernhard Hientzsch

Registered:

Abstract

We consider two data-driven approaches to hedging, Reinforcement Learning and Deep Trajectory-based Stochastic Optimal Control, under a stepwise mean-variance objective. We compare their performance for a European call option in the presence of transaction costs under discrete trading schedules. We do this for a setting where stock prices follow Black-Scholes-Merton dynamics and the "book-keeping" price for the option is given by the Black-Scholes-Merton model with the same parameters. This simulated data setting provides a "sanitized" lab environment with simple enough features where we can conduct a detailed study of strengths, features, issues, and limitations of these two approaches. However, the formulation is model free and could allow any other setting with available book-keeping prices. We consider this study as a first step to develop, test, and validate autonomous hedging agents, and we provide blueprints for such efforts that address various concerns and requirements.

Suggested Citation

Ali Fathi & Bernhard Hientzsch, 2023. "A Comparison of Reinforcement Learning and Deep Trajectory Based Stochastic Control Agents for Stepwise Mean-Variance Hedging," Papers 2302.07996, arXiv.org, revised Nov 2023.

Handle: RePEc:arx:papers:2302.07996

Download full text from publisher

References listed on IDEAS

Samuel N. Cohen & Derek Snow & Lukasz Szpruch, 2021. "Black-box model risk in finance," Papers 2102.04757, arXiv.org.
Magnus Wiese & Lianjun Bai & Ben Wood & Hans Buehler, 2019. "Deep Hedging: Learning to Simulate Equity Option Markets," Papers 1911.01700, arXiv.org.
Magnus Wiese & Phillip Murray, 2022. "Risk-Neutral Market Simulation," Papers 2202.13996, arXiv.org.
Jay Cao & Jacky Chen & John Hull & Zissis Poulos, 2021. "Deep Hedging of Derivatives Using Reinforcement Learning," Papers 2103.16409, arXiv.org.
Nicolas Boursin & Carl Remlinger & Joseph Mikael, 2022. "Deep Generators on Commodity Markets Application to Deep Hedging," Risks, MDPI, vol. 11(1), pages 1-18, December.
Narayan Ganesan & Yajie Yu & Bernhard Hientzsch, 2020. "Pricing Barrier Options with DeepBSDEs," Papers 2005.10966, arXiv.org.
A. Max Reppen & H. Mete Soner & Valentin Tissot-Daguette, 2022. "Deep Stochastic Optimization in Finance," Papers 2205.04604, arXiv.org.
Jian Liang & Zhe Xu & Peter Li, 2021. "Deep learning-based least squares forward-backward stochastic differential equation solver for high-dimensional derivative pricing," Quantitative Finance, Taylor & Francis Journals, vol. 21(8), pages 1309-1323, August.
Nicolas Boursin & Carl Remlinger & Joseph Mikael & Carol Anne Hargreaves, 2022. "Deep Generators on Commodity Markets; application to Deep Hedging," Papers 2205.13942, arXiv.org.
Bernhard Hientzsch, 2019. "Introduction to Solving Quant Finance Problems with Time-Stepped FBSDE and Deep Learning," Papers 1911.12231, arXiv.org.

Full references (including those not matched with items on IDEAS)

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Bernhard Hientzsch, 2023. "Reinforcement Learning and Deep Stochastic Optimal Control for Final Quadratic Hedging," Papers 2401.08600, arXiv.org.
Vedant Choudhary & Sebastian Jaimungal & Maxime Bergeron, 2023. "FuNVol: A Multi-Asset Implied Volatility Market Simulator using Functional Principal Components and Neural SDEs," Papers 2303.00859, arXiv.org, revised Dec 2023.
Yajie Yu & Bernhard Hientzsch & Narayan Ganesan, 2020. "Backward Deep BSDE Methods and Applications to Nonlinear Problems," Papers 2006.07635, arXiv.org.
Samuel N. Cohen & Christoph Reisinger & Sheng Wang, 2022. "Estimating risks of option books using neural-SDE market models," Papers 2202.07148, arXiv.org.
Michael Karpe, 2020. "An overall view of key problems in algorithmic trading and recent progress," Papers 2006.05515, arXiv.org.
Solveig Flaig & Gero Junike, 2022. "Scenario Generation for Market Risk Models Using Generative Neural Networks," Risks, MDPI, vol. 10(11), pages 1-28, October.
Blanka Horvath & Josef Teichmann & Žan Žurič, 2021. "Deep Hedging under Rough Volatility," Risks, MDPI, vol. 9(7), pages 1-20, July.
Alexandre Carbonneau & Fr'ed'eric Godin, 2021. "Deep equal risk pricing of financial derivatives with non-translation invariant risk measures," Papers 2107.11340, arXiv.org.
Hans Buhler & Blanka Horvath & Terry Lyons & Imanol Perez Arribas & Ben Wood, 2020. "A Data-driven Market Simulator for Small Data Environments," Papers 2006.14498, arXiv.org.
Federico Giorgi & Stefano Herzel & Paolo Pigato, 2023. "A Reinforcement Learning Algorithm for Trading Commodities," CEIS Research Paper 552, Tor Vergata University, CEIS, revised 18 Feb 2023.
Jay Cao & Jacky Chen & Soroush Farghadani & John Hull & Zissis Poulos & Zeyu Wang & Jun Yuan, 2022. "Gamma and Vega Hedging Using Deep Distributional Reinforcement Learning," Papers 2205.05614, arXiv.org, revised Jan 2023.
Ben Hambly & Renyuan Xu & Huining Yang, 2021. "Recent Advances in Reinforcement Learning in Finance," Papers 2112.04553, arXiv.org, revised Feb 2023.
Blanka Horvath & Josef Teichmann & Zan Zuric, 2021. "Deep Hedging under Rough Volatility," Papers 2102.01962, arXiv.org.
Lorenc Kapllani & Long Teng, 2024. "A backward differential deep learning-based algorithm for solving high-dimensional nonlinear backward stochastic differential equations," Papers 2404.08456, arXiv.org.
Weilong Fu & Ali Hirsa & Jorg Osterrieder, 2022. "Simulating financial time series using attention," Papers 2207.00493, arXiv.org.
Hans Buehler & Phillip Murray & Mikko S. Pakkanen & Ben Wood, 2021. "Deep Hedging: Learning Risk-Neutral Implied Volatility Dynamics," Papers 2103.11948, arXiv.org, revised Jul 2021.
Pierre Renucci, 2023. "Optimal Linear Signal: An Unsupervised Machine Learning Framework to Optimize PnL with Linear Signals," Papers 2401.05337, arXiv.org.
Josef Teichmann & Hanna Wutte, 2023. "Machine Learning-powered Pricing of the Multidimensional Passport Option," Papers 2307.14887, arXiv.org.
Zacharia Issa & Blanka Horvath, 2023. "Non-parametric online market regime detection and regime clustering for multidimensional and path-dependent data structures," Papers 2306.15835, arXiv.org.
Ariel Neufeld & Julian Sester & Mario v{S}iki'c, 2022. "Markov Decision Processes under Model Uncertainty," Papers 2206.06109, arXiv.org, revised Jan 2023.

More about this item

NEP fields

This paper has been announced in the following NEP Reports:

NEP-CMP-2023-04-03 (Computational Economics)

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2302.07996. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

A Comparison of Reinforcement Learning and Deep Trajectory Based Stochastic Control Agents for Stepwise Mean-Variance Hedging

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Most related items

More about this item

NEP fields

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data