IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2503.04218.html
   My bibliography  Save this paper

Hedging with Sparse Reward Reinforcement Learning

Author

Listed:
  • Yiheng Ding
  • Gangnan Yuan
  • Dewei Zuo
  • Ting Gao

Abstract

Derivatives, as a critical class of financial instruments, isolate and trade the price attributes of risk assets such as stocks, commodities, and indices, aiding risk management and enhancing market efficiency. However, traditional hedging models, constrained by assumptions such as continuous trading and zero transaction costs, fail to satisfy risk control requirements in complex and uncertain real-world markets. With advances in computing technology and deep learning, data-driven trading strategies are becoming increasingly prevalent. This thesis proposes a derivatives hedging framework integrating deep learning and reinforcement learning. The framework comprises a probabilistic forecasting model and a hedging agent, enabling market probability prediction, derivative pricing, and hedging. Specifically, we design a spatiotemporal attention-based probabilistic financial time series forecasting Transformer to address the scarcity of derivatives hedging data. A low-rank attention mechanism compresses high-dimensional assets into a low-dimensional latent space, capturing nonlinear asset relationships. The Transformer models sequential dependencies within this latent space, improving market probability forecasts and constructing an online training environment for downstream hedging tasks. Additionally, we incorporate generalized geometric Brownian motion to develop a risk-neutral pricing approach for derivatives. We model derivatives hedging as a reinforcement learning problem with sparse rewards and propose a behavior cloning-based recurrent proximal policy optimization (BC-RPPO) algorithm. This pretraining-finetuning framework significantly enhances the hedging agent's performance. Numerical experiments in the U.S. and Chinese financial markets demonstrate our method's superiority over traditional approaches.

Suggested Citation

  • Yiheng Ding & Gangnan Yuan & Dewei Zuo & Ting Gao, 2025. "Hedging with Sparse Reward Reinforcement Learning," Papers 2503.04218, arXiv.org.
  • Handle: RePEc:arx:papers:2503.04218
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2503.04218
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Samuel N. Cohen & Christoph Reisinger & Sheng Wang, 2023. "Arbitrage-Free Neural-SDE Market Models," Applied Mathematical Finance, Taylor & Francis Journals, vol. 30(1), pages 1-46, January.
    2. Black, Fischer & Scholes, Myron S, 1973. "The Pricing of Options and Corporate Liabilities," Journal of Political Economy, University of Chicago Press, vol. 81(3), pages 637-654, May-June.
    3. Heston, Steven L, 1993. "A Closed-Form Solution for Options with Stochastic Volatility with Applications to Bond and Currency Options," The Review of Financial Studies, Society for Financial Studies, vol. 6(2), pages 327-343.
    4. Jay Cao & Jacky Chen & Soroush Farghadani & John Hull & Zissis Poulos & Zeyu Wang & Jun Yuan, 2022. "Gamma and Vega Hedging Using Deep Distributional Reinforcement Learning," Papers 2205.05614, arXiv.org, revised Jan 2023.
    5. Reilly Pickard & F. Wredenhagen & Y. Lawryshyn, 2024. "Optimizing Deep Reinforcement Learning for American Put Option Hedging," Papers 2405.08602, arXiv.org.
    6. David Wu & Sebastian Jaimungal, 2023. "Robust Risk-Aware Option Hedging," Papers 2303.15216, arXiv.org, revised Dec 2023.
    7. Igor Halperin, 2019. "The QLBS Q-Learner goes NuQLear: fitted Q iteration, inverse RL, and option portfolios," Quantitative Finance, Taylor & Francis Journals, vol. 19(9), pages 1543-1553, September.
    8. David Wu & Sebastian Jaimungal, 2023. "Robust Risk-Aware Option Hedging," Applied Mathematical Finance, Taylor & Francis Journals, vol. 30(3), pages 153-174, May.
    9. Jay Cao & Jacky Chen & John Hull & Zissis Poulos, 2021. "Deep Hedging of Derivatives Using Reinforcement Learning," Papers 2103.16409, arXiv.org.
    10. Edoardo Vittori & Michele Trapletti & Marcello Restelli, 2020. "Option Hedging with Risk Averse Reinforcement Learning," Papers 2010.12245, arXiv.org.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Xianhua Peng & Xiang Zhou & Bo Xiao & Yi Wu, 2024. "A Risk Sensitive Contract-unified Reinforcement Learning Approach for Option Hedging," Papers 2411.09659, arXiv.org.
    2. Pascal Franc{c}ois & Genevi`eve Gauthier & Fr'ed'eric Godin & Carlos Octavio P'erez Mendoza, 2024. "Enhancing Deep Hedging of Options with Implied Volatility Surface Feedback Information," Papers 2407.21138, arXiv.org.
    3. Ben Hambly & Renyuan Xu & Huining Yang, 2021. "Recent Advances in Reinforcement Learning in Finance," Papers 2112.04553, arXiv.org, revised Feb 2023.
    4. Zoran Stoiljkovic, 2023. "Applying Reinforcement Learning to Option Pricing and Hedging," Papers 2310.04336, arXiv.org.
    5. Ben Hambly & Renyuan Xu & Huining Yang, 2023. "Recent advances in reinforcement learning in finance," Mathematical Finance, Wiley Blackwell, vol. 33(3), pages 437-503, July.
    6. Zheng Gong & Carmine Ventre & John O'Hara, 2021. "The Efficient Hedging Frontier with Deep Neural Networks," Papers 2104.05280, arXiv.org.
    7. Yannick Limmer & Blanka Horvath, 2023. "Robust Hedging GANs," Papers 2307.02310, arXiv.org.
    8. Parvin Malekzadeh & Zissis Poulos & Jacky Chen & Zeyu Wang & Konstantinos N. Plataniotis, 2024. "EX-DRL: Hedging Against Heavy Losses with EXtreme Distributional Reinforcement Learning," Papers 2408.12446, arXiv.org, revised Aug 2024.
    9. Ofelia Bonesini & Emilio Ferrucci & Ioannis Gasteratos & Antoine Jacquier, 2024. "Rough differential equations for volatility," Papers 2412.21192, arXiv.org.
    10. Kang Gao & Stephen Weston & Perukrishnen Vytelingum & Namid R. Stillman & Wayne Luk & Ce Guo, 2023. "Deeper Hedging: A New Agent-based Model for Effective Deep Hedging," Papers 2310.18755, arXiv.org.
    11. Xia, Kun & Yang, Xuewei & Zhu, Peng, 2023. "Delta hedging and volatility-price elasticity: A two-step approach," Journal of Banking & Finance, Elsevier, vol. 153(C).
    12. Reilly Pickard & F. Wredenhagen & Y. Lawryshyn, 2024. "Optimizing Deep Reinforcement Learning for American Put Option Hedging," Papers 2405.08602, arXiv.org.
    13. Francesco Mandelli & Marco Pinciroli & Michele Trapletti & Edoardo Vittori, 2023. "Reinforcement Learning for Credit Index Option Hedging," Papers 2307.09844, arXiv.org.
    14. Pascal Franc{c}ois & Genevi`eve Gauthier & Fr'ed'eric Godin & Carlos Octavio P'erez Mendoza, 2024. "Is the difference between deep hedging and delta hedging a statistical arbitrage?," Papers 2407.14736, arXiv.org, revised Oct 2024.
    15. Reilly Pickard & Finn Wredenhagen & Julio DeJesus & Mario Schlener & Yuri Lawryshyn, 2024. "Hedging American Put Options with Deep Reinforcement Learning," Papers 2405.06774, arXiv.org.
    16. Carol Alexandra & Leonardo M. Nogueira, 2005. "Optimal Hedging and Scale Inavriance: A Taxonomy of Option Pricing Models," ICMA Centre Discussion Papers in Finance icma-dp2005-10, Henley Business School, University of Reading, revised Nov 2005.
    17. Thomas Kokholm & Martin Stisen, 2015. "Joint pricing of VIX and SPX options with stochastic volatility and jump models," Journal of Risk Finance, Emerald Group Publishing Limited, vol. 16(1), pages 27-48, January.
    18. Antoine Jacquier & Patrick Roome, 2015. "Black-Scholes in a CEV random environment," Papers 1503.08082, arXiv.org, revised Nov 2017.
    19. Eduardo Abi Jaber, 2022. "The characteristic function of Gaussian stochastic volatility models: an analytic expression," Working Papers hal-02946146, HAL.
    20. Peter Carr & Liuren Wu, 2014. "Static Hedging of Standard Options," Journal of Financial Econometrics, Oxford University Press, vol. 12(1), pages 3-46.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2503.04218. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.