IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2103.16409.html
   My bibliography  Save this paper

Deep Hedging of Derivatives Using Reinforcement Learning

Author

Listed:
  • Jay Cao
  • Jacky Chen
  • John Hull
  • Zissis Poulos

Abstract

This paper shows how reinforcement learning can be used to derive optimal hedging strategies for derivatives when there are transaction costs. The paper illustrates the approach by showing the difference between using delta hedging and optimal hedging for a short position in a call option when the objective is to minimize a function equal to the mean hedging cost plus a constant times the standard deviation of the hedging cost. Two situations are considered. In the first, the asset price follows a geometric Brownian motion. In the second, the asset price follows a stochastic volatility process. The paper extends the basic reinforcement learning approach in a number of ways. First, it uses two different Q-functions so that both the expected value of the cost and the expected value of the square of the cost are tracked for different state/action combinations. This approach increases the range of objective functions that can be used. Second, it uses a learning algorithm that allows for continuous state and action space. Third, it compares the accounting P&L approach (where the hedged position is valued at each step) and the cash flow approach (where cash inflows and outflows are used). We find that a hybrid approach involving the use of an accounting P&L approach that incorporates a relatively simple valuation model works well. The valuation model does not have to correspond to the process assumed for the underlying asset price.

Suggested Citation

  • Jay Cao & Jacky Chen & John Hull & Zissis Poulos, 2021. "Deep Hedging of Derivatives Using Reinforcement Learning," Papers 2103.16409, arXiv.org.
  • Handle: RePEc:arx:papers:2103.16409
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2103.16409
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Boyle, Phelim P & Vorst, Ton, 1992. "Option Replication in Discrete Time with Transaction Costs," Journal of Finance, American Finance Association, vol. 47(1), pages 271-293, March.
    2. Philippe Artzner & Freddy Delbaen & Jean‐Marc Eber & David Heath, 1999. "Coherent Measures of Risk," Mathematical Finance, Wiley Blackwell, vol. 9(3), pages 203-228, July.
    3. Black, Fischer & Scholes, Myron S, 1973. "The Pricing of Options and Corporate Liabilities," Journal of Political Economy, University of Chicago Press, vol. 81(3), pages 637-654, May-June.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Monoyios, Michael, 2004. "Option pricing with transaction costs using a Markov chain approximation," Journal of Economic Dynamics and Control, Elsevier, vol. 28(5), pages 889-913, February.
    2. Balder, Sven & Brandl, Michael & Mahayni, Antje, 2009. "Effectiveness of CPPI strategies under discrete-time trading," Journal of Economic Dynamics and Control, Elsevier, vol. 33(1), pages 204-220, January.
    3. Mark Broadie & Jerome B. Detemple, 2004. "ANNIVERSARY ARTICLE: Option Pricing: Valuation Models and Applications," Management Science, INFORMS, vol. 50(9), pages 1145-1177, September.
    4. Domagoj Demeterfi & Kathrin Glau & Linus Wunderlich, 2025. "Function approximations for counterparty credit exposure calculations," Papers 2507.09004, arXiv.org.
    5. Wang, Jun & Liang, Jin-Rong & Lv, Long-Jin & Qiu, Wei-Yuan & Ren, Fu-Yao, 2012. "Continuous time Black–Scholes equation with transaction costs in subdiffusive fractional Brownian motion regime," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 391(3), pages 750-759.
    6. Heller, Yuval & Schreiber, Amnon, 2020. "Short-term investments and indices of risk," Theoretical Economics, Econometric Society, vol. 15(3), July.
    7. Zachary Feinstein & Birgit Rudloff, 2018. "Scalar multivariate risk measures with a single eligible asset," Papers 1807.10694, arXiv.org, revised Feb 2021.
    8. Pascal Franc{c}ois & Genevi`eve Gauthier & Fr'ed'eric Godin & Carlos Octavio P'erez Mendoza, 2024. "Enhancing Deep Hedging of Options with Implied Volatility Surface Feedback Information," Papers 2407.21138, arXiv.org, revised Aug 2025.
    9. Dichtl, Hubert & Drobetz, Wolfgang, 2011. "Portfolio insurance and prospect theory investors: Popularity and optimal design of capital protected financial products," Journal of Banking & Finance, Elsevier, vol. 35(7), pages 1683-1697, July.
    10. Peter Christoffersen & Ruslan Goyenko & Kris Jacobs & Mehdi Karoui, 2018. "Illiquidity Premia in the Equity Options Market," The Review of Financial Studies, Society for Financial Studies, vol. 31(3), pages 811-851.
    11. Patrice Gaillardetz & Saeb Hachem, 2019. "Risk-Control Strategies," Papers 1908.02228, arXiv.org.
    12. Jinglun Yao & Sabine Laurent & Brice B'enaben, 2017. "Managing Volatility Risk: An Application of Karhunen-Lo\`eve Decomposition and Filtered Historical Simulation," Papers 1710.00859, arXiv.org.
    13. Virmani, Vineet, 2014. "Model Risk in Pricing Path-dependent Derivatives: An Illustration," IIMA Working Papers WP2014-03-22, Indian Institute of Management Ahmedabad, Research and Publication Department.
    14. Scheuenstuhl, Gerhard & Zagst, Rudi, 2008. "Integrated portfolio management with options," European Journal of Operational Research, Elsevier, vol. 185(3), pages 1477-1500, March.
    15. Carr, Peter & Geman, Helyette & Madan, Dilip B., 2001. "Pricing and hedging in incomplete markets," Journal of Financial Economics, Elsevier, vol. 62(1), pages 131-167, October.
    16. Alexandre Carbonneau & Fr'ed'eric Godin, 2021. "Deep equal risk pricing of financial derivatives with non-translation invariant risk measures," Papers 2107.11340, arXiv.org.
    17. Steve Zymler & Daniel Kuhn & Berç Rustem, 2013. "Worst-Case Value at Risk of Nonlinear Portfolios," Management Science, INFORMS, vol. 59(1), pages 172-188, July.
    18. Lin, Zih-Ying & Chang, Chuang-Chang & Wang, Yaw-Huei, 2018. "The impacts of asymmetric information and short sales on the illiquidity risk premium in the stock option market," Journal of Banking & Finance, Elsevier, vol. 94(C), pages 152-165.
    19. Bas Peeters & Cees L. Dert & André Lucas, 2003. "Black Scholes for Portfolios of Options in Discrete Time: the Price is Right, the Hedge is wrong," Tinbergen Institute Discussion Papers 03-090/2, Tinbergen Institute.
    20. Al–Zhour, Zeyad & Barfeie, Mahdiar & Soleymani, Fazlollah & Tohidi, Emran, 2019. "A computational method to price with transaction costs under the nonlinear Black–Scholes model," Chaos, Solitons & Fractals, Elsevier, vol. 127(C), pages 291-301.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2103.16409. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.