IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2601.01709.html

Reinforcement Learning for Option Hedging: Static Implied-Volatility Fit versus Shortfall-Aware Performance

Author

Listed:
  • Ziheng Chen
  • Minxuan Hu
  • Jiayu Yi
  • Wenxi Sun

Abstract

We extend the Q-learner in Black-Scholes (QLBS) framework by incorporating risk aversion and trading costs, and propose a novel Replication Learning of Option Pricing (RLOP) approach. Both methods are fully compatible with standard reinforcement learning algorithms and operate under market frictions. Using SPY and XOP option data, we evaluate performance along static and dynamic dimensions. Adaptive-QLBS achieves higher static pricing accuracy in implied volatility space, while RLOP delivers superior dynamic hedging performance by reducing shortfall probability. These results highlight the importance of evaluating option pricing models beyond static fit, emphasizing realized hedging outcomes.

Suggested Citation

  • Ziheng Chen & Minxuan Hu & Jiayu Yi & Wenxi Sun, 2026. "Reinforcement Learning for Option Hedging: Static Implied-Volatility Fit versus Shortfall-Aware Performance," Papers 2601.01709, arXiv.org.
  • Handle: RePEc:arx:papers:2601.01709
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2601.01709
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Hans FÃllmer & Peter Leukert, 2000. "Efficient hedging: Cost versus shortfall risk," Finance and Stochastics, Springer, vol. 4(2), pages 117-146.
    2. Fan, Qingqian & Feng, Sixian, 2022. "An empirical study on the characterization of implied volatility and pricing in the Chinese option market," Finance Research Letters, Elsevier, vol. 49(C).
    3. Stutz, Michael & Popova, Ivilina, 2025. "JDAPP: Reinforcement learning with phaseshift jump diffusion and Fourier randomization for robust financial agent training," Finance Research Letters, Elsevier, vol. 86(PG).
    4. François, Pascal & Gauthier, Geneviève & Godin, Frédéric & Mendoza, Carlos Octavio Pérez, 2025. "Is the difference between deep hedging and delta hedging a statistical arbitrage?," Finance Research Letters, Elsevier, vol. 73(C).
    5. Li, Lingfei & Wu, Jingyu & Zhu, Minting & Wang, Mancang, 2025. "Analytic solutions for pricing American style options," Finance Research Letters, Elsevier, vol. 86(PB).
    6. Merton, Robert C., 1976. "Option pricing when underlying stock returns are discontinuous," Journal of Financial Economics, Elsevier, vol. 3(1-2), pages 125-144.
    7. Black, Fischer & Scholes, Myron S, 1973. "The Pricing of Options and Corporate Liabilities," Journal of Political Economy, University of Chicago Press, vol. 81(3), pages 637-654, May-June.
    8. Heston, Steven L, 1993. "A Closed-Form Solution for Options with Stochastic Volatility with Applications to Bond and Currency Options," The Review of Financial Studies, Society for Financial Studies, vol. 6(2), pages 327-343.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Minxuan Hu & Ziheng Chen & Jiayu Yi & Wenxi Sun, 2026. "Autonomous AI Agents for Option Hedging: Enhancing Financial Stability through Shortfall Aware Reinforcement Learning," Papers 2603.06587, arXiv.org.
    2. Yoonsik Hong & Diego Klabjan, 2025. "Statistical Arbitrage in Options Markets by Graph Learning and Synthetic Long Positions," Papers 2508.14762, arXiv.org, revised Aug 2025.
    3. Gonçalo Faria & João Correia-da-Silva, 2014. "A closed-form solution for options with ambiguity about stochastic volatility," Review of Derivatives Research, Springer, vol. 17(2), pages 125-159, July.
    4. Li, Pengshi & Lin, Yan & Yu, Xing & Liu, Guifang, 2025. "Does bid-ask spread explains the smile? On DVF and DML," Pacific-Basin Finance Journal, Elsevier, vol. 90(C).
    5. Domagoj Demeterfi & Kathrin Glau & Linus Wunderlich, 2025. "Function approximations for counterparty credit exposure calculations," Papers 2507.09004, arXiv.org.
    6. Carol Alexandra & Leonardo M. Nogueira, 2005. "Optimal Hedging and Scale Inavriance: A Taxonomy of Option Pricing Models," ICMA Centre Discussion Papers in Finance icma-dp2005-10, Henley Business School, University of Reading, revised Nov 2005.
    7. Peter Carr & Liuren Wu, 2014. "Static Hedging of Standard Options," Journal of Financial Econometrics, Oxford University Press, vol. 12(1), pages 3-46.
    8. René Garcia & Richard Luger & Eric Renault, 2000. "Asymmetric Smiles, Leverage Effects and Structural Parameters," Working Papers 2000-57, Center for Research in Economics and Statistics.
    9. Carvalho, Augusto & Guimaraes, Bernardo, 2018. "State-controlled companies and political risk: Evidence from the 2014 Brazilian election," Journal of Public Economics, Elsevier, vol. 159(C), pages 66-78.
    10. Jurczenko, Emmanuel & Maillet, Bertrand & Negrea, Bogdan, 2002. "Revisited multi-moment approximate option pricing models a general comparison (Part 1)," LSE Research Online Documents on Economics 24950, London School of Economics and Political Science, LSE Library.
    11. Viktor Stojkoski & Trifce Sandev & Lasko Basnarkov & Ljupco Kocarev & Ralf Metzler, 2020. "Generalised geometric Brownian motion: Theory and applications to option pricing," Papers 2011.00312, arXiv.org.
    12. Yeap, Claudia & Kwok, Simon S. & Choy, S. T. Boris, 2016. "A Flexible Generalised Hyperbolic Option Pricing Model and its Special Cases," Working Papers 2016-14, University of Sydney, School of Economics.
    13. José Valentim Machado Vicente & Jaqueline Terra Moura Marins, 2019. "A Volatility Smile-Based Uncertainty Index," Working Papers Series 502, Central Bank of Brazil, Research Department.
    14. Ciprian Necula & Gabriel Drimus & Walter Farkas, 2019. "A general closed form option pricing formula," Review of Derivatives Research, Springer, vol. 22(1), pages 1-40, April.
    15. Yongxin Yang & Yu Zheng & Timothy M. Hospedales, 2016. "Gated Neural Networks for Option Pricing: Rationality by Design," Papers 1609.07472, arXiv.org, revised Mar 2020.
    16. Semih Yon & Cafer Erhan Bozdag, 2014. "Test of Log-Normal Process with Importance Sampling for Options Pricing," Proceedings of Economics and Finance Conferences 0401571, International Institute of Social and Economic Sciences.
    17. Leunga Njike, Charles Guy & Hainaut, Donatien, 2024. "Affine Heston model style with self-exciting jumps and long memory," LIDAM Discussion Papers ISBA 2024001, Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA).
    18. Zura Kakushadze, 2016. "Volatility Smile as Relativistic Effect," Papers 1610.02456, arXiv.org, revised Feb 2017.
    19. Ibáñez, Alfredo, 2008. "Factorization of European and American option prices under complete and incomplete markets," Journal of Banking & Finance, Elsevier, vol. 32(2), pages 311-325, February.
    20. Henri Bertholon & Alain Monfort & Fulvio Pegoraro, 2006. "Pricing and Inference with Mixtures of Conditionally Normal Processes," Working Papers 2006-28, Center for Research in Economics and Statistics.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2601.01709. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.