IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2603.06587.html

Autonomous AI Agents for Option Hedging: Enhancing Financial Stability through Shortfall Aware Reinforcement Learning

Author

Listed:
  • Minxuan Hu
  • Ziheng Chen
  • Jiayu Yi
  • Wenxi Sun

Abstract

The deployment of autonomous AI agents in derivatives markets has widened a practical gap between static model calibration and realized hedging outcomes. We introduce two reinforcement learning frameworks, a novel Replication Learning of Option Pricing (RLOP) approach and an adaptive extension of Q-learner in Black-Scholes (QLBS), that prioritize shortfall probability and align learning objectives with downside sensitive hedging. Using listed SPY and XOP options, we evaluate models using realized path delta hedging outcome distributions, shortfall probability, and tail risk measures such as Expected Shortfall. Empirically, RLOP reduces shortfall frequency in most slices and shows the clearest tail-risk improvements in stress, while implied volatility fit often favors parametric models yet poorly predicts after-cost hedging performance. This friction-aware RL framework supports a practical approach to autonomous derivatives risk management as AI-augmented trading systems scale.

Suggested Citation

  • Minxuan Hu & Ziheng Chen & Jiayu Yi & Wenxi Sun, 2026. "Autonomous AI Agents for Option Hedging: Enhancing Financial Stability through Shortfall Aware Reinforcement Learning," Papers 2603.06587, arXiv.org.
  • Handle: RePEc:arx:papers:2603.06587
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2603.06587
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Nathan Lassance & Frédéric Vrins, 2018. "A comparison of pricing and hedging performances of equity derivatives models," Applied Economics, Taylor & Francis Journals, vol. 50(10), pages 1122-1137, February.
    2. Shihao Gu & Bryan Kelly & Dacheng Xiu, 2020. "Empirical Asset Pricing via Machine Learning," Review of Finance, European Finance Association, vol. 33(5), pages 2223-2273.
    3. Leland, Hayne E, 1985. "Option Pricing and Replication with Transactions Costs," Journal of Finance, American Finance Association, vol. 40(5), pages 1283-1301, December.
    4. Golbabai, A. & Ballestra, L.V. & Ahmadian, D., 2013. "Superconvergence of the finite element solutions of the Black–Scholes equation," Finance Research Letters, Elsevier, vol. 10(1), pages 17-26.
    5. Hans FÃllmer & Peter Leukert, 2000. "Efficient hedging: Cost versus shortfall risk," Finance and Stochastics, Springer, vol. 4(2), pages 117-146.
    6. Fan, Qingqian & Feng, Sixian, 2022. "An empirical study on the characterization of implied volatility and pricing in the Chinese option market," Finance Research Letters, Elsevier, vol. 49(C).
    7. Black, Fischer, 1976. "The pricing of commodity contracts," Journal of Financial Economics, Elsevier, vol. 3(1-2), pages 167-179.
    8. François, Pascal & Gauthier, Geneviève & Godin, Frédéric & Mendoza, Carlos Octavio Pérez, 2025. "Is the difference between deep hedging and delta hedging a statistical arbitrage?," Finance Research Letters, Elsevier, vol. 73(C).
    9. Li, Lingfei & Wu, Jingyu & Zhu, Minting & Wang, Mancang, 2025. "Analytic solutions for pricing American style options," Finance Research Letters, Elsevier, vol. 86(PB).
    10. Merton, Robert C., 1976. "Option pricing when underlying stock returns are discontinuous," Journal of Financial Economics, Elsevier, vol. 3(1-2), pages 125-144.
    11. Black, Fischer & Scholes, Myron S, 1973. "The Pricing of Options and Corporate Liabilities," Journal of Political Economy, University of Chicago Press, vol. 81(3), pages 637-654, May-June.
    12. Heston, Steven L, 1993. "A Closed-Form Solution for Options with Stochastic Volatility with Applications to Bond and Currency Options," The Review of Financial Studies, Society for Financial Studies, vol. 6(2), pages 327-343.
    13. Shihao Gu & Bryan Kelly & Dacheng Xiu, 2020. "Empirical Asset Pricing via Machine Learning," The Review of Financial Studies, Society for Financial Studies, vol. 33(5), pages 2223-2273.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ziheng Chen & Minxuan Hu & Jiayu Yi & Wenxi Sun, 2026. "Reinforcement Learning for Option Hedging: Static Implied-Volatility Fit versus Shortfall-Aware Performance," Papers 2601.01709, arXiv.org.
    2. Cao, Yi & Zhai, Jia & Wen, Conghua & Zong, Lu & Yang, Ao, 2025. "Commodity futures option valuation – An ensemble model," International Review of Financial Analysis, Elsevier, vol. 105(C).
    3. Suresh M. Sundaresan, 2000. "Continuous‐Time Methods in Finance: A Review and an Assessment," Journal of Finance, American Finance Association, vol. 55(4), pages 1569-1622, August.
    4. Maung, Kenwin & Swanson, Norman R., 2025. "A survey of models and methods used for forecasting when investing in financial markets," International Journal of Forecasting, Elsevier, vol. 41(4), pages 1355-1382.
    5. Yao Wang & Jingmei Zhao & Qing Li & Xiangyu Wei, 2024. "Considering momentum spillover effects via graph neural network in option pricing," Journal of Futures Markets, John Wiley & Sons, Ltd., vol. 44(6), pages 1069-1094, June.
    6. Mark Broadie & Jerome B. Detemple, 2004. "ANNIVERSARY ARTICLE: Option Pricing: Valuation Models and Applications," Management Science, INFORMS, vol. 50(9), pages 1145-1177, September.
    7. Peter Carr & Liuren Wu, 2014. "Static Hedging of Standard Options," Journal of Financial Econometrics, Oxford University Press, vol. 12(1), pages 3-46.
    8. Bjork, Tomas, 2009. "Arbitrage Theory in Continuous Time," OUP Catalogue, Oxford University Press, edition 3, number 9780199574742.
    9. Ako Doffou & Jimmy E. Hilliard, 2001. "Pricing Currency Options Under Stochastic Interest Rates And Jump-Diffusion Processes," Journal of Financial Research, Southern Finance Association;Southwestern Finance Association, vol. 24(4), pages 565-585, December.
    10. Pierre Brugi`ere & Gabriel Turinici, 2025. "Model-Free Deep Hedging with Transaction Costs and Light Data Requirements," Papers 2505.22836, arXiv.org.
    11. Lim, Terence & Lo, Andrew W. & Merton, Robert C. & Scholes, Myron S., 2006. "The Derivatives Sourcebook," Foundations and Trends(R) in Finance, now publishers, vol. 1(5–6), pages 365-572, April.
    12. Paul Handro & Bogdan Dima, 2024. "Analyzing Financial Markets Efficiency: Insights from a Bibliometric and Content Review," Journal of Financial Studies, Institute of Financial Studies, vol. 16(9), pages 119-175, May.
    13. Christina Nikitopoulos-Sklibosios, 2005. "A Class of Markovian Models for the Term Structure of Interest Rates Under Jump-Diffusions," PhD Thesis, Finance Discipline Group, UTS Business School, University of Technology, Sydney, number 6, July-Dece.
    14. Carl Chiarella & Christina Nikitopoulos-Sklibosios & Erik Schlogl, 2005. "A Control Variate Method for Monte Carlo Simulations of Heath-Jarrow-Morton with Jumps," Research Paper Series 167, Quantitative Finance Research Centre, University of Technology, Sydney.
    15. Chen, Gang & Roberts, Matthew C. & Roe, Brian E., 2005. "Forecasting Livestock Feed Cost Risks Using Futures and Options," 2005 Conference, April 18-19, 2005, St. Louis, Missouri 19048, NCR-134 Conference on Applied Commodity Price Analysis, Forecasting, and Market Risk Management.
    16. Yoonsik Hong & Diego Klabjan, 2025. "Statistical Arbitrage in Options Markets by Graph Learning and Synthetic Long Positions," Papers 2508.14762, arXiv.org, revised Aug 2025.
    17. Guidolin, Massimo & Timmermann, Allan, 2003. "Option prices under Bayesian learning: implied volatility dynamics and predictive densities," Journal of Economic Dynamics and Control, Elsevier, vol. 27(5), pages 717-769, March.
    18. Jondeau, Eric & Rockinger, Michael, 2000. "Reading the smile: the message conveyed by methods which infer risk neutral densities," Journal of International Money and Finance, Elsevier, vol. 19(6), pages 885-915, December.
    19. Goodell, John W. & Kumar, Satish & Lim, Weng Marc & Pattnaik, Debidutta, 2021. "Artificial intelligence and machine learning in finance: Identifying foundations, themes, and research clusters from bibliometric analysis," Journal of Behavioral and Experimental Finance, Elsevier, vol. 32(C).
    20. Alexander Lipton, 2024. "Hydrodynamics of Markets:Hidden Links Between Physics and Finance," Papers 2403.09761, arXiv.org.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2603.06587. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.