IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2512.09224.html

Exploratory Mean-Variance with Jumps: An Equilibrium Approach

Author

Listed:
  • Yuling Max Chen
  • Bin Li
  • David Saunders

Abstract

Revisiting the continuous-time Mean-Variance (MV) Portfolio Optimization problem, we model the market dynamics with a jump-diffusion process and apply Reinforcement Learning (RL) techniques to facilitate informed exploration within the control space. We recognize the time-inconsistency of the MV problem and adopt the time-inconsistent control (TIC) approach to analytically solve for an exploratory equilibrium investment policy, which is a Gaussian distribution centered on the equilibrium control of the classical MV problem. Our approach accounts for time-inconsistent preferences and actions, and our equilibrium policy is the best option an investor can take at any given time during the investment period. Moreover, we leverage the martingale properties of the equilibrium policy, design a RL model, and propose an Actor-Critic RL algorithm. All of our RL model parameters converge to the corresponding true values in a simulation study. Our numerical study on 24 years of real market data shows that the proposed RL model is profitable in 13 out of 14 tests, demonstrating its practical applicability in real world investment.

Suggested Citation

  • Yuling Max Chen & Bin Li & David Saunders, 2025. "Exploratory Mean-Variance with Jumps: An Equilibrium Approach," Papers 2512.09224, arXiv.org.
  • Handle: RePEc:arx:papers:2512.09224
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2512.09224
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Ernesto Mordecki, 2002. "Optimal stopping and perpetual options for Lévy processes," Finance and Stochastics, Springer, vol. 6(4), pages 473-493.
    2. Xu, Weidong & Wu, Chongfeng & Xu, Weijun & Li, Hongyi, 2009. "A jump-diffusion model for option pricing under fuzzy environments," Insurance: Mathematics and Economics, Elsevier, vol. 44(3), pages 337-344, June.
    3. Das, Sanjiv R., 2002. "The surprise element: jumps in interest rates," Journal of Econometrics, Elsevier, vol. 106(1), pages 27-65, January.
    4. Bates, David S, 1991. "The Crash of '87: Was It Expected? The Evidence from Options Markets," Journal of Finance, American Finance Association, vol. 46(3), pages 1009-1044, July.
    5. Peter Carr & Anita Mayo, 2007. "On the Numerical Evaluation of Option Prices in Jump Diffusion Processes," The European Journal of Finance, Taylor & Francis Journals, vol. 13(4), pages 353-372.
    6. Erhan Bayraktar & Masahiko Egami, 2008. "Optimizing venture capital investments in a jump diffusion model," Mathematical Methods of Operations Research, Springer;Gesellschaft für Operations Research (GOR);Nederlands Genootschap voor Besliskunde (NGB), vol. 67(1), pages 21-42, February.
    7. Min Dai & Yuchao Dong & Yanwei Jia, 2023. "Learning equilibrium mean‐variance strategy," Mathematical Finance, Wiley Blackwell, vol. 33(4), pages 1166-1212, October.
    8. Yuanyuan Zhang & Xiang Li & Sini Guo, 2018. "Portfolio selection problems with Markowitz’s mean–variance framework: a review of literature," Fuzzy Optimization and Decision Making, Springer, vol. 17(2), pages 125-158, June.
    9. Xie, Shuxiang & Li, Zhongfei & Wang, Shouyang, 2008. "Continuous-time portfolio selection with liability: Mean-variance model and stochastic LQ approach," Insurance: Mathematics and Economics, Elsevier, vol. 42(3), pages 943-953, June.
    10. Haoran Wang & Xun Yu Zhou, 2020. "Continuous‐time mean–variance portfolio selection: A reinforcement learning framework," Mathematical Finance, Wiley Blackwell, vol. 30(4), pages 1273-1308, October.
    11. C. He & J. Kennedy & T. Coleman & P. Forsyth & Y. Li & K. Vetzal, 2006. "Calibration and hedging under jump diffusion," Review of Derivatives Research, Springer, vol. 9(1), pages 1-35, January.
    12. Karl Friedrich Mina & Gerald H. L. Cheang & Carl Chiarella, 2015. "Approximate Hedging Of Options Under Jump-Diffusion Processes," International Journal of Theoretical and Applied Finance (IJTAF), World Scientific Publishing Co. Pte. Ltd., vol. 18(04), pages 1-26.
    13. Leif Andersen & Jesper Andreasen, 2000. "Jump-Diffusion Processes: Volatility Smile Fitting and Numerical Methods for Option Pricing," Review of Derivatives Research, Springer, vol. 4(3), pages 231-262, October.
    14. Wu, Bo & Li, Lingfei, 2024. "Reinforcement learning for continuous-time mean-variance portfolio selection in a regime-switching market," Journal of Economic Dynamics and Control, Elsevier, vol. 158(C).
    15. Amin, Kaushik I, 1993. "Jump Diffusion Option Valuation in Discrete Time," Journal of Finance, American Finance Association, vol. 48(5), pages 1833-1863, December.
    16. Tomas Björk & Mariana Khapko & Agatha Murgoci, 2021. "Time-Inconsistent Control Theory with Finance Applications," Springer Finance, Springer, number 978-3-030-81843-2, March.
    17. R. Jiang & D. Saunders & C. Weng, 2022. "The reinforcement learning Kelly strategy," Quantitative Finance, Taylor & Francis Journals, vol. 22(8), pages 1445-1464, August.
    18. Yanwei Jia & Xun Yu Zhou, 2021. "Policy Evaluation and Temporal-Difference Learning in Continuous Time and Space: A Martingale Approach," Papers 2108.06655, arXiv.org, revised Feb 2022.
    19. Merton, Robert C., 1976. "Option pricing when underlying stock returns are discontinuous," Journal of Financial Economics, Elsevier, vol. 3(1-2), pages 125-144.
    20. E. Savku & G.-W Weber, 2022. "Stochastic differential games for optimal investment problems in a Markov regime-switching jump-diffusion market," Annals of Operations Research, Springer, vol. 312(2), pages 1171-1196, May.
    21. Park, Keehwan & Ahn, Chang Mo & Fujihara, Roger, 1993. "Optimal hedged portfolios: the case of jump-diffusion risks," Journal of International Money and Finance, Elsevier, vol. 12(5), pages 493-510, October.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Yuling Max Chen & Bin Li & David Saunders, 2025. "Exploratory Mean-Variance Portfolio Optimization with Regime-Switching Market Dynamics," Papers 2501.16659, arXiv.org.
    2. Xuefeng Gao & Lingfei Li & Xun Yu Zhou, 2024. "Reinforcement Learning for Jump-Diffusions, with Financial Applications," Papers 2405.16449, arXiv.org, revised Aug 2025.
    3. Hatem Ben-Ameur & Rim Chérif & Bruno Rémillard, 2016. "American-style options in jump-diffusion models: estimation and evaluation," Quantitative Finance, Taylor & Francis Journals, vol. 16(8), pages 1313-1324, August.
    4. Kozarski, R., 2013. "Pricing and hedging in the VIX derivative market," Other publications TiSEM 221fefe0-241e-4914-b6bd-c, Tilburg University, School of Economics and Management.
    5. Christara, Christina C. & Leung, Nat Chun-Ho, 2016. "Option pricing in jump diffusion models with quadratic spline collocation," Applied Mathematics and Computation, Elsevier, vol. 279(C), pages 28-42.
    6. J. S. Kennedy & P. A. Forsyth & K. R. Vetzal, 2009. "Dynamic Hedging Under Jump Diffusion with Transaction Costs," Operations Research, INFORMS, vol. 57(3), pages 541-559, June.
    7. Hilliard, Jimmy E. & Hilliard, Jitka, 2019. "A jump-diffusion model for pricing and hedging with margined options: An application to Brent crude oil contracts," Journal of Banking & Finance, Elsevier, vol. 98(C), pages 137-155.
    8. Chen Ziyi & Gu Jia-wen, 2025. "Exploratory Utility Maximization Problem with Tsallis Entropy," Papers 2502.01269, arXiv.org.
    9. Kuo-Shing Chen & Yu-Chuan Huang, 2021. "Detecting Jump Risk and Jump-Diffusion Model for Bitcoin Options Pricing and Hedging," Mathematics, MDPI, vol. 9(20), pages 1-24, October.
    10. Junyan Ye & Hoi Ying Wong & Kyunghyun Park, 2025. "Robust Exploratory Stopping under Ambiguity in Reinforcement Learning," Papers 2510.10260, arXiv.org.
    11. Michael Albert & Jason Fink & Kristin E. Fink, 2008. "Adaptive Mesh Modeling And Barrier Option Pricing Under A Jump‐Diffusion Process," Journal of Financial Research, Southern Finance Association;Southwestern Finance Association, vol. 31(4), pages 381-408, December.
    12. C. He & J. Kennedy & T. Coleman & P. Forsyth & Y. Li & K. Vetzal, 2006. "Calibration and hedging under jump diffusion," Review of Derivatives Research, Springer, vol. 9(1), pages 1-35, January.
    13. Erhan Bayraktar & Hao Xing, 2009. "Pricing American options for jump diffusions by iterating optimal stopping problems for diffusions," Mathematical Methods of Operations Research, Springer;Gesellschaft für Operations Research (GOR);Nederlands Genootschap voor Besliskunde (NGB), vol. 70(3), pages 505-525, December.
    14. Michael C. Fu & Bingqing Li & Guozhen Li & Rongwen Wu, 2017. "Option Pricing for a Jump-Diffusion Model with General Discrete Jump-Size Distributions," Management Science, INFORMS, vol. 63(11), pages 3961-3977, November.
    15. Ron Chan & Simon Hubbert, 2014. "Options pricing under the one-dimensional jump-diffusion model using the radial basis function interpolation scheme," Review of Derivatives Research, Springer, vol. 17(2), pages 161-189, July.
    16. Bakshi, Gurdip & Panayotov, George, 2010. "First-passage probability, jump models, and intra-horizon risk," Journal of Financial Economics, Elsevier, vol. 95(1), pages 20-40, January.
    17. Hamed Ghanbari & Michael Oancea & Stylianos Perrakis, 2021. "Shedding light on a dark matter: Jump diffusion and option‐implied investor preferences," European Financial Management, European Financial Management Association, vol. 27(2), pages 244-286, March.
    18. Perrakis, Stylianos & Boloorforoosh, Ali, 2013. "Valuing catastrophe derivatives under limited diversification: A stochastic dominance approach," Journal of Banking & Finance, Elsevier, vol. 37(8), pages 3157-3168.
    19. Chen, Gongmeng & Choi, Yoon K. & Zhou, Yong, 2008. "Detections of changes in return by a wavelet smoother with conditional heteroscedastic volatility," Journal of Econometrics, Elsevier, vol. 143(2), pages 227-262, April.
    20. Guidolin, Massimo & Timmermann, Allan, 2003. "Option prices under Bayesian learning: implied volatility dynamics and predictive densities," Journal of Economic Dynamics and Control, Elsevier, vol. 27(5), pages 717-769, March.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2512.09224. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.