IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2507.09916.html
   My bibliography  Save this paper

Solving dynamic portfolio selection problems via score-based diffusion models

Author

Listed:
  • Ahmad Aghapour
  • Erhan Bayraktar
  • Fengyi Yuan

Abstract

In this paper, we tackle the dynamic mean-variance portfolio selection problem in a {\it model-free} manner, based on (generative) diffusion models. We propose using data sampled from the real model $\mathbb P$ (which is unknown) with limited size to train a generative model $\mathbb Q$ (from which we can easily and adequately sample). With adaptive training and sampling methods that are tailor-made for time series data, we obtain quantification bounds between $\mathbb P$ and $\mathbb Q$ in terms of the adapted Wasserstein metric $\mathcal A W_2$. Importantly, the proposed adapted sampling method also facilitates {\it conditional sampling}. In the second part of this paper, we provide the stability of the mean-variance portfolio optimization problems in $\mathcal A W _2$. Then, combined with the error bounds and the stability result, we propose a policy gradient algorithm based on the generative environment, in which our innovative adapted sampling method provides approximate scenario generators. We illustrate the performance of our algorithm on both simulated and real data. For real data, the algorithm based on the generative environment produces portfolios that beat several important baselines, including the Markowitz portfolio, the equal weight (naive) portfolio, and S\&P 500.

Suggested Citation

  • Ahmad Aghapour & Erhan Bayraktar & Fengyi Yuan, 2025. "Solving dynamic portfolio selection problems via score-based diffusion models," Papers 2507.09916, arXiv.org, revised Aug 2025.
  • Handle: RePEc:arx:papers:2507.09916
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2507.09916
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. John Y. Campbell & Luis M. Viceira, 1999. "Consumption and Portfolio Decisions when Expected Returns are Time Varying," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 114(2), pages 433-495.
    2. Martin Schweizer, 1995. "Variance-Optimal Hedging in Discrete Time," Mathematics of Operations Research, INFORMS, vol. 20(1), pages 1-32, February.
    3. Jiwook Kim & Minhyeok Lee, 2023. "Portfolio Optimization using Predictive Auxiliary Classifier Generative Adversarial Networks with Measuring Uncertainty," Papers 2304.11856, arXiv.org.
    4. Andrew Ang & Geert Bekaert, 2002. "International Asset Allocation With Regime Shifts," The Review of Financial Studies, Society for Financial Studies, vol. 15(4), pages 1137-1187.
    5. Michael W. Brandt & Amit Goyal & Pedro Santa-Clara & Jonathan R. Stroud, 2005. "A Simulation Approach to Dynamic Portfolio Choice with an Application to Learning About Return Predictability," The Review of Financial Studies, Society for Financial Studies, vol. 18(3), pages 831-873.
    6. Shihao Gu & Bryan Kelly & Dacheng Xiu, 2020. "Empirical Asset Pricing via Machine Learning," Review of Finance, European Finance Association, vol. 33(5), pages 2223-2273.
    7. Magnus Wiese & Robert Knobloch & Ralf Korn & Peter Kretschmer, 2020. "Quant GANs: deep generation of financial time series," Quantitative Finance, Taylor & Francis Journals, vol. 20(9), pages 1419-1440, September.
    8. Aleš Černý & Christoph Czichowsky & Jan Kallsen, 2024. "Numeraire-Invariant Quadratic Hedging and Mean–Variance Portfolio Allocation," Mathematics of Operations Research, INFORMS, vol. 49(2), pages 752-781, May.
    9. Victor DeMiguel & Lorenzo Garlappi & Raman Uppal, 2009. "Optimal Versus Naive Diversification: How Inefficient is the 1-N Portfolio Strategy?," The Review of Financial Studies, Society for Financial Studies, vol. 22(5), pages 1915-1953, May.
    10. Karatzas, Ioannis & Ruf, Johannes, 2017. "Trading strategies generated by Lyapunov functions," LSE Research Online Documents on Economics 69177, London School of Economics and Political Science, LSE Library.
    11. Guastaroba, Gianfranco & Mansini, Renata & Speranza, M. Grazia, 2009. "On the effectiveness of scenario generation techniques in single-period portfolio optimization," European Journal of Operational Research, Elsevier, vol. 192(2), pages 500-511, January.
    12. Shihao Gu & Bryan Kelly & Dacheng Xiu, 2020. "Empirical Asset Pricing via Machine Learning," The Review of Financial Studies, Society for Financial Studies, vol. 33(5), pages 2223-2273.
    13. Merton, Robert C., 1971. "Optimum consumption and portfolio rules in a continuous-time model," Journal of Economic Theory, Elsevier, vol. 3(4), pages 373-413, December.
    14. Duan Li & Wan‐Lung Ng, 2000. "Optimal Dynamic Portfolio Selection: Multiperiod Mean‐Variance Formulation," Mathematical Finance, Wiley Blackwell, vol. 10(3), pages 387-406, July.
    15. Julio Backhoff-Veraguas & Daniel Bartl & Mathias Beiglböck & Manu Eder, 2020. "Adapted Wasserstein distances and stability in mathematical finance," Finance and Stochastics, Springer, vol. 24(3), pages 601-632, July.
    16. Merton, Robert C., 1976. "Option pricing when underlying stock returns are discontinuous," Journal of Financial Economics, Elsevier, vol. 3(1-2), pages 125-144.
    17. Heston, Steven L, 1993. "A Closed-Form Solution for Options with Stochastic Volatility with Applications to Bond and Currency Options," The Review of Financial Studies, Society for Financial Studies, vol. 6(2), pages 327-343.
    18. Suleyman Basak & Georgy Chabakauri, 2010. "Dynamic Mean-Variance Asset Allocation," The Review of Financial Studies, Society for Financial Studies, vol. 23(8), pages 2970-3016, August.
    19. Ioannis Karatzas & Johannes Ruf, 2017. "Trading strategies generated by Lyapunov functions," Finance and Stochastics, Springer, vol. 21(3), pages 753-787, July.
    20. Christoph Czichowsky & Martin Schweizer, 2012. "Convex Duality in Mean Variance Hedging Under Convex Trading Constraints," Swiss Finance Institute Research Paper Series 12-24, Swiss Finance Institute.
    21. Milena Vuletić & Rama Cont, 2024. "VolGAN: A Generative Model for Arbitrage-Free Implied Volatility Surfaces," Applied Mathematical Finance, Taylor & Francis Journals, vol. 31(4), pages 203-238, July.
    22. Ben Hambly & Renyuan Xu & Huining Yang, 2021. "Recent Advances in Reinforcement Learning in Finance," Papers 2112.04553, arXiv.org, revised Feb 2023.
    23. Jose Blanchet & Lin Chen & Xun Yu Zhou, 2022. "Distributionally Robust Mean-Variance Portfolio Selection with Wasserstein Distances," Management Science, INFORMS, vol. 68(9), pages 6382-6410, September.
    24. MOSSIN, Jan, 1968. "Optimal multiperiod portfolio policies," LIDAM Reprints CORE 19, Université catholique de Louvain, Center for Operations Research and Econometrics (CORE).
    25. Gah-Yi Ban & Noureddine El Karoui & Andrew E. B. Lim, 2018. "Machine Learning and Portfolio Optimization," Management Science, INFORMS, vol. 64(3), pages 1136-1154, March.
    26. Julio Backhoff-Veraguas & Daniel Bartl & Mathias Beiglbock & Manu Eder, 2019. "Adapted Wasserstein Distances and Stability in Mathematical Finance," Papers 1901.07450, arXiv.org, revised May 2020.
    27. Paolella, Marc S. & Polak, Paweł & Walker, Patrick S., 2021. "A non-elliptical orthogonal GARCH model for portfolio selection under transaction costs," Journal of Banking & Finance, Elsevier, vol. 125(C).
    28. Bates, David S, 1996. "Jumps and Stochastic Volatility: Exchange Rate Processes Implicit in Deutsche Mark Options," The Review of Financial Studies, Society for Financial Studies, vol. 9(1), pages 69-107.
    29. Boyle, Phelim P., 1977. "Options: A Monte Carlo approach," Journal of Financial Economics, Elsevier, vol. 4(3), pages 323-338, May.
    30. Tomas Björk & Agatha Murgoci & Xun Yu Zhou, 2014. "Mean–Variance Portfolio Optimization With State-Dependent Risk Aversion," Mathematical Finance, Wiley Blackwell, vol. 24(1), pages 1-24, January.
    31. Ben Hambly & Renyuan Xu & Huining Yang, 2023. "Recent advances in reinforcement learning in finance," Mathematical Finance, Wiley Blackwell, vol. 33(3), pages 437-503, July.
    32. Thomas M. Cover, 1991. "Universal Portfolios," Mathematical Finance, Wiley Blackwell, vol. 1(1), pages 1-29, January.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Suresh M. Sundaresan, 2000. "Continuous‐Time Methods in Finance: A Review and an Assessment," Journal of Finance, American Finance Association, vol. 55(4), pages 1569-1622, August.
    2. Min Dai & Hanqing Jin & Steven Kou & Yuhong Xu, 2021. "A Dynamic Mean-Variance Analysis for Log Returns," Management Science, INFORMS, vol. 67(2), pages 1093-1108, February.
    3. Xavier Warin, 2016. "The Asset Liability Management problem of a nuclear operator : a numerical stochastic optimization approach," Papers 1611.04877, arXiv.org.
    4. Yilie Huang & Yanwei Jia & Xun Yu Zhou, 2024. "Mean--Variance Portfolio Selection by Continuous-Time Reinforcement Learning: Algorithms, Regret Analysis, and Empirical Study," Papers 2412.16175, arXiv.org, revised Aug 2025.
    5. David Allen & Stephen Satchell & Colin Lizieri, 2024. "Quantifying the non-Gaussian gain," Journal of Asset Management, Palgrave Macmillan, vol. 25(1), pages 1-18, February.
    6. Min Dai & Hanqing Jin & Steven Kou & Yuhong Xu, 2021. "Robo-advising: a dynamic mean-variance approach," Digital Finance, Springer, vol. 3(2), pages 81-97, June.
    7. Suleyman Basak & Georgy Chabakauri, 2010. "Dynamic Mean-Variance Asset Allocation," The Review of Financial Studies, Society for Financial Studies, vol. 23(8), pages 2970-3016, August.
    8. Yao Wang & Jingmei Zhao & Qing Li & Xiangyu Wei, 2024. "Considering momentum spillover effects via graph neural network in option pricing," Journal of Futures Markets, John Wiley & Sons, Ltd., vol. 44(6), pages 1069-1094, June.
    9. Minshuo Chen & Renyuan Xu & Yumin Xu & Ruixun Zhang, 2025. "Diffusion Factor Models: Generating High-Dimensional Returns with Factor Structure," Papers 2504.06566, arXiv.org, revised Jul 2025.
    10. Mark E. Wohar & David E. Rapach, 2005. "Return Predictability and the Implied Intertemporal Hedging Demands for Stocks and Bonds: International Evidence," Computing in Economics and Finance 2005 329, Society for Computational Economics.
    11. Francisco Peñaranda & Enrique Sentana, 2024. "Portfolio management with big data," Working Papers wp2024_2411, CEMFI.
    12. Wang, Yuanrong & Aste, Tomaso, 2023. "Dynamic portfolio optimization with inverse covariance clustering," LSE Research Online Documents on Economics 117701, London School of Economics and Political Science, LSE Library.
    13. Sun, Chuting & Wu, Qi & Yan, Xing, 2024. "Dynamic CVaR portfolio construction with attention-powered generative factor learning," Journal of Economic Dynamics and Control, Elsevier, vol. 160(C).
    14. Tobias Lipp & Grégoire Loeper & Olivier Pironneau, 2013. "Mixing Monte-Carlo and Partial Differential Equations for Pricing Options," Post-Print hal-01558826, HAL.
    15. Ben Hambly & Renyuan Xu & Huining Yang, 2021. "Recent Advances in Reinforcement Learning in Finance," Papers 2112.04553, arXiv.org, revised Feb 2023.
    16. Zheng, Wenyuan & Li, Bingqing & Huang, Zhiyong & Chen, Lu, 2022. "Why Was There More Household Stock Market Participation During the COVID-19 Pandemic?," Finance Research Letters, Elsevier, vol. 46(PB).
    17. Ni, Xuanming & Zheng, Tiantian & Zhao, Huimin & Zhu, Shushang, 2023. "High-dimensional portfolio optimization based on tree-structured factor model," Pacific-Basin Finance Journal, Elsevier, vol. 81(C).
    18. Konrad Mueller & Amira Akkari & Lukas Gonon & Ben Wood, 2024. "Fast Deep Hedging with Second-Order Optimization," Papers 2410.22568, arXiv.org.
    19. Jiang, Yifu & Olmo, Jose & Atwi, Majed, 2025. "High-dimensional multi-period portfolio allocation using deep reinforcement learning," International Review of Economics & Finance, Elsevier, vol. 98(C).
    20. George Chacko & Luis M. Viceira, 2005. "Dynamic Consumption and Portfolio Choice with Stochastic Volatility in Incomplete Markets," The Review of Financial Studies, Society for Financial Studies, vol. 18(4), pages 1369-1402.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2507.09916. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.