IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2512.21798.html

Deep Generative Models for Synthetic Financial Data: Applications to Portfolio and Risk Modeling

Author

Listed:
  • Christophe D. Hounwanou
  • Yae Ulrich Gaba

Abstract

Synthetic financial data provides a practical solution to the privacy, accessibility, and reproducibility challenges that often constrain empirical research in quantitative finance. This paper investigates the use of deep generative models, specifically Time-series Generative Adversarial Networks (TimeGAN) and Variational Autoencoders (VAEs) to generate realistic synthetic financial return series for portfolio construction and risk modeling applications. Using historical daily returns from the S and P 500 as a benchmark, we generate synthetic datasets under comparable market conditions and evaluate them using statistical similarity metrics, temporal structure tests, and downstream financial tasks. The study shows that TimeGAN produces synthetic data with distributional shapes, volatility patterns, and autocorrelation behaviour that are close to those observed in real returns. When applied to mean--variance portfolio optimization, the resulting synthetic datasets lead to portfolio weights, Sharpe ratios, and risk levels that remain close to those obtained from real data. The VAE provides more stable training but tends to smooth extreme market movements, which affects risk estimation. Finally, the analysis supports the use of synthetic datasets as substitutes for real financial data in portfolio analysis and risk simulation, particularly when models are able to capture temporal dynamics. Synthetic data therefore provides a privacy-preserving, cost-effective, and reproducible tool for financial experimentation and model development.

Suggested Citation

  • Christophe D. Hounwanou & Yae Ulrich Gaba, 2025. "Deep Generative Models for Synthetic Financial Data: Applications to Portfolio and Risk Modeling," Papers 2512.21798, arXiv.org, revised Dec 2025.
  • Handle: RePEc:arx:papers:2512.21798
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2512.21798
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Shihao Gu & Bryan Kelly & Dacheng Xiu, 2020. "Empirical Asset Pricing via Machine Learning," Review of Finance, European Finance Association, vol. 33(5), pages 2223-2273.
    2. James B. Heaton & Nicholas Polson & Jan H. Witte, 2017. "Rejoinder to ‘Deep learning for finance: deep portfolios’," Applied Stochastic Models in Business and Industry, John Wiley & Sons, vol. 33(1), pages 19-21, January.
    3. J. B. Heaton & N. G. Polson & J. H. Witte, 2017. "Deep learning for finance: deep portfolios," Applied Stochastic Models in Business and Industry, John Wiley & Sons, vol. 33(1), pages 3-12, January.
    4. Shihao Gu & Bryan Kelly & Dacheng Xiu, 2020. "Empirical Asset Pricing via Machine Learning," The Review of Financial Studies, Society for Financial Studies, vol. 33(5), pages 2223-2273.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Abraham Itzhak Weinberg, 2025. "Hybrid Quantum-Classical Ensemble Learning for S\&P 500 Directional Prediction," Papers 2512.15738, arXiv.org.
    2. Doron Avramov & Si Cheng & Lior Metzker, 2023. "Machine Learning vs. Economic Restrictions: Evidence from Stock Return Predictability," Management Science, INFORMS, vol. 69(5), pages 2587-2619, May.
    3. Rubesam, Alexandre, 2022. "Machine learning portfolios with equal risk contributions: Evidence from the Brazilian market," Emerging Markets Review, Elsevier, vol. 51(PB).
    4. Junjie Guo, 2025. "Integration of Wavelet Transform Convolution and Channel Attention with LSTM for Stock Price Prediction based Portfolio Allocation," Papers 2507.01973, arXiv.org, revised Jul 2025.
    5. Cakici, Nusret & Fieberg, Christian & Metko, Daniel & Zaremba, Adam, 2023. "Machine learning goes global: Cross-sectional return predictability in international stock markets," Journal of Economic Dynamics and Control, Elsevier, vol. 155(C).
    6. Arasteh, Abdollah, 2025. "A data-driven prediction method for multi-period portfolio optimization using the real options approach," Finance Research Letters, Elsevier, vol. 80(C).
    7. Shanyan Lai, 2025. "Is attention truly all we need? An empirical study of asset pricing in pretrained RNN sparse and global attention models," Papers 2508.19006, arXiv.org.
    8. Wenbo Wu & Jiaqi Chen & Zhibin (Ben) Yang & Michael L. Tindall, 2021. "A Cross-Sectional Machine Learning Approach for Hedge Fund Return Prediction and Selection," Management Science, INFORMS, vol. 67(7), pages 4577-4601, July.
    9. Caldeira, João F. & Santos, André A.P. & Torrent, Hudson S., 2023. "Semiparametric portfolios: Improving portfolio performance by exploiting non-linearities in firm characteristics," Economic Modelling, Elsevier, vol. 122(C).
    10. Federico Vittorio Cortesi & Giuseppe Iannone & Giulia Crippa & Tomaso Poggio & Pierfrancesco Beneventano, 2026. "Same Error, Different Function: The Optimizer as an Implicit Prior in Financial Time Series," Papers 2603.02620, arXiv.org.
    11. Yao, Haixiang & Xia, Shenghao & Liu, Hao, 2022. "Six-factor asset pricing and portfolio investment via deep learning: Evidence from Chinese stock market," Pacific-Basin Finance Journal, Elsevier, vol. 76(C).
    12. Brini, Alessio & Toscano, Giacomo, 2025. "SpotV2Net: Multivariate intraday spot volatility forecasting via vol-of-vol-informed graph attention networks," International Journal of Forecasting, Elsevier, vol. 41(3), pages 1093-1111.
    13. Flori, Andrea & Regoli, Daniele, 2021. "Revealing Pairs-trading opportunities with long short-term memory networks," European Journal of Operational Research, Elsevier, vol. 295(2), pages 772-791.
    14. Uddin, Ajim & Yu, Dantong, 2020. "Latent factor model for asset pricing," Journal of Behavioral and Experimental Finance, Elsevier, vol. 27(C).
    15. Qiong Wu & Christopher G. Brinton & Zheng Zhang & Andrea Pizzoferrato & Zhenming Liu & Mihai Cucuringu, 2019. "Equity2Vec: End-to-end Deep Learning Framework for Cross-sectional Asset Pricing," Papers 1909.04497, arXiv.org, revised Oct 2021.
    16. Luyang Chen & Markus Pelger & Jason Zhu, 2024. "Deep Learning in Asset Pricing," Management Science, INFORMS, vol. 70(2), pages 714-750, February.
    17. Babiak, Mykola & Baruník, Jozef, 2026. "Deep learning, predictability, and optimal portfolio returns," Journal of Empirical Finance, Elsevier, vol. 87(C).
    18. Li, Weiping & Mei, Feng, 2020. "Asset returns in deep learning methods: An empirical analysis on SSE 50 and CSI 300," Research in International Business and Finance, Elsevier, vol. 54(C).
    19. Weijia Peng & Chun Yao, 2023. "Sector-level equity returns predictability with machine learning and market contagion measure," Empirical Economics, Springer, vol. 65(4), pages 1761-1798, October.
    20. James Yae & Yang Luo, 2023. "Robust monitoring machine: a machine learning solution for out-of-sample R $$^2$$ 2 -hacking in return predictability monitoring," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 9(1), pages 1-28, December.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2512.21798. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.