IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2601.17773.html

MarketGANs: Multivariate financial time-series data augmentation using generative adversarial networks

Author

Listed:
  • Jeonggyu Huh
  • Seungwon Jeong
  • Hyun-Gyoon Kim
  • Hyeng Keun Koo
  • Byung Hwa Lim

Abstract

This paper introduces MarketGAN, a factor-based generative framework for high-dimensional asset return generation under severe data scarcity. We embed an explicit asset-pricing factor structure as an economic inductive bias and generate returns as a single joint vector, thereby preserving cross-sectional dependence and tail co-movement alongside inter-temporal dynamics. MarketGAN employs generative adversarial learning with a temporal convolutional network (TCN) backbone, which models stochastic, time-varying factor loadings and volatilities and captures long-range temporal dependence. Using daily returns of large U.S. equities, we find that MarketGAN more closely matches empirical stylized facts of asset returns, including heavy-tailed marginal distributions, volatility clustering, leverage effects, and, most notably, high-dimensional cross-sectional correlation structures and tail co-movement across assets, than conventional factor-model-based bootstrap approaches. In portfolio applications, covariance estimates derived from MarketGAN-generated samples outperform those derived from other methods when factor information is at least weakly informative, demonstrating tangible economic value.

Suggested Citation

  • Jeonggyu Huh & Seungwon Jeong & Hyun-Gyoon Kim & Hyeng Keun Koo & Byung Hwa Lim, 2026. "MarketGANs: Multivariate financial time-series data augmentation using generative adversarial networks," Papers 2601.17773, arXiv.org.
  • Handle: RePEc:arx:papers:2601.17773
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2601.17773
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Luyang Chen & Markus Pelger & Jason Zhu, 2024. "Deep Learning in Asset Pricing," Management Science, INFORMS, vol. 70(2), pages 714-750, February.
    2. Blanka Horvath & Jonathan Plenk & Milena Vuletić, 2026. "Market Generators: A Paradigm Shift in Financial Modeling," Springer Finance, in: Christian Bayer & Goncalo dos Reis & Blanka Horvath & Harald Oberhauser (ed.), Signature Methods in Finance, pages 127-159, Springer.
    3. Anders Max Reppen & Halil Mete Soner, 2023. "Deep empirical risk minimization in finance: Looking into the future," Mathematical Finance, Wiley Blackwell, vol. 33(1), pages 116-145, January.
    4. Ali Boloorforoosh & Peter Christoffersen & Mathieu Fournier & Christian Gouriéroux & Stijn Van Nieuwerburgh, 2020. "Beta Risk in the Cross-Section of Equities," The Review of Financial Studies, Society for Financial Studies, vol. 33(9), pages 4318-4366.
    5. Shihao Gu & Bryan Kelly & Dacheng Xiu, 2020. "Empirical Asset Pricing via Machine Learning," Review of Finance, European Finance Association, vol. 33(5), pages 2223-2273.
    6. Magnus Wiese & Robert Knobloch & Ralf Korn & Peter Kretschmer, 2020. "Quant GANs: deep generation of financial time series," Quantitative Finance, Taylor & Francis Journals, vol. 20(9), pages 1419-1440, September.
    7. Chen, Andrew Y. & McCoy, Jack, 2024. "Missing values handling for machine learning portfolios," Journal of Financial Economics, Elsevier, vol. 155(C).
    8. Lionel Martellini & Volker Ziemann, 2010. "Improved Estimates of Higher-Order Comoments and Implications for Portfolio Selection," The Review of Financial Studies, Society for Financial Studies, vol. 23(4), pages 1467-1502, April.
    9. Fama, Eugene F. & French, Kenneth R., 2015. "A five-factor asset pricing model," Journal of Financial Economics, Elsevier, vol. 116(1), pages 1-22.
    10. Gu, Shihao & Kelly, Bryan & Xiu, Dacheng, 2021. "Autoencoder asset pricing models," Journal of Econometrics, Elsevier, vol. 222(1), pages 429-450.
    11. Takahashi, Shuntaro & Chen, Yu & Tanaka-Ishii, Kumiko, 2019. "Modeling financial time-series with generative adversarial networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 527(C).
    12. Joachim Freyberger & Bjoern Hoeppner & Andreas Neuhierl & Michael Weber, 2025. "Missing Data in Asset Pricing Panels," The Review of Financial Studies, Society for Financial Studies, vol. 38(3), pages 760-802.
    13. Jiawei Wang & Zhen Chen, 2024. "Factor-GAN: Enhancing stock price prediction and factor investment with Generative Adversarial Networks," PLOS ONE, Public Library of Science, vol. 19(6), pages 1-26, June.
    14. Ivo Welch & Amit Goyal, 2008. "A Comprehensive Look at The Empirical Performance of Equity Premium Prediction," The Review of Financial Studies, Society for Financial Studies, vol. 21(4), pages 1455-1508, July.
    15. Fama, Eugene F. & French, Kenneth R., 1993. "Common risk factors in the returns on stocks and bonds," Journal of Financial Economics, Elsevier, vol. 33(1), pages 3-56, February.
    16. Chamberlain, Gary, 1982. "Multivariate regression models for panel data," Journal of Econometrics, Elsevier, vol. 18(1), pages 5-46, January.
    17. Benoit Mandelbrot, 2015. "The Variation of Certain Speculative Prices," World Scientific Book Chapters, in: Anastasios G Malliaris & William T Ziemba (ed.), THE WORLD SCIENTIFIC HANDBOOK OF FUTURES MARKETS, chapter 3, pages 39-78, World Scientific Publishing Co. Pte. Ltd..
    18. Svetlana Bryzgalova & Markus Pelger & Jason Zhu, 2025. "Forest through the Trees: Building Cross‐Sections of Stock Returns," Journal of Finance, American Finance Association, vol. 80(5), pages 2447-2506, October.
    19. R. Cont, 2001. "Empirical properties of asset returns: stylized facts and statistical issues," Quantitative Finance, Taylor & Francis Journals, vol. 1(2), pages 223-236.
    20. Christopher S. Armstrong & Snehal Banerjee & Carlos Corona, 2013. "Factor-Loading Uncertainty and Expected Returns," The Review of Financial Studies, Society for Financial Studies, vol. 26(1), pages 158-207.
    21. So-Yoon Cho & Jin-Young Kim & Kayoung Ban & Hyeng Keun Koo & Hyun-Gyoon Kim, 2025. "Diffolio: A Diffusion Model for Multivariate Probabilistic Financial Time-Series Forecasting and Portfolio Construction," Papers 2511.07014, arXiv.org, revised Mar 2026.
    22. Rizzato, Matteo & Wallart, Julien & Geissler, Christophe & Morizet, Nicolas & Boumlaik, Noureddine, 2023. "Generative Adversarial Networks applied to synthetic financial scenarios generation," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 623(C).
    23. Ledoit, Olivier & Wolf, Michael, 2004. "A well-conditioned estimator for large-dimensional covariance matrices," Journal of Multivariate Analysis, Elsevier, vol. 88(2), pages 365-411, February.
    24. Jostova, Gergana & Philipov, Alexander, 2005. "Bayesian Analysis of Stochastic Betas," Journal of Financial and Quantitative Analysis, Cambridge University Press, vol. 40(4), pages 747-778, December.
    25. Andrew Y. Chen & Jack McCoy, 2022. "Missing Values Handling for Machine Learning Portfolios," Papers 2207.13071, arXiv.org, revised Jan 2024.
    26. Matteo Rizzato & Julien Wallart & Christophe Geissler & Nicolas Morizet & Noureddine Boumlaik, 2023. "Generative Adversarial Networks Applied to Synthetic Financial Scenarios Generation [Data Latent space]," Post-Print hal-03716692, HAL.
    27. Ruslan Tepelyan & Achintya Gopal, 2023. "Generative Machine Learning for Multivariate Equity Returns," Papers 2311.14735, arXiv.org.
    28. Shihao Gu & Bryan Kelly & Dacheng Xiu, 2020. "Empirical Asset Pricing via Machine Learning," The Review of Financial Studies, Society for Financial Studies, vol. 33(5), pages 2223-2273.
    29. A. Max Reppen & H. Mete Soner & Valentin Tissot-Daguette, 2023. "Deep stochastic optimization in finance," Digital Finance, Springer, vol. 5(1), pages 91-111, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Changeun Kim & Younwoo Jeong & Bong-Gyu Jang, 2025. "Interpretable Deep Learning for Stock Returns: A Consensus-Bottleneck Asset Pricing Model," Papers 2512.16251, arXiv.org, revised Apr 2026.
    2. Li, Bin & Rossi, Alberto G. & Yan, Xuemin (Sterling) & Zheng, Lingling, 2025. "Machine learning from a “Universe” of signals: The role of feature engineering," Journal of Financial Economics, Elsevier, vol. 172(C).
    3. Minshuo Chen & Renyuan Xu & Yumin Xu & Ruixun Zhang, 2025. "Diffusion Factor Models: Generating High-Dimensional Returns with Factor Structure," Papers 2504.06566, arXiv.org, revised Jan 2026.
    4. Wu, Hongxu & Wang, Qiao & Li, Jianping & Deng, Zhibin, 2025. "Enhancing stock return prediction in the Chinese market: A GAN-based approach," Research in International Business and Finance, Elsevier, vol. 75(C).
    5. Doron Avramov & Si Cheng & Lior Metzker, 2023. "Machine Learning vs. Economic Restrictions: Evidence from Stock Return Predictability," Management Science, INFORMS, vol. 69(5), pages 2587-2619, May.
    6. Adcock, Christopher & Bessler, Wolfgang & Conlon, Thomas, 2022. "Characteristic-sorted portfolios and macroeconomic risks—An orthogonal decomposition," Journal of Empirical Finance, Elsevier, vol. 65(C), pages 24-50.
    7. Bhaskar Goswami & Ajim Uddin, 2026. "Significance of predictors: revisiting stock return predictions using explainable AI," Annals of Operations Research, Springer, vol. 357(1), pages 223-257, February.
    8. Wolfgang Drobetz & Tizian Otto, 2021. "Empirical asset pricing via machine learning: evidence from the European stock market," Journal of Asset Management, Palgrave Macmillan, vol. 22(7), pages 507-538, December.
    9. Xi Dong & Yan Li & David E. Rapach & Guofu Zhou, 2022. "Anomalies and the Expected Market Return," Journal of Finance, American Finance Association, vol. 77(1), pages 639-681, February.
    10. Cong Wang, 2024. "Stock return prediction with multiple measures using neural network models," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 10(1), pages 1-34, December.
    11. Christian Fieberg & Daniel Metko & Thorsten Poddig & Thomas Loy, 2023. "Machine learning techniques for cross-sectional equity returns’ prediction," OR Spectrum: Quantitative Approaches in Management, Springer;Gesellschaft für Operations Research e.V., vol. 45(1), pages 289-323, March.
    12. Obaid, Khaled & Pukthuanthong, Kuntara, 2022. "A picture is worth a thousand words: Measuring investor sentiment by combining machine learning and photos from news," Journal of Financial Economics, Elsevier, vol. 144(1), pages 273-297.
    13. Clarke, Charles, 2022. "The level, slope, and curve factor model for stocks," Journal of Financial Economics, Elsevier, vol. 143(1), pages 159-187.
    14. Paul Handro & Bogdan Dima, 2024. "Analyzing Financial Markets Efficiency: Insights from a Bibliometric and Content Review," Journal of Financial Studies, Institute of Financial Studies, vol. 16(9), pages 119-175, May.
    15. Uddin, Ajim & Tao, Xinyuan & Yu, Dantong, 2023. "Attention based dynamic graph neural network for asset pricing," Global Finance Journal, Elsevier, vol. 58(C).
    16. Xiao, Xiang & Hua, Xia & Qin, Kexin, 2024. "A self-attention based cross-sectional return forecasting model with evidence from the Chinese market," Finance Research Letters, Elsevier, vol. 62(PA).
    17. Guillaume Chevalier & Guillaume Coqueret & Thomas Raffinot, 2022. "Supervised portfolios," Post-Print hal-04144588, HAL.
    18. Maung, Kenwin & Swanson, Norman R., 2025. "A survey of models and methods used for forecasting when investing in financial markets," International Journal of Forecasting, Elsevier, vol. 41(4), pages 1355-1382.
    19. Tian Ma & Cunfei Liao & Fuwei Jiang, 2023. "Timing the factor zoo via deep learning: Evidence from China," Accounting and Finance, Accounting and Finance Association of Australia and New Zealand, vol. 63(1), pages 485-505, March.
    20. Wang, Chou-Wen & Liu, Kai & Li, Bin & Tan, Ken Seng, 2022. "Portfolio optimization under multivariate affine generalized hyperbolic distributions," International Review of Economics & Finance, Elsevier, vol. 80(C), pages 49-66.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2601.17773. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.