IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2512.23596.html

The Nonstationarity-Complexity Tradeoff in Return Prediction

Author

Listed:
  • Agostino Capponi
  • Chengpiao Huang
  • J. Antonio Sidaoui
  • Kaizheng Wang
  • Jiacheng Zou

Abstract

We investigate machine learning models for stock return prediction in non-stationary environments, revealing a fundamental nonstationarity-complexity tradeoff: complex models reduce misspecification error but require longer training windows that introduce stronger non-stationarity. We resolve this tension with a novel model selection method that jointly optimizes model class and training window size using a tournament procedure that adaptively evaluates candidates on non-stationary validation data. Our theoretical analysis demonstrates that this approach balances misspecification error, estimation variance, and non-stationarity, performing close to the best model in hindsight. Applying our method to 17 industry portfolio returns, we consistently outperform standard rolling-window benchmarks, improving out-of-sample $R^2$ by 14-23% on average. During NBER-designated recessions, improvements are substantial: our method achieves positive $R^2$ during the Gulf War recession while benchmarks are negative, and improves $R^2$ in absolute terms by at least 80bps during the 2001 recession as well as superior performance during the 2008 Financial Crisis. Economically, a trading strategy based on our selected model generates 31% higher cumulative returns averaged across the industries.

Suggested Citation

  • Agostino Capponi & Chengpiao Huang & J. Antonio Sidaoui & Kaizheng Wang & Jiacheng Zou, 2025. "The Nonstationarity-Complexity Tradeoff in Return Prediction," Papers 2512.23596, arXiv.org.
  • Handle: RePEc:arx:papers:2512.23596
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2512.23596
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Darwin Choi & Wenxi Jiang & Chao Zhang, 2025. "Alpha Go Everywhere: Machine Learning and International Stock Returns," The Review of Asset Pricing Studies, Society for Financial Studies, vol. 15(3-4), pages 288-331.
    2. Inoue, Atsushi & Jin, Lu & Rossi, Barbara, 2017. "Rolling window selection for out-of-sample forecasting with time-varying parameters," Journal of Econometrics, Elsevier, vol. 196(1), pages 55-67.
    3. Bryan T. Kelly & Semyon Malamud & Kangying Zhou, 2022. "The Virtue of Complexity Everywhere," Swiss Finance Institute Research Paper Series 22-57, Swiss Finance Institute.
    4. Shihao Gu & Bryan Kelly & Dacheng Xiu, 2020. "Empirical Asset Pricing via Machine Learning," Review of Finance, European Finance Association, vol. 33(5), pages 2223-2273.
    5. Joachim Freyberger & Andreas Neuhierl & Michael Weber, 2020. "Dissecting Characteristics Nonparametrically," The Review of Financial Studies, Society for Financial Studies, vol. 33(5), pages 2326-2377.
    6. Chun Liu & John M. Maheu, 2008. "Are There Structural Breaks in Realized Volatility?," Journal of Financial Econometrics, Oxford University Press, vol. 6(3), pages 326-360, Summer.
    7. Bryan T. Kelly & Semyon Malamud, 2025. "Understanding The Virtue of Complexity," Swiss Finance Institute Research Paper Series 25-96, Swiss Finance Institute.
    8. Bryan Kelly & Semyon Malamud & Kangying Zhou, 2024. "The Virtue of Complexity in Return Prediction," Journal of Finance, American Finance Association, vol. 79(1), pages 459-503, February.
    9. Jushan Bai & Pierre Perron, 1998. "Estimating and Testing Linear Models with Multiple Structural Changes," Econometrica, Econometric Society, vol. 66(1), pages 47-78, January.
    10. Fama, Eugene F. & French, Kenneth R., 1993. "Common risk factors in the returns on stocks and bonds," Journal of Financial Economics, Elsevier, vol. 33(1), pages 3-56, February.
    11. Pesaran, M. Hashem & Pick, Andreas, 2011. "Forecast Combination Across Estimation Windows," Journal of Business & Economic Statistics, American Statistical Association, vol. 29(2), pages 307-318.
    12. Joachim Freyberger & Andreas Neuhierl & Michael Weber & Andrew KarolyiEditor, 2020. "Dissecting Characteristics Nonparametrically," Review of Financial Studies, Society for Financial Studies, vol. 33(5), pages 2326-2377.
    13. Shihao Gu & Bryan Kelly & Dacheng Xiu, 2020. "Empirical Asset Pricing via Machine Learning," The Review of Financial Studies, Society for Financial Studies, vol. 33(5), pages 2223-2273.
    14. Donald W. K. Andrews, 2003. "Tests for Parameter Instability and Structural Change with Unknown Change Point: A Corrigendum," Econometrica, Econometric Society, vol. 71(1), pages 395-397, January.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Fieberg, Christian & Liedtke, Gerrit & Zaremba, Adam & Cakici, Nusret, 2025. "A factor model for the cross-section of country equity risk premia," Journal of Banking & Finance, Elsevier, vol. 171(C).
    2. Smith, Simon C. & Timmermann, Allan, 2022. "Have risk premia vanished?," Journal of Financial Economics, Elsevier, vol. 145(2), pages 553-576.
    3. Chen, Minghui & Hanauer, Matthias X. & Kalsbach, Tobias, 2025. "Model complexity and the performance of global versus regional models," Economics Letters, Elsevier, vol. 257(C).
    4. Changeun Kim & Younwoo Jeong & Bong-Gyu Jang, 2025. "Interpretable Deep Learning for Stock Returns: A Consensus-Bottleneck Asset Pricing Model," Papers 2512.16251, arXiv.org, revised Apr 2026.
    5. Malakhov, Alexey & Riley, Timothy B. & Yan, Qing, 2024. "Do hedge funds bet against beta?," International Review of Economics & Finance, Elsevier, vol. 93(PA), pages 1507-1525.
    6. Cong Wang, 2024. "Stock return prediction with multiple measures using neural network models," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 10(1), pages 1-34, December.
    7. Clarke, Charles, 2022. "The level, slope, and curve factor model for stocks," Journal of Financial Economics, Elsevier, vol. 143(1), pages 159-187.
    8. Yan, Jingda & Yu, Jialin, 2023. "Cross-stock momentum and factor momentum," Journal of Financial Economics, Elsevier, vol. 150(2).
    9. Qian, Yihe & Zhang, Yang, 2025. "Long-term forecasting in asset pricing: Machine learning models’ sensitivity to macroeconomic shifts and firm-specific factors," The North American Journal of Economics and Finance, Elsevier, vol. 78(C).
    10. Esfandiar Maasoumi & Jianqiu Wang & Zhuo Wang & Ke Wu, 2024. "Identifying factors via automatic debiased machine learning," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 39(3), pages 438-461, April.
    11. Doron Avramov & Si Cheng & Lior Metzker, 2023. "Machine Learning vs. Economic Restrictions: Evidence from Stock Return Predictability," Management Science, INFORMS, vol. 69(5), pages 2587-2619, May.
    12. Allen Yikuan Huang & Zheqi Fan, 2026. "Beyond Prompting: An Autonomous Framework for Systematic Factor Investing via Agentic AI," Papers 2603.14288, arXiv.org, revised Apr 2026.
    13. Lioui, Abraham & Tarelli, Andrea, 2022. "Chasing the ESG factor," Journal of Banking & Finance, Elsevier, vol. 139(C).
    14. Ni, Xuanming & Zheng, Tiantian & Zhao, Huimin & Zhu, Shushang, 2023. "High-dimensional portfolio optimization based on tree-structured factor model," Pacific-Basin Finance Journal, Elsevier, vol. 81(C).
    15. Rubesam, Alexandre, 2022. "Machine learning portfolios with equal risk contributions: Evidence from the Brazilian market," Emerging Markets Review, Elsevier, vol. 51(PB).
    16. Guillaume Chevalier & Guillaume Coqueret & Thomas Raffinot, 2022. "Supervised portfolios," Post-Print hal-04144588, HAL.
    17. Tian Ma & Cunfei Liao & Fuwei Jiang, 2023. "Timing the factor zoo via deep learning: Evidence from China," Accounting and Finance, Accounting and Finance Association of Australia and New Zealand, vol. 63(1), pages 485-505, March.
    18. Cakici, Nusret & Fieberg, Christian & Metko, Daniel & Zaremba, Adam, 2023. "Machine learning goes global: Cross-sectional return predictability in international stock markets," Journal of Economic Dynamics and Control, Elsevier, vol. 155(C).
    19. Arasteh, Abdollah, 2025. "A data-driven prediction method for multi-period portfolio optimization using the real options approach," Finance Research Letters, Elsevier, vol. 80(C).
    20. Ma, Tian & Leong, Wen Jun & Jiang, Fuwei, 2023. "A latent factor model for the Chinese stock market," International Review of Financial Analysis, Elsevier, vol. 87(C).

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2512.23596. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.