IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2511.17462.html

Scaling Conditional Autoencoders for Portfolio Optimization via Uncertainty-Aware Factor Selection

Author

Listed:
  • Ryan Engel
  • Yu Chen
  • Pawel Polak
  • Ioana Boier

Abstract

Conditional Autoencoders (CAEs) offer a flexible, interpretable approach for estimating latent asset-pricing factors from firm characteristics. However, existing studies usually limit the latent factor dimension to around K=5 due to concerns that larger K can degrade performance. To overcome this challenge, we propose a scalable framework that couples a high-dimensional CAE with an uncertainty-aware factor selection procedure. We employ three models for quantile prediction: zero-shot Chronos, a pretrained time-series foundation model (ZS-Chronos), gradient-boosted quantile regression trees using XGBoost and RAPIDS (Q-Boost), and an I.I.D bootstrap-based sample mean model (IID-BS). For each model, we rank factors by forecast uncertainty and retain the top-k most predictable factors for portfolio construction, where k denotes the selected subset of factors. This pruning strategy delivers substantial gains in risk-adjusted performance across all forecasting models. Furthermore, due to each model's uncorrelated predictions, a performance-weighted ensemble consistently outperforms individual models with higher Sharpe, Sortino, and Omega ratios.

Suggested Citation

  • Ryan Engel & Yu Chen & Pawel Polak & Ioana Boier, 2025. "Scaling Conditional Autoencoders for Portfolio Optimization via Uncertainty-Aware Factor Selection," Papers 2511.17462, arXiv.org.
  • Handle: RePEc:arx:papers:2511.17462
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2511.17462
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Stefano Giglio & Dacheng Xiu, 2021. "Asset Pricing with Omitted Factors," Journal of Political Economy, University of Chicago Press, vol. 129(7), pages 1947-1990.
    2. Makridakis, Spyros & Spiliotis, Evangelos & Assimakopoulos, Vassilios, 2020. "The M4 Competition: 100,000 time series and 61 forecasting methods," International Journal of Forecasting, Elsevier, vol. 36(1), pages 54-74.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Shi, Huai-Long & Zhou, Wei-Xing, 2022. "Factor volatility spillover and its implications on factor premia," Journal of International Financial Markets, Institutions and Money, Elsevier, vol. 80(C).
    2. Spiliotis, Evangelos & Makridakis, Spyros & Kaltsounis, Anastasios & Assimakopoulos, Vassilios, 2021. "Product sales probabilistic forecasting: An empirical evaluation using the M5 competition data," International Journal of Production Economics, Elsevier, vol. 240(C).
    3. Tiantian Tu, 2025. "Bridging Short- and Long-Term Dependencies: A CNN-Transformer Hybrid for Financial Time Series Forecasting," Papers 2504.19309, arXiv.org.
    4. Tsoumalis, Georgios I. & Bampos, Zafeirios N. & Biskas, Pandelis N. & Keranidis, Stratos D. & Symeonidis, Polychronis A. & Voulgarakis, Dimitrios K., 2022. "A novel system for providing explicit demand response from domestic natural gas boilers," Applied Energy, Elsevier, vol. 317(C).
    5. d'Aspremont, Alexandre & Ben Arous, Simon & Bricongne, Jean-Charles & Lietti, Benjamin & Meunier, Baptiste, 2025. "Satellites turn “concrete”: Tracking cement with satellite data and neural networks," Journal of Econometrics, Elsevier, vol. 249(PC).
    6. Bojer, Casper Solheim & Meldgaard, Jens Peder, 2021. "Kaggle forecasting competitions: An overlooked learning opportunity," International Journal of Forecasting, Elsevier, vol. 37(2), pages 587-603.
    7. Barkan, Oren & Benchimol, Jonathan & Caspi, Itamar & Cohen, Eliya & Hammer, Allon & Koenigstein, Noam, 2023. "Forecasting CPI inflation components with Hierarchical Recurrent Neural Networks," EconStor Open Access Articles and Book Chapters, ZBW - Leibniz Information Centre for Economics, vol. 39(3), pages 1145-1162.
    8. Alroomi, Azzam & Karamatzanis, Georgios & Nikolopoulos, Konstantinos & Tilba, Anna & Xiao, Shujun, 2022. "Fathoming empirical forecasting competitions’ winners," International Journal of Forecasting, Elsevier, vol. 38(4), pages 1519-1525.
    9. Spiliotis, Evangelos & Petropoulos, Fotios, 2024. "On the update frequency of univariate forecasting models," European Journal of Operational Research, Elsevier, vol. 314(1), pages 111-121.
    10. Jiangtao Duan & Jushan Bai & Xu Han, 2025. "Taxonomy and Estimation of Multiple Breakpoints in High-Dimensional Factor Models," Papers 2503.06645, arXiv.org, revised Aug 2025.
    11. Winita Sulandari & Yudho Yudhanto & Sri Subanti & Crisma Devika Setiawan & Riskhia Hapsari & Paulo Canas Rodrigues, 2023. "Comparing the Simple to Complex Automatic Methods with the Ensemble Approach in Forecasting Electrical Time Series Data," Energies, MDPI, vol. 16(22), pages 1-16, November.
    12. Van Belle, Jente & Guns, Tias & Verbeke, Wouter, 2021. "Using shared sell-through data to forecast wholesaler demand in multi-echelon supply chains," European Journal of Operational Research, Elsevier, vol. 288(2), pages 466-479.
    13. Nghia Chu & Binh Dao & Nga Pham & Huy Nguyen & Hien Tran, 2022. "Predicting Mutual Funds' Performance using Deep Learning and Ensemble Techniques," Papers 2209.09649, arXiv.org, revised Jul 2023.
    14. Beaulieu, Marie-Claude & Dufour, Jean-Marie & Khalaf, Lynda & Melin, Olena, 2023. "Identification-robust beta pricing, spanning, mimicking portfolios, and the benchmark neutrality of catastrophe bonds," Journal of Econometrics, Elsevier, vol. 236(1).
    15. Yuan Liao & Viktor Todorov, 2024. "Changes in the span of systematic risk exposures," Quantitative Economics, Econometric Society, vol. 15(3), pages 817-847, July.
    16. Frison, Lilli & Gölzhäuser, Simon & Bitterling, Moritz & Kramer, Wolfgang, 2024. "Evaluating different artificial neural network forecasting approaches for optimizing district heating network operation," Energy, Elsevier, vol. 307(C).
    17. Esfandiar Maasoumi & Jianqiu Wang & Zhuo Wang & Ke Wu, 2024. "Identifying factors via automatic debiased machine learning," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 39(3), pages 438-461, April.
    18. Sprangers, Olivier & Schelter, Sebastian & de Rijke, Maarten, 2023. "Parameter-efficient deep probabilistic forecasting," International Journal of Forecasting, Elsevier, vol. 39(1), pages 332-345.
    19. Cynthia M. Gong & Di Luo & Huainan Zhao, 2021. "Liquidity risk and the beta premium," Journal of Financial Research, Southern Finance Association;Southwestern Finance Association, vol. 44(4), pages 789-814, December.
    20. Fabio Franceschini, 2025. "The Innovation Long-Run Risk Component," Working Papers wp1215, Dipartimento Scienze Economiche, Universita' di Bologna.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2511.17462. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.