EFS: Evolutionary Factor Searching for Sparse Portfolio Optimization Using Large Language Models

Author

Listed:
  • Haochen Luo
  • Yuan Zhang
  • Chen Liu

Abstract

Sparse portfolio optimization is a fundamental yet challenging problem in quantitative finance: traditional approaches that rely heavily on historical return statistics and static objectives adapt poorly to dynamic market regimes. To address this issue, we propose Evolutionary Factor Search (EFS), a novel framework that leverages large language models (LLMs) to automate the generation and evolution of alpha factors for sparse portfolio construction. EFS reformulates asset selection as a top-m ranking task guided by LLM-generated factors and incorporates an evolutionary feedback loop that iteratively refines the factor pool based on performance. Extensive experiments on five Fama-French benchmark datasets and three real-market datasets (US50, HSI45 and CSI300) demonstrate that EFS significantly outperforms both statistics-based and optimization-based baselines, especially in larger asset universes and volatile conditions. Comprehensive ablation studies validate the importance of prompt composition, factor diversity, and LLM backend choice. Our results highlight the promise of language-guided evolution as a robust and interpretable paradigm for portfolio optimization under structural constraints.
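
The top-m ranking formulation and the evolutionary feedback loop described in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and not the authors' implementation: the factors are simple trailing-mean functions standing in for LLM-generated ones, the selected assets are equally weighted, fitness is the mean next-period portfolio return on synthetic data, and `mutate` is a hypothetical placeholder for the LLM call that would propose a refined factor.

```python
import numpy as np

def select_top_m(scores, m):
    """Return indices of the m highest-scoring assets (the sparse support)."""
    return np.argsort(scores)[::-1][:m]

def top_m_weights(scores, m):
    """Equal-weight the top-m assets; all other weights are zero (assumption)."""
    w = np.zeros(len(scores))
    w[select_top_m(scores, m)] = 1.0 / m
    return w

def make_momentum(lookback):
    """Hypothetical stand-in for an LLM-generated factor: trailing mean return."""
    def factor(window):
        return window[-lookback:].mean(axis=0)
    factor.lookback = lookback
    return factor

def fitness(factor, returns, m, warmup):
    """Mean realised next-period return of the factor's top-m portfolio."""
    realised = []
    for t in range(warmup, returns.shape[0]):
        # Score assets using data strictly before period t, then hold during t.
        w = top_m_weights(factor(returns[:t]), m)
        realised.append(w @ returns[t])
    return float(np.mean(realised))

def mutate(factor, rng, max_lookback):
    """Placeholder for the LLM step: randomly perturb the parent's lookback."""
    step = int(rng.integers(-3, 4))
    return make_momentum(int(np.clip(factor.lookback + step, 1, max_lookback)))

def evolve(returns, m, pool, generations, keep, rng, warmup):
    """Rank the pool by fitness, keep the best `keep`, refill with mutated children."""
    def rank(factors):
        return sorted(factors, key=lambda f: fitness(f, returns, m, warmup), reverse=True)
    for _ in range(generations):
        survivors = rank(pool)[:keep]
        children = [mutate(f, rng, warmup) for f in survivors]
        pool = survivors + children
    return rank(pool)

rng = np.random.default_rng(0)
returns = rng.normal(0.0, 0.01, size=(120, 20))  # synthetic: 120 periods, 20 assets
pool = [make_momentum(k) for k in (2, 5, 10, 20)]
best = evolve(returns, m=5, pool=pool, generations=3, keep=2, rng=rng, warmup=20)[0]
weights = top_m_weights(best(returns), m=5)  # sparse portfolio: 5 nonzero weights
```

In this sketch the sparsity constraint is enforced by construction, since only the m top-ranked assets receive nonzero weight. In a setup closer to the one the abstract describes, `mutate` would presumably be replaced by a prompt that feeds factor performance back to the LLM, and the fitness measure would be chosen to match the evaluation metric of interest.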

Suggested Citation

  • Haochen Luo & Yuan Zhang & Chen Liu, 2025. "EFS: Evolutionary Factor Searching for Sparse Portfolio Optimization Using Large Language Models," Papers 2507.17211, arXiv.org.
  • Handle: RePEc:arx:papers:2507.17211

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2507.17211
    File Function: Latest version
    Download Restriction: no

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Shi, Longyu & Wang, Yunyun & Li, Wenyue & Zhang, Zhimin, 2025. "Multi-period mean–variance portfolio optimization with capital injections," Mathematics and Computers in Simulation (MATCOM), Elsevier, vol. 233(C), pages 400-412.
    2. Wu, Zhongming & Sun, Kexin & Ge, Zhili & Allen-Zhao, Zhihua & Zeng, Tieyong, 2024. "Sparse portfolio optimization via ℓ1 over ℓ2 regularization," European Journal of Operational Research, Elsevier, vol. 319(3), pages 820-833.
    3. Michele Costola & Bertrand Maillet & Zhining Yuan & Xiang Zhang, 2024. "Mean–variance efficient large portfolios: a simple machine learning heuristic technique based on the two-fund separation theorem," Annals of Operations Research, Springer, vol. 334(1), pages 133-155, March.
    4. Lioui, Abraham & Tarelli, Andrea, 2022. "Chasing the ESG factor," Journal of Banking & Finance, Elsevier, vol. 139(C).
    5. Shuyu Gong & Taizhong Hu & Zhenfeng Zou, 2025. "Norms Based on Generalized Expected-Shortfalls and Applications," Papers 2507.09444, arXiv.org.
    6. Gaete, Michael & Herrera, Rodrigo, 2023. "Diversification benefits of commodities in portfolio allocation: A dynamic factor copula approach," Journal of Commodity Markets, Elsevier, vol. 32(C).
    7. Stelios Arvanitis, 2025. "Norm Constrained Empirical Portfolio Optimization with Stochastic Dominance: Robust Optimization Non-Asymptotics," Working Paper 1533, Economics Department, Queen's University.
    8. Ahmad Mousavi & George Michailidis, 2025. "Cardinality constrained mean-variance portfolios: a penalty decomposition algorithm," Computational Optimization and Applications, Springer, vol. 90(3), pages 631-648, April.
    9. Farshad Noravesh, 2022. "Sparse Non-Convex Optimization For Higher Moment Portfolio Management," Papers 2201.01227, arXiv.org, revised Jan 2022.
    10. Li, Xuepeng & Xu, Fengmin & Jing, Kui, 2022. "Robust enhanced indexation with ESG: An empirical study in the Chinese Stock Market," Economic Modelling, Elsevier, vol. 107(C).
    11. Pier Francesco Procacci & Tomaso Aste, 2022. "Portfolio optimization with sparse multivariate modeling," Journal of Asset Management, Palgrave Macmillan, vol. 23(6), pages 445-465, October.
    12. Alejandro Lopez-Lira & Jihoon Kwon & Sangwoon Yoon & Jy-yong Sohn & Chanyeol Choi, 2025. "Bridging Language Models and Financial Analysis," Papers 2503.22693, arXiv.org.
    13. Hyunglip Bae & Haeun Jeon & Minsu Park & Yongjae Lee & Woo Chang Kim, 2025. "A Cholesky decomposition-based asset selection heuristic for sparse tangent portfolio optimization," Papers 2502.11701, arXiv.org.
    14. Shi, Fangquan & Shu, Lianjie & He, Fangyi & Huang, Wenpo, 2025. "Improving minimum-variance portfolio through shrinkage of large covariance matrices," Economic Modelling, Elsevier, vol. 144(C).
    15. Hongjun Ding & Binqi Chen & Jinsheng Huang & Taian Guo & Zhengyang Mao & Guoyi Shao & Lutong Zou & Luchen Liu & Ming Zhang, 2025. "AlphaEval: A Comprehensive and Efficient Evaluation Framework for Formula Alpha Mining," Papers 2508.13174, arXiv.org.
    16. Cederburg, Scott & O’Doherty, Michael S. & Wang, Feifei & Yan, Xuemin (Sterling), 2020. "On the performance of volatility-managed portfolios," Journal of Financial Economics, Elsevier, vol. 138(1), pages 95-117.
    17. Carlo A. Favero & Alessandro Melone, 2019. "Asset Pricing vs Asset Expected Returning in Factor Models," Working Papers 651, IGIER (Innocenzo Gasparini Institute for Economic Research), Bocconi University.
    18. Adam Zaremba & Jacob Koby Shemer, 2018. "Price-Based Investment Strategies," Springer Books, Springer, number 978-3-319-91530-2, December.
    19. Jonas Heipertz & Amine Ouazad & Romain Rancière & Natacha Valla, 2017. "Balance-Sheet Diversification in General Equilibrium: Identification and Network Effects," NBER Working Papers 23572, National Bureau of Economic Research, Inc.
    20. Alexander Berglund & Massimo Guidolin & Manuela Pedio, 2020. "Monetary policy after the crisis: A threat to hedge funds' alphas?," Journal of Asset Management, Palgrave Macmillan, vol. 21(3), pages 219-238, May.
