IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2602.07085.html

QuantaAlpha: An Evolutionary Framework for LLM-Driven Alpha Mining

Author

Listed:
  • Jun Han
  • Shuo Zhang
  • Wei Li
  • Zhi Yang
  • Yifan Dong
  • Tu Hu
  • Jialuo Yuan
  • Xiaomin Yu
  • Yumo Zhu
  • Fangqi Lou
  • Xin Guo
  • Zhaowei Liu
  • Tianyi Jiang
  • Ruichuan An
  • Jingping Liu
  • Biao Wu
  • Rongze Chen
  • Kunyi Wang
  • Yifan Wang
  • Sen Hu
  • Xinbing Kong
  • Liwen Zhang
  • Ronghao Chen
  • Huacan Wang

Abstract

Financial markets are noisy and non-stationary, making alpha mining highly sensitive to noise in backtesting results and sudden market regime shifts. While recent agentic frameworks improve alpha mining automation, they often lack controllable multi-round search and reliable reuse of validated experience. To address these challenges, we propose QuantaAlpha, an evolutionary alpha mining framework that treats each end-to-end mining run as a trajectory and improves factors through trajectory-level mutation and crossover operations. QuantaAlpha localizes suboptimal steps in each trajectory for targeted revision and recombines complementary high-reward segments to reuse effective patterns, enabling structured exploration and refinement across mining iterations. During factor generation, QuantaAlpha enforces semantic consistency across the hypothesis, factor expression, and executable code, while constraining the complexity and redundancy of the generated factor to mitigate crowding. Extensive experiments on the China Securities Index 300 (CSI 300) demonstrate consistent gains over strong baseline models and prior agentic systems. When utilizing GPT-5.2, QuantaAlpha achieves an Information Coefficient (IC) of 0.1501, with an Annualized Rate of Return (ARR) of 27.75% and a Maximum Drawdown (MDD) of 7.98%. Moreover, factors mined on CSI 300 transfer effectively to the China Securities Index 500 (CSI 500) and the Standard & Poor's 500 Index (S&P 500), delivering 160% and 137% cumulative excess return over four years, respectively, which indicates strong robustness of QuantaAlpha under market distribution shifts.

Suggested Citation

  • Jun Han & Shuo Zhang & Wei Li & Zhi Yang & Yifan Dong & Tu Hu & Jialuo Yuan & Xiaomin Yu & Yumo Zhu & Fangqi Lou & Xin Guo & Zhaowei Liu & Tianyi Jiang & Ruichuan An & Jingping Liu & Biao Wu & Rongze , 2026. "QuantaAlpha: An Evolutionary Framework for LLM-Driven Alpha Mining," Papers 2602.07085, arXiv.org, revised Apr 2026.
  • Handle: RePEc:arx:papers:2602.07085
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2602.07085
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Binqi Chen & Hongjun Ding & Ning Shen & Jinsheng Huang & Taian Guo & Luchen Liu & Ming Zhang, 2025. "AlphaSAGE: Structure-Aware Alpha Mining via GFlowNets for Robust Exploration," Papers 2509.25055, arXiv.org, revised Sep 2025.
    2. Wentao Zhang & Lingxuan Zhao & Haochong Xia & Shuo Sun & Jiaze Sun & Molei Qin & Xinyi Li & Yuqing Zhao & Yilei Zhao & Xinyu Cai & Longtao Zheng & Xinrun Wang & Bo An, 2024. "A Multimodal Foundation Agent for Financial Trading: Tool-Augmented, Diversified, and Generalist," Papers 2402.18485, arXiv.org, revised Jun 2024.
    3. Hongjun Ding & Binqi Chen & Jinsheng Huang & Taian Guo & Zhengyang Mao & Guoyi Shao & Lutong Zou & Luchen Liu & Ming Zhang, 2025. "AlphaEval: A Comprehensive and Efficient Evaluation Framework for Formula Alpha Mining," Papers 2508.13174, arXiv.org.
    4. Xiangyu Li & Yawen Zeng & Xiaofen Xing & Jin Xu & Xiangmin Xu, 2025. "HedgeAgents: A Balanced-aware Multi-agent Financial Trading System," Papers 2502.13165, arXiv.org.
    5. Xiao-Yang Liu & Guoxuan Wang & Hongyang Yang & Daochen Zha, 2023. "FinGPT: Democratizing Internet-scale Data for Financial Large Language Models," Papers 2307.10485, arXiv.org, revised Nov 2023.
    6. Jimin Huang & Mengxi Xiao & Dong Li & Zihao Jiang & Yuzhe Yang & Yifei Zhang & Lingfei Qian & Yan Wang & Xueqing Peng & Yang Ren & Ruoyu Xiang & Zhengyu Chen & Xiao Zhang & Yueru He & Weiguang Han & S, 2024. "Open-FinLLMs: Open Multimodal Large Language Models for Financial Applications," Papers 2408.11878, arXiv.org, revised Jun 2025.
    7. M. Hashem Pesaran, 2021. "General diagnostic tests for cross-sectional dependence in panels," Empirical Economics, Springer, vol. 60(1), pages 13-50, January.
    8. Engle, Robert F, 1982. "Autoregressive Conditional Heteroscedasticity with Estimates of the Variance of United Kingdom Inflation," Econometrica, Econometric Society, vol. 50(4), pages 987-1007, July.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Zuoyou Jiang & Li Zhao & Rui Sun & Ruohan Sun & Zhongjian Li & Jing Li & Daxin Jiang & Zuo Bai & Cheng Hua, 2025. "Alpha-R1: Alpha Screening with LLM Reasoning via Reinforcement Learning," Papers 2512.23515, arXiv.org.
    2. Mostapha Benhenda, 2026. "Look-Ahead-Bench: a Standardized Benchmark of Look-ahead Bias in Point-in-Time LLMs for Finance," Papers 2601.13770, arXiv.org.
    3. Kajal Lahiri & Fushang Liu, 2006. "Modelling multi‐period inflation uncertainty using a panel of density forecasts," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 21(8), pages 1199-1219, December.
    4. Christos Bouras & Christina Christou & Rangan Gupta & Tahir Suleman, 2020. "Geopolitical Risks, Returns, and Volatility in Emerging Stock Markets: Evidence from a Panel GARCH Model," Emerging Markets Finance and Trade, Taylor & Francis Journals, vol. 55(8), pages 1841-1856, July.
    5. Masoud Soleimani, 2025. "LLM-Generated Counterfactual Stress Scenarios for Portfolio Risk Simulation via Hybrid Prompt-RAG Pipeline," Papers 2512.07867, arXiv.org.
    6. Aninday Banerjee & Markus Eberhardt & J James Reade, 2010. "Panel Estimation for Worriers," Discussion Papers 10-33, Department of Economics, University of Birmingham.
    7. Andreas A. Andrikopoulos & Dimitrios C. Gkountanis, 2011. "Issues and Models in Applied Econometrics: A partial survey," South-Eastern Europe Journal of Economics, Association of Economic Universities of South and Eastern Europe and the Black Sea Region, vol. 9(2), pages 107-165.
    8. Çiçekçi, Cumhur & Gaygısız, Esma, 2023. "Procyclicality of fiscal policy in oil-rich countries: Roles of resource funds and institutional quality," Resources Policy, Elsevier, vol. 85(PB).
    9. Bathia, Deven & Bouras, Christos & Demirer, Riza & Gupta, Rangan, 2020. "Cross-border capital flows and return dynamics in emerging stock markets: Relative roles of equity and debt flows," Journal of International Money and Finance, Elsevier, vol. 109(C).
    10. Ameer Tamoor Khan & Shuai Li & Xinwei Cao, 2025. "Bridging finance and AI: a comprehensive survey of large language models in financial system," Digital Finance, Springer, vol. 7(4), pages 679-701, December.
    11. Gerald Shively & Ganesh Thapa, 2017. "Markets, Transportation Infrastructure, and Food Prices in Nepal," American Journal of Agricultural Economics, John Wiley & Sons, vol. 99(3), pages 660-682, April.
    12. Kunihiro Miyazaki & Takanobu Kawahara & Stephen Roberts & Stefan Zohren, 2026. "Toward Expert Investment Teams:A Multi-Agent LLM System with Fine-Grained Trading Tasks," Papers 2602.23330, arXiv.org.
    13. Emeka Nkoro & Aham Kelvin Uko, 2016. "Exchange Rate and Inflation Volatility and Stock Prices Volatility: Evidence from Nigeria, 1986-2012," Journal of Applied Finance & Banking, SCIENPRESS Ltd, vol. 6(6), pages 1-4.
    14. Minot, Nicholas, 2014. "Food price volatility in sub-Saharan Africa: Has it really increased?," Food Policy, Elsevier, vol. 45(C), pages 45-56.
    15. Cécile Couharde & Rémi Generoso, 2015. "Hydro-climatic thresholds and economic growth reversals in developing countries: an empirical investigation," EconomiX Working Papers 2015-26, University of Paris Nanterre, EconomiX.
    16. Shively, Gerald E., 2001. "Price thresholds, price volatility, and the private costs of investment in a developing country grain market," Economic Modelling, Elsevier, vol. 18(3), pages 399-414, August.
    17. Eric Ghysels & Leonardo Iania & Jonas Striaukas, 2018. "Quantile-based Inflation Risk Models," Working Paper Research 349, National Bank of Belgium.
    18. Marfatia, Hardik A., 2017. "A fresh look at integration of risks in the international stock markets: A wavelet approach," Review of Financial Economics, Elsevier, vol. 34(C), pages 33-49.
    19. Tomanova, Lucie, 2013. "Exchange Rate Volatility and the Foreign Trade in CEEC," EY International Congress on Economics I (EYC2013), October 24-25, 2013, Ankara, Turkey 267, Ekonomik Yaklasim Association.
    20. Bernard, Jean-Thomas & Idoudi, Nadhem & Khalaf, Lynda & Yelou, Clement, 2007. "Finite sample multivariate structural change tests with application to energy demand models," Journal of Econometrics, Elsevier, vol. 141(2), pages 1219-1244, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2602.07085. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.