IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2604.17327.html

Signal or Noise in Multi-Agent LLM-based Stock Recommendations?

Author

Listed:
  • George Fatouros
  • Kostas Metaxas

Abstract

We present the first portfolio-level validation of MarketSenseAI, a deployed multi-agent LLM equity system. All signals are generated live at each observation date, eliminating look-ahead bias. The system routes four specialist agents (News, Fundamentals, Dynamics, and Macro) through a synthesis agent that issues a monthly equity thesis and recommendation for each stock in its coverage universe, and we ask two questions: do its buy recommendations add value over both passive benchmarks and random selection, and what does the internal agent structure reveal about the source of the edge? On the S&P 500 cohort (19 months) the strong-buy equal-weight portfolio earns +2.18%/month against a passive equal-weight benchmark of +1.15% (approximating RSP), a +25.2% compound excess, and ranks at the 99.7th percentile of 10,000 Monte Carlo portfolios (p=0.003). The S&P 100 cohort (35 months) delivers a +30.5% compound excess over EQWL with consistent direction but formal significance not reached, limited by the small average selection of ~10 stocks per month. Non-negative least-squares projection of thesis embeddings onto agent embeddings reveals an adaptive-integration mechanism. Agent contributions rotate with market regime (Fundamentals leads on S&P 500, Macro on S&P 100, Dynamics acts as an episodic momentum signal) and this agent rotation moves in lockstep with both the sector composition of strong-buy selections and identifiable macro-calendar events, three independent views of the same underlying adaptation. The recommendation's cross-sectional Information Coefficient is statistically significant on S&P 500 (ICIR=+0.489, p=0.024). These results suggest that multi-agent LLM equity systems can identify sources of alpha beyond what classical factor models capture, and that the buy signal functions as an effective universe-filter that can sit upstream of any portfolio-construction process.

Suggested Citation

  • George Fatouros & Kostas Metaxas, 2026. "Signal or Noise in Multi-Agent LLM-based Stock Recommendations?," Papers 2604.17327, arXiv.org.
  • Handle: RePEc:arx:papers:2604.17327
    as

    Download full text from publisher

    File URL: https://arxiv.org/pdf/2604.17327
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Wenxi Geng & Dingyuan Liu & Liya Li & Yiqing Wang, 2026. "Could Large Language Models work as Post-hoc Explainability Tools in Credit Risk Models?," Papers 2602.18895, arXiv.org, revised May 2026.
    2. Diego Vallarino, 2025. "Adaptive Market Intelligence: A Mixture of Experts Framework for Volatility-Sensitive Stock Forecasting," Papers 2508.02686, arXiv.org.
    3. Yiyao Zhang & Diksha Goel & Hussain Ahmad & Claudia Szabo, 2025. "RegimeFolio: A Regime Aware ML System for Sectoral Portfolio Optimization in Dynamic Markets," Papers 2510.14986, arXiv.org.
    4. Fama, Eugene F. & French, Kenneth R., 1993. "Common risk factors in the returns on stocks and bonds," Journal of Financial Economics, Elsevier, vol. 33(1), pages 3-56, February.
    5. Hariom Tatsat & Ariye Shater, 2025. "Beyond the Black Box: Interpretability of LLMs in Finance," Papers 2505.24650, arXiv.org.
    6. Shijie Wu & Ozan Irsoy & Steven Lu & Vadim Dabravolski & Mark Dredze & Sebastian Gehrmann & Prabhanjan Kambadur & David Rosenberg & Gideon Mann, 2023. "BloombergGPT: A Large Language Model for Finance," Papers 2303.17564, arXiv.org, revised Dec 2023.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Zuoyou Jiang & Li Zhao & Rui Sun & Ruohan Sun & Zhongjian Li & Jing Li & Daxin Jiang & Zuo Bai & Cheng Hua, 2025. "Alpha-R1: Alpha Screening with LLM Reasoning via Reinforcement Learning," Papers 2512.23515, arXiv.org.
    2. Yijia Xiao & Edward Sun & Tong Chen & Fang Wu & Di Luo & Wei Wang, 2025. "Trading-R1: Financial Trading with LLM Reasoning via Reinforcement Learning," Papers 2509.11420, arXiv.org.
    3. Maher Hamid, 2026. "Implementing domain-specific LLMs for strategic investment decisions: a retrospective case study comparing AI and human expertise," Digital Finance, Springer, vol. 8(1), pages 1-134, March.
    4. Zhaofeng Zhang & Banghao Chen & Shengxin Zhu & Nicolas Langren'e, 2024. "Quantformer: from attention to profit with a quantitative transformer trading strategy," Papers 2404.00424, arXiv.org, revised Aug 2025.
    5. Yijia Chen, 2026. "Be Water: An Evolutionary Proof for Trend-Following," Papers 2603.29593, arXiv.org.
    6. Zhangyuhua Weng & Shengli Zhang & Taotao Wang & Yihan Xia, 2026. "AlphaLogics: A Market Logic-Driven Multi-Agent System for Scalable and Interpretable Alpha Factor Generation," Papers 2603.20247, arXiv.org.
    7. Vasant Dhar & Jo~ao Sedoc, 2025. "DBOT: Artificial Intelligence for Systematic Long-Term Investing," Papers 2504.05639, arXiv.org.
    8. Masoud Soleimani, 2025. "LLM-Generated Counterfactual Stress Scenarios for Portfolio Risk Simulation via Hybrid Prompt-RAG Pipeline," Papers 2512.07867, arXiv.org.
    9. Takayuki Sakuma, 2025. "Diagram-to-Circuit QNLP for Financial Sentiment Analysis," Papers 2511.18804, arXiv.org, revised Dec 2025.
    10. Darima Fotheringham & Michael A. Wiles, 2023. "The effect of implementing chatbot customer service on stock returns: an event study analysis," Journal of the Academy of Marketing Science, Springer, vol. 51(4), pages 802-822, July.
    11. Christiane Goodfellow & Dirk Schiereck & Steffen Wippler, 2013. "Are behavioural finance equity funds a superior investment? A note on fund performance and market efficiency," Journal of Asset Management, Palgrave Macmillan, vol. 14(2), pages 111-119, April.
    12. Chuan-Hao Hsu & Hung-Gay Fung & Yi-Ping Chang, 2016. "The performance of Taiwanese firms after a share repurchase announcement," Review of Quantitative Finance and Accounting, Springer, vol. 47(4), pages 1251-1269, November.
    13. Frederico Belo & Chen Xue & Lu Zhang, 2010. "Cross-sectional Tobin's Q," NBER Working Papers 16336, National Bureau of Economic Research, Inc.
    14. Manuel Ammann & Philipp Horsch & David Oesch, 2016. "Competing with Superstars," Management Science, INFORMS, vol. 62(10), pages 2842-2858, October.
    15. Bansal, Ravi & Kiku, Dana & Yaron, Amir, 2016. "Risks for the long run: Estimation with time aggregation," Journal of Monetary Economics, Elsevier, vol. 82(C), pages 52-69.
    16. David Hirshleifer & Danling Jiang, 2010. "A Financing-Based Misvaluation Factor and the Cross-Section of Expected Returns," The Review of Financial Studies, Society for Financial Studies, vol. 23(9), pages 3401-3436.
    17. Arthur, Bruno R. & Katchova, Ani L., 2012. "Accruals Anomaly in Agriculture Financial Economics," 2012 Annual Meeting, February 4-7, 2012, Birmingham, Alabama 119822, Southern Agricultural Economics Association.
    18. Shi, Huai-Long & Zhou, Wei-Xing, 2022. "Factor volatility spillover and its implications on factor premia," Journal of International Financial Markets, Institutions and Money, Elsevier, vol. 80(C).
    19. David J. Moore & David McMillan, 2016. "A look at the actual cost of capital of US firms," Cogent Economics & Finance, Taylor & Francis Journals, vol. 4(1), pages 1233628-123, December.
    20. Greg Hebb, 2021. "On the performance of Bank-managed mutual funds: Canadian evidence," Journal of Economics and Finance, Springer;Academy of Economics and Finance, vol. 45(1), pages 22-48, January.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2604.17327. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: https://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.