When LLM Signals Hurt: A Coverage-Density Analysis of LLM-Augmented Reinforcement Learning for Stock Trading

When LLM Signals Hurt: A Coverage-Density Analysis of LLM-Augmented Reinforcement Learning for Stock Trading

Author

Listed:

Kausar, Shafiya
(INSEAD)

Abstract

We evaluate LLM-augmented reinforcement learning for stock trading on Nasdaq- 100 (2019–2023) and report a previously unmeasured experimental phenomenon: the relationship between LLM signal coverage density and trading performance is non-monotonic, with a clearly identifiable harmful regime. In a controlled coverage sweep over {0%,5%, 20%, 50%, 80%, 100%}, signal injection at 5% and 20% coverage degrades performance below the no-signal baseline, becoming net-positive only at ≥ 50% coverage. The FNSPID dataset’s 9.7% non-neutral coverage sits inside this harmful regime—meaning that for typical research configurations available today, adding LLM signals to the RL pipeline reduces returns. Beyond this density finding, we report three further negative results that the LLMRL trading literature has not adequately addressed. First, our LLM-augmented RL agent (158.11% cumulative return as a 3-seed ensemble) is outperformed by three standard non-RL baselines that prior work in this thread does not report: momentum top-10 (250.45%), equal-weight buy-and-hold (235.00%), and equal-weight monthly rebalanced (214.06%), all of which also exceed the Nasdaq- 100 buy-and-hold benchmark (164.52%). Second, we control for the daily-vs.- monthly rebalancing-frequency confound by deploying the same trained agents under matched-frequency monthly execution; the monthly variant underperforms its daily counterpart by 47pp (111.01% vs. 158.11%), confirming that the baseline gap is not driven by transaction-cost differences. Third, a v3-matched ablation finds that removing the CVaR tail-risk constraint produces a difference within the seedto- seed variability of the experiment. Across two independent runs, the sign of this difference flipped, providing direct empirical evidence that the algorithmic risk-tail machinery contributes no detectable return benefit in this setting. A regime decomposition reveals one clear win for the agent: in the 2023 recovery period, the 3-seed ensemble (52.6%) outperforms all non-RL baselines, suggesting the learned policy may have regime-specific advantages that single-window evaluation obscures. We argue that LLM-RL trading research should adopt non-RL baselines as standard practice, report signal coverage density as a first-class experimental variable, and decompose results by regime. Code and trained models are available at https: //anonymous.4open.science/r/signal-density-llm-trading-9966/.

Suggested Citation

Kausar, Shafiya, 2026. "When LLM Signals Hurt: A Coverage-Density Analysis of LLM-Augmented Reinforcement Learning for Stock Trading," SocArXiv nxvdp_v1, Center for Open Science.

Handle: RePEc:osf:socarx:nxvdp_v1
DOI: 10.31219/osf.io/nxvdp_v1

Download full text from publisher

References listed on IDEAS

Kumar Yashaswi, 2021. "Deep Reinforcement Learning for Portfolio Optimization using Latent Feature State Space (LFSS) Module," Papers 2102.06233, arXiv.org.
repec:dau:papers:123456789/4688 is not listed on IDEAS
Clifford S. Asness & Tobias J. Moskowitz & Lasse Heje Pedersen, 2013. "Value and Momentum Everywhere," Journal of Finance, American Finance Association, vol. 68(3), pages 929-985, June.
Ananya Unnikrishnan, 2024. "Financial News-Driven LLM Reinforcement Learning for Portfolio Management," Papers 2411.11059, arXiv.org.
Zihan Dong & Xinyu Fan & Zhiyuan Peng, 2024. "FNSPID: A Comprehensive Financial News Dataset in Time Series," Papers 2402.06698, arXiv.org.
Jegadeesh, Narasimhan & Titman, Sheridan, 1993. "Returns to Buying Winners and Selling Losers: Implications for Stock Market Efficiency," Journal of Finance, American Finance Association, vol. 48(1), pages 65-91, March.
Yangyang Yu & Haohang Li & Zhi Chen & Yuechen Jiang & Yang Li & Denghui Zhang & Rong Liu & Jordan W. Suchow & Khaldoun Khashanah, 2023. "FinMem: A Performance-Enhanced LLM Trading Agent with Layered Memory and Character Design," Papers 2311.13743, arXiv.org, revised Dec 2023.

Full references (including those not matched with items on IDEAS)

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Klaus Grobys & James W. Kolari & Jere Rutanen, 2022. "Factor momentum, option-implied volatility scaling, and investor sentiment," Journal of Asset Management, Palgrave Macmillan, vol. 23(2), pages 138-155, March.
Onishchenko, Olena & Zhao, Jing & Kongahawatte, Sampath & Kuruppuarachchi, Duminda, 2024. "Investor heterogeneity and anchoring-induced momentum," Journal of Behavioral and Experimental Finance, Elsevier, vol. 42(C).
Eero Pätäri & Timo Leivo, 2017. "A Closer Look At Value Premium: Literature Review And Synthesis," Journal of Economic Surveys, Wiley Blackwell, vol. 31(1), pages 79-168, February.
Cakici, Nusret & Zaremba, Adam, 2022. "Salience theory and the cross-section of stock returns: International and further evidence," Journal of Financial Economics, Elsevier, vol. 146(2), pages 689-725.
Kwon, Oh Kang & Satchell, Stephen, 2018. "The distribution of cross sectional momentum returns," Journal of Economic Dynamics and Control, Elsevier, vol. 94(C), pages 225-241.
Yuming Li, 2017. "Risks and rewards for momentum and reversal portfolios," Financial Markets and Portfolio Management, Springer;Swiss Society for Financial Market Research, vol. 31(3), pages 289-315, August.
Kobana Abukari & Isaac Otchere, 2020. "Dominance of hybrid contratum strategies over momentum and contrarian strategies: half a century of evidence," Financial Markets and Portfolio Management, Springer;Swiss Society for Financial Market Research, vol. 34(4), pages 471-505, December.
Chen, Zhuo & Lu, Andrea, 2017. "Slow diffusion of information and price momentum in stocks: Evidence from options markets," Journal of Banking & Finance, Elsevier, vol. 75(C), pages 98-108.
Xingyue Pu & Stephen Roberts & Xiaowen Dong & Stefan Zohren, 2023. "Network Momentum across Asset Classes," Papers 2308.11294, arXiv.org.
Nicholas Apergis & Vasilios Plakandaras & Ioannis Pragidis, 2022. "Industry momentum and reversals in stock markets," International Journal of Finance & Economics, John Wiley & Sons, Ltd., vol. 27(3), pages 3093-3138, July.
Cakici, Nusret & Tang, Yi & Yan, An, 2016. "Do the size, value, and momentum factors drive stock returns in emerging markets?," Journal of International Money and Finance, Elsevier, vol. 69(C), pages 179-204.
Pätäri, Eero & Karell, Ville & Luukka, Pasi & Yeomans, Julian S, 2018. "Comparison of the multicriteria decision-making methods for equity portfolio selection: The U.S. evidence," European Journal of Operational Research, Elsevier, vol. 265(2), pages 655-672.
Carlo A. Favero & Alessandro Melone, 2019. "Asset Pricing vs Asset Expected Returning in Factor Models," Working Papers 651, IGIER (Innocenzo Gasparini Institute for Economic Research), Bocconi University.
de Oliveira Souza, Thiago, 2019. "A critique of momentum anomalies," Discussion Papers on Economics 5/2019, University of Southern Denmark, Department of Economics.
Blackburn, Douglas W. & Cakici, Nusret, 2017. "Overreaction and the cross-section of returns: International evidence," Journal of Empirical Finance, Elsevier, vol. 42(C), pages 1-14.
He, Chaohua & Jiang, Cheng & Molyboga, Marat, 2019. "Risk premia in Chinese commodity markets," Journal of Commodity Markets, Elsevier, vol. 15(C), pages 1-1.
Yu, Lin & Liu, Xiaoquan & Fung, Hung-Gay & Leung, Wai Kin, 2020. "Size and value effects in high-tech industries: The role of R&D investment," The North American Journal of Economics and Finance, Elsevier, vol. 51(C).
Adam Zaremba & Jacob Koby Shemer, 2018. "Price-Based Investment Strategies," Springer Books, Springer, number 978-3-319-91530-2, March.
Sharifkhani, Ali & Simutin, Mikhail, 2021. "Feedback loops in industry trade networks and the term structure of momentum profits," Journal of Financial Economics, Elsevier, vol. 141(3), pages 1171-1187.
Lindaas, Knut F. & Simlai, Prodosh, 2014. "The value premium, aggregate risk innovations, and average stock returns," Finance Research Letters, Elsevier, vol. 11(3), pages 303-317.

More about this item

NEP fields

This paper has been announced in the following NEP Reports:

NEP-EXP-2026-05-25 (Experimental Economics)

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:osf:socarx:nxvdp_v1. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: OSF (email available below). General contact details of provider: https://arabixiv.org .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

When LLM Signals Hurt: A Coverage-Density Analysis of LLM-Augmented Reinforcement Learning for Stock Trading

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Most related items

More about this item

NEP fields

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data