Adaptive Alpha Weighting with PPO: Enhancing Prompt-Based LLM-Generated Alphas in Quant Trading

My bibliography Save this paper

Adaptive Alpha Weighting with PPO: Enhancing Prompt-Based LLM-Generated Alphas in Quant Trading

Author

Listed:

Qizhao Chen
Hiroaki Kawashima

Registered:

Abstract

This paper proposes a reinforcement learning framework that employs Proximal Policy Optimization (PPO) to dynamically optimize the weights of multiple large language model (LLM)-generated formulaic alphas for stock trading strategies. Formulaic alphas are mathematically defined trading signals derived from price, volume, sentiment, and other data. Although recent studies have shown that LLMs can generate diverse and effective alphas, a critical challenge lies in how to adaptively integrate them under varying market conditions. To address this gap, we leverage the deepseek-r1-distill-llama-70b model to generate fifty alphas for five major stocks: Apple, HSBC, Pepsi, Toyota, and Tencent, and then use PPO to adjust their weights in real time. Experimental results demonstrate that the PPO-optimized strategy achieves strong returns and high Sharpe ratios across most stocks, outperforming both an equal-weighted alpha portfolio and traditional benchmarks such as the Nikkei 225, S&P 500, and Hang Seng Index. The findings highlight the importance of reinforcement learning in the allocation of alpha weights and show the potential of combining LLM-generated signals with adaptive optimization for robust financial forecasting and trading.

Suggested Citation

Qizhao Chen & Hiroaki Kawashima, 2025. "Adaptive Alpha Weighting with PPO: Enhancing Prompt-Based LLM-Generated Alphas in Quant Trading," Papers 2509.01393, arXiv.org.

Handle: RePEc:arx:papers:2509.01393

Download full text from publisher

References listed on IDEAS

Wenjie Lu & Jiazheng Li & Yifan Li & Aijun Sun & Jingyang Wang, 2020. "A CNN-LSTM-Based Model to Forecast Stock Prices," Complexity, Hindawi, vol. 2020, pages 1-10, November.
Gang Huang & Xiaohua Zhou & Qingyang Song, 2024. "A Deep Reinforcement Learning Framework for Dynamic Portfolio Optimization: Evidence from China's Stock Market," Papers 2412.18563, arXiv.org, revised Feb 2025.
Jilin Zhang & Lishi Ye & Yongzeng Lai, 2023. "Stock Price Prediction Using CNN-BiLSTM-Attention Model," Mathematics, MDPI, vol. 11(9), pages 1-18, April.
Fama, Eugene F. & French, Kenneth R., 1993. "Common risk factors in the returns on stocks and bonds," Journal of Financial Economics, Elsevier, vol. 33(1), pages 3-56, February.
Hongyang Yang & Xiao-Yang Liu & Christina Dan Wang, 2023. "FinGPT: Open-Source Financial Large Language Models," Papers 2306.06031, arXiv.org.
Zhizhuo Kou & Holam Yu & Junyu Luo & Jingshu Peng & Xujia Li & Chengzhong Liu & Juntao Dai & Lei Chen & Sirui Han & Yike Guo, 2024. "Automate Strategy Finding with LLM in Quant Investment," Papers 2409.06289, arXiv.org, revised Nov 2025.

Full references (including those not matched with items on IDEAS)

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Yichen Luo & Yebo Feng & Jiahua Xu & Paolo Tasca & Yang Liu, 2025. "LLM-Powered Multi-Agent System for Automated Crypto Portfolio Management," Papers 2501.00826, arXiv.org, revised Jan 2025.
Yang, Jiahao & Fang, Ran & Zhang, Ming & Zhang, Wenkai & Zhou, Jun, 2025. "Enhancing stock ranking forecasting by modeling returns with heteroscedastic Gaussian Distribution," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 664(C).
Ruoyu Guo & Haochen Qiu & Xuelun Hou, 2025. "A Novel Loss Function for Deep Learning Based Daily Stock Trading System," Papers 2502.17493, arXiv.org, revised Nov 2025.
Darima Fotheringham & Michael A. Wiles, 2023. "The effect of implementing chatbot customer service on stock returns: an event study analysis," Journal of the Academy of Marketing Science, Springer, vol. 51(4), pages 802-822, July.
Christiane Goodfellow & Dirk Schiereck & Steffen Wippler, 2013. "Are behavioural finance equity funds a superior investment? A note on fund performance and market efficiency," Journal of Asset Management, Palgrave Macmillan, vol. 14(2), pages 111-119, April.
Chuan-Hao Hsu & Hung-Gay Fung & Yi-Ping Chang, 2016. "The performance of Taiwanese firms after a share repurchase announcement," Review of Quantitative Finance and Accounting, Springer, vol. 47(4), pages 1251-1269, November.
Frederico Belo & Chen Xue & Lu Zhang, 2010. "Cross-sectional Tobin's Q," NBER Working Papers 16336, National Bureau of Economic Research, Inc.
Manuel Ammann & Philipp Horsch & David Oesch, 2016. "Competing with Superstars," Management Science, INFORMS, vol. 62(10), pages 2842-2858, October.
- Ammann, Manuel & Horsch, Philipp & Oesch, David, 2015. "Competing with Superstars," Working Papers on Finance 1510, University of St. Gallen, School of Finance, revised Jan 2016.
Bansal, Ravi & Kiku, Dana & Yaron, Amir, 2016. "Risks for the long run: Estimation with time aggregation," Journal of Monetary Economics, Elsevier, vol. 82(C), pages 52-69.
- Ravi Bansal & Dana Kiku & Amir Yaron, 2012. "Risks For the Long Run: Estimation with Time Aggregation," NBER Working Papers 18305, National Bureau of Economic Research, Inc.
David Hirshleifer & Danling Jiang, 2010. "A Financing-Based Misvaluation Factor and the Cross-Section of Expected Returns," The Review of Financial Studies, Society for Financial Studies, vol. 23(9), pages 3401-3436.
- Hirshleifer, David & Jiang, Danling, 2007. "A Financing-Based Misvaluation Factor and the Cross Section of Expected Returns," MPRA Paper 20636, University Library of Munich, Germany, revised 10 Feb 2010.
Arthur, Bruno R. & Katchova, Ani L., 2012. "Accruals Anomaly in Agriculture Financial Economics," 2012 Annual Meeting, February 4-7, 2012, Birmingham, Alabama 119822, Southern Agricultural Economics Association.
Shi, Huai-Long & Zhou, Wei-Xing, 2022. "Factor volatility spillover and its implications on factor premia," Journal of International Financial Markets, Institutions and Money, Elsevier, vol. 80(C).
David J. Moore & David McMillan, 2016. "A look at the actual cost of capital of US firms," Cogent Economics & Finance, Taylor & Francis Journals, vol. 4(1), pages 1233628-123, December.
Greg Hebb, 2021. "On the performance of Bank-managed mutual funds: Canadian evidence," Journal of Economics and Finance, Springer;Academy of Economics and Finance, vol. 45(1), pages 22-48, January.
Eun, Cheol & Lee, Kyuseok & Wei, Fengrong, 2023. "Dual role of the country factors in international asset pricing: The local factors and proxies for the global factors," International Review of Financial Analysis, Elsevier, vol. 89(C).
Muhammad Kashif & Thomas Leirvik, 2022. "The MAX Effect in an Oil Exporting Country: The Case of Norway," JRFM, MDPI, vol. 15(4), pages 1-16, March.
Venkatesh Shankar & Pablo Azar & Matthew Fuller, 2008. "—: A Multicategory Brand Equity Model and Its Application at Allstate," Marketing Science, INFORMS, vol. 27(4), pages 567-584, 07-08.
James Christopher Westland, 2020. "Predicting credit card fraud with Sarbanes‐Oxley assessments and Fama‐French risk factors," Intelligent Systems in Accounting, Finance and Management, John Wiley & Sons, Ltd., vol. 27(2), pages 95-107, April.
Siganos, Antonios, 2013. "Google attention and target price run ups," International Review of Financial Analysis, Elsevier, vol. 29(C), pages 219-226.
Bonhomme, Stphane & Robin, Jean-Marc, 2009. "Consistent noisy independent component analysis," Journal of Econometrics, Elsevier, vol. 149(1), pages 12-25, April.
- Stéphane Bonhomme & Jean-Marc Robin, 2008. "Consistent noisy independent component analysis," CeMMAP working papers CWP04/08, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
- Jean-Marc Robin & Stéphane Bonhomme, 2009. "Consistent Noisy Independent Component Analysis," SciencePo Working papers Main hal-01022621, HAL.
- Stéphane Bonhomme & Jean-Marc Robin, 2009. "Consistent Noisy Independent Component Analysis," PSE-Ecole d'économie de Paris (Postprint) hal-00642732, HAL.
- Stéphane Bonhomme & Jean-Marc Robin, 2009. "Consistent Noisy Independent Component Analysis," Post-Print hal-00642732, HAL.
- Jean-Marc Robin & Stéphane Bonhomme, 2009. "Consistent Noisy Independent Component Analysis," Post-Print hal-01022621, HAL.
- Stéphane Bonhomme & Jean-Marc Robin, 2009. "Consistent Noisy Independent Component Analysis," Université Paris1 Panthéon-Sorbonne (Post-Print and Working Papers) hal-00642732, HAL.

More about this item

NEP fields

This paper has been announced in the following NEP Reports:

NEP-CMP-2025-09-08 (Computational Economics)
NEP-FMK-2025-09-08 (Financial Markets)
NEP-FOR-2025-09-08 (Forecasting)

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2509.01393. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Adaptive Alpha Weighting with PPO: Enhancing Prompt-Based LLM-Generated Alphas in Quant Trading

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Most related items

More about this item

NEP fields

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data