IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2509.01393.html
   My bibliography  Save this paper

Adaptive Alpha Weighting with PPO: Enhancing Prompt-Based LLM-Generated Alphas in Quant Trading

Author

Listed:
  • Qizhao Chen
  • Hiroaki Kawashima

Abstract

This paper proposes a reinforcement learning framework that employs Proximal Policy Optimization (PPO) to dynamically optimize the weights of multiple large language model (LLM)-generated formulaic alphas for stock trading strategies. Formulaic alphas are mathematically defined trading signals derived from price, volume, sentiment, and other data. Although recent studies have shown that LLMs can generate diverse and effective alphas, a critical challenge lies in how to adaptively integrate them under varying market conditions. To address this gap, we leverage the deepseek-r1-distill-llama-70b model to generate fifty alphas for five major stocks: Apple, HSBC, Pepsi, Toyota, and Tencent, and then use PPO to adjust their weights in real time. Experimental results demonstrate that the PPO-optimized strategy achieves strong returns and high Sharpe ratios across most stocks, outperforming both an equal-weighted alpha portfolio and traditional benchmarks such as the Nikkei 225, S&P 500, and Hang Seng Index. The findings highlight the importance of reinforcement learning in the allocation of alpha weights and show the potential of combining LLM-generated signals with adaptive optimization for robust financial forecasting and trading.

Suggested Citation

  • Qizhao Chen & Hiroaki Kawashima, 2025. "Adaptive Alpha Weighting with PPO: Enhancing Prompt-Based LLM-Generated Alphas in Quant Trading," Papers 2509.01393, arXiv.org.
  • Handle: RePEc:arx:papers:2509.01393
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2509.01393
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Wenjie Lu & Jiazheng Li & Yifan Li & Aijun Sun & Jingyang Wang, 2020. "A CNN-LSTM-Based Model to Forecast Stock Prices," Complexity, Hindawi, vol. 2020, pages 1-10, November.
    2. Gang Huang & Xiaohua Zhou & Qingyang Song, 2024. "A Deep Reinforcement Learning Framework for Dynamic Portfolio Optimization: Evidence from China's Stock Market," Papers 2412.18563, arXiv.org, revised Feb 2025.
    3. Jilin Zhang & Lishi Ye & Yongzeng Lai, 2023. "Stock Price Prediction Using CNN-BiLSTM-Attention Model," Mathematics, MDPI, vol. 11(9), pages 1-18, April.
    4. Fama, Eugene F. & French, Kenneth R., 1993. "Common risk factors in the returns on stocks and bonds," Journal of Financial Economics, Elsevier, vol. 33(1), pages 3-56, February.
    5. Hongyang Yang & Xiao-Yang Liu & Christina Dan Wang, 2023. "FinGPT: Open-Source Financial Large Language Models," Papers 2306.06031, arXiv.org.
    6. Zhizhuo Kou & Holam Yu & Junyu Luo & Jingshu Peng & Xujia Li & Chengzhong Liu & Juntao Dai & Lei Chen & Sirui Han & Yike Guo, 2024. "Automate Strategy Finding with LLM in Quant Investment," Papers 2409.06289, arXiv.org, revised Nov 2025.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Yichen Luo & Yebo Feng & Jiahua Xu & Paolo Tasca & Yang Liu, 2025. "LLM-Powered Multi-Agent System for Automated Crypto Portfolio Management," Papers 2501.00826, arXiv.org, revised Jan 2025.
    2. Yang, Jiahao & Fang, Ran & Zhang, Ming & Zhang, Wenkai & Zhou, Jun, 2025. "Enhancing stock ranking forecasting by modeling returns with heteroscedastic Gaussian Distribution," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 664(C).
    3. Ruoyu Guo & Haochen Qiu & Xuelun Hou, 2025. "A Novel Loss Function for Deep Learning Based Daily Stock Trading System," Papers 2502.17493, arXiv.org, revised Nov 2025.
    4. Darima Fotheringham & Michael A. Wiles, 2023. "The effect of implementing chatbot customer service on stock returns: an event study analysis," Journal of the Academy of Marketing Science, Springer, vol. 51(4), pages 802-822, July.
    5. Christiane Goodfellow & Dirk Schiereck & Steffen Wippler, 2013. "Are behavioural finance equity funds a superior investment? A note on fund performance and market efficiency," Journal of Asset Management, Palgrave Macmillan, vol. 14(2), pages 111-119, April.
    6. Chuan-Hao Hsu & Hung-Gay Fung & Yi-Ping Chang, 2016. "The performance of Taiwanese firms after a share repurchase announcement," Review of Quantitative Finance and Accounting, Springer, vol. 47(4), pages 1251-1269, November.
    7. Frederico Belo & Chen Xue & Lu Zhang, 2010. "Cross-sectional Tobin's Q," NBER Working Papers 16336, National Bureau of Economic Research, Inc.
    8. Manuel Ammann & Philipp Horsch & David Oesch, 2016. "Competing with Superstars," Management Science, INFORMS, vol. 62(10), pages 2842-2858, October.
    9. Bansal, Ravi & Kiku, Dana & Yaron, Amir, 2016. "Risks for the long run: Estimation with time aggregation," Journal of Monetary Economics, Elsevier, vol. 82(C), pages 52-69.
    10. David Hirshleifer & Danling Jiang, 2010. "A Financing-Based Misvaluation Factor and the Cross-Section of Expected Returns," The Review of Financial Studies, Society for Financial Studies, vol. 23(9), pages 3401-3436.
    11. Arthur, Bruno R. & Katchova, Ani L., 2012. "Accruals Anomaly in Agriculture Financial Economics," 2012 Annual Meeting, February 4-7, 2012, Birmingham, Alabama 119822, Southern Agricultural Economics Association.
    12. Shi, Huai-Long & Zhou, Wei-Xing, 2022. "Factor volatility spillover and its implications on factor premia," Journal of International Financial Markets, Institutions and Money, Elsevier, vol. 80(C).
    13. David J. Moore & David McMillan, 2016. "A look at the actual cost of capital of US firms," Cogent Economics & Finance, Taylor & Francis Journals, vol. 4(1), pages 1233628-123, December.
    14. Greg Hebb, 2021. "On the performance of Bank-managed mutual funds: Canadian evidence," Journal of Economics and Finance, Springer;Academy of Economics and Finance, vol. 45(1), pages 22-48, January.
    15. Eun, Cheol & Lee, Kyuseok & Wei, Fengrong, 2023. "Dual role of the country factors in international asset pricing: The local factors and proxies for the global factors," International Review of Financial Analysis, Elsevier, vol. 89(C).
    16. Muhammad Kashif & Thomas Leirvik, 2022. "The MAX Effect in an Oil Exporting Country: The Case of Norway," JRFM, MDPI, vol. 15(4), pages 1-16, March.
    17. Venkatesh Shankar & Pablo Azar & Matthew Fuller, 2008. "—: A Multicategory Brand Equity Model and Its Application at Allstate," Marketing Science, INFORMS, vol. 27(4), pages 567-584, 07-08.
    18. James Christopher Westland, 2020. "Predicting credit card fraud with Sarbanes‐Oxley assessments and Fama‐French risk factors," Intelligent Systems in Accounting, Finance and Management, John Wiley & Sons, Ltd., vol. 27(2), pages 95-107, April.
    19. Siganos, Antonios, 2013. "Google attention and target price run ups," International Review of Financial Analysis, Elsevier, vol. 29(C), pages 219-226.
    20. Bonhomme, Stphane & Robin, Jean-Marc, 2009. "Consistent noisy independent component analysis," Journal of Econometrics, Elsevier, vol. 149(1), pages 12-25, April.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2509.01393. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.