PriceSeer: Evaluating Large Language Models in Real-Time Stock Prediction

Authors

  • Bohan Liang
  • Zijian Chen
  • Qi Jia
  • Kaiwei Zhang
  • Kaiyuan Ji
  • Guangtao Zhai

Abstract

Stock prediction, a subject closely related to people's investment activities in fully dynamic and live environments, has been widely studied. Current large language models (LLMs) have shown remarkable potential in various domains, exhibiting expert-level performance through advanced reasoning and contextual understanding. In this paper, we introduce PriceSeer, a live, dynamic, and data-uncontaminated benchmark specifically designed for LLMs performing stock prediction tasks. Specifically, PriceSeer includes 110 U.S. stocks from 11 industrial sectors, with each containing 249 historical data points. Our benchmark implements both internal and external information expansion, where LLMs receive extra financial indicators, news, and fake news to perform stock price prediction. We evaluate six cutting-edge LLMs under different prediction horizons, demonstrating their potential in generating investment strategies after obtaining accurate price predictions for different sectors. Additionally, we provide analyses of LLMs' suboptimal performance in long-term predictions, including the vulnerability to fake news and specific industries. The code and evaluation data will be open-sourced at https://github.com/BobLiang2113/PriceSeer.

Suggested Citation

  • Bohan Liang & Zijian Chen & Qi Jia & Kaiwei Zhang & Kaiyuan Ji & Guangtao Zhai, 2025. "PriceSeer: Evaluating Large Language Models in Real-Time Stock Prediction," Papers 2601.06088, arXiv.org.
  • Handle: RePEc:arx:papers:2601.06088

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2601.06088
    File Function: Latest version
    Download Restriction: no

    References listed on IDEAS

    1. Haofei Yu & Fenghai Li & Jiaxuan You, 2025. "LiveTradeBench: Seeking Real-World Alpha with Large Language Models," Papers 2511.03628, arXiv.org.
    2. Tianyu Fan & Yuhao Yang & Yangqin Jiang & Yifei Zhang & Yuxuan Chen & Chao Huang, 2025. "AI-Trader: Benchmarking Autonomous Agents in Real-Time Financial Markets," Papers 2512.10971, arXiv.org.
    3. Kelvin J. L. Koa & Yunshan Ma & Ritchie Ng & Tat-Seng Chua, 2024. "Learning to Generate Explainable Stock Predictions using Self-Reflective Large Language Models," Papers 2402.03659, arXiv.org, revised Feb 2024.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Yijia Xiao & Edward Sun & Tong Chen & Fang Wu & Di Luo & Wei Wang, 2025. "Trading-R1: Financial Trading with LLM Reasoning via Reinforcement Learning," Papers 2509.11420, arXiv.org.
    2. Wentao Zhang & Mingxuan Zhao & Jincheng Gao & Jieshun You & Huaiyu Jia & Yilei Zhao & Bo An & Shuo Sun, 2026. "AlphaForgeBench: Benchmarking End-to-End Trading Strategy Design with Large Language Models," Papers 2602.18481, arXiv.org.
    3. Yuan Li & Bingqiao Luo & Qian Wang & Nuo Chen & Xu Liu & Bingsheng He, 2024. "A Reflective LLM-based Agent to Guide Zero-shot Cryptocurrency Trading," Papers 2407.09546, arXiv.org.
    4. Zuoyou Jiang & Li Zhao & Rui Sun & Ruohan Sun & Zhongjian Li & Jing Li & Daxin Jiang & Zuo Bai & Cheng Hua, 2025. "Alpha-R1: Alpha Screening with LLM Reasoning via Reinforcement Learning," Papers 2512.23515, arXiv.org.
    5. Zefeng Chen & Darcy Pu, 2026. "Autonomous Market Intelligence: Agentic AI Nowcasting Predicts Stock Returns," Papers 2601.11958, arXiv.org.
    6. Yuzhe Yang & Yifei Zhang & Yan Hu & Yilin Guo & Ruoli Gan & Yueru He & Mingcong Lei & Xiao Zhang & Haining Wang & Qianqian Xie & Jimin Huang & Honghai Yu & Benyou Wang, 2024. "UCFE: A User-Centric Financial Expertise Benchmark for Large Language Models," Papers 2410.14059, arXiv.org, revised Feb 2025.
