IDEAS home Printed from https://ideas.repec.org/a/aac/ijirss/v8y2025i5p1543-1550id9176.html
   My bibliography  Save this article

Aligning social signals with market outcomes: A reinforcement learning approach to noisy sentiment data in stock forecasting

Author

Listed:
  • Yangjun Lu

Abstract

Stock price prediction relies heavily on market sentiment, but it is still difficult to identify truly meaningful signals from social media because of the volume of noise and flimsy engagement measurements. In this paper, sentiment signal extraction is formulated as a delayed reward recommendation issue using a unique reinforcement learning framework. The method uses actual market input instead of static labels by simulating the task as an agent that learns to suggest tweets based on their potential to increase predicted accuracy. Due to the scarce and delayed nature of market signals, the agent may differentiate between those that are useful and those that are misleading or irrelevant by using a reward function that aligns with 48-hour post-publication stock movements. When combined with a Long Short-Term Memory (LSTM) network for price forecasting, the suggested approach shows that, in contrast to employing sentiment variables directly, integrating the RL-based suggestion enhances prediction performance. Tests conducted on Apple stock data from 2014 to 2016 demonstrate that using agent-selected tweets improves the system's R-squared scores and reduces prediction errors. The findings indicate that dynamically adjusting comment selection in unstable financial settings can be achieved using reinforcement learning. This study combines market-based incentive design, financial text mining, and reinforcement learning to provide a viable method for improving the reliability of sentiment-driven stock predictions. To further validate this framework's robustness, future research may expand it to include more equities and explore integrating it with other market signals.

Suggested Citation

  • Yangjun Lu, 2025. "Aligning social signals with market outcomes: A reinforcement learning approach to noisy sentiment data in stock forecasting," International Journal of Innovative Research and Scientific Studies, Innovative Research Publishing, vol. 8(5), pages 1543-1550.
  • Handle: RePEc:aac:ijirss:v:8:y:2025:i:5:p:1543-1550:id:9176
    as

    Download full text from publisher

    File URL: https://ijirss.com/index.php/ijirss/article/view/9176/2060
    Download Restriction: no
    ---><---

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:aac:ijirss:v:8:y:2025:i:5:p:1543-1550:id:9176. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Natalie Jean (email available below). General contact details of provider: https://ijirss.com/index.php/ijirss/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.