IDEAS home Printed from https://ideas.repec.org/a/eee/intfor/v41y2025i4p1666-1695.html

Deep learning and NLP in cryptocurrency forecasting: Integrating financial, blockchain, and social media data

Author

Listed:
  • Gurgul, Vincent
  • Lessmann, Stefan
  • Härdle, Wolfgang Karl

Abstract

We introduce novel approaches to cryptocurrency price forecasting, leveraging Machine Learning (ML) and Natural Language Processing (NLP) techniques, with a focus on Bitcoin and Ethereum. By analysing news and social media content, primarily from Twitter and Reddit, we assess the impact of public sentiment on cryptocurrency markets. A distinctive feature of our methodology is the application of the BART MNLI zero-shot classification model to detect bullish and bearish trends, significantly advancing beyond traditional sentiment analysis. Additionally, we systematically compare a range of pre-trained and fine-tuned deep learning NLP models against conventional dictionary-based sentiment analysis methods. Another key contribution of our work is the adoption of local extrema alongside daily price movements as predictive targets, reducing trading frequency and portfolio volatility. Our findings demonstrate that integrating textual data into cryptocurrency price forecasting not only improves forecasting accuracy but also consistently enhances the profitability and Sharpe ratio across various validation scenarios, particularly when applying deep learning NLP techniques. The entire codebase of our experiments is available via an online repository: https://anonymous.4open.science/r/crypto-forecasting-public.

Suggested Citation

  • Gurgul, Vincent & Lessmann, Stefan & Härdle, Wolfgang Karl, 2025. "Deep learning and NLP in cryptocurrency forecasting: Integrating financial, blockchain, and social media data," International Journal of Forecasting, Elsevier, vol. 41(4), pages 1666-1695.
  • Handle: RePEc:eee:intfor:v:41:y:2025:i:4:p:1666-1695
    DOI: 10.1016/j.ijforecast.2025.02.007
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0169207025000147
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.ijforecast.2025.02.007?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. Ahmet Murat Ozbayoglu & Mehmet Ugur Gudelek & Omer Berat Sezer, 2020. "Deep Learning for Financial Applications : A Survey," Papers 2002.05786, arXiv.org.
    2. S M Raju & Ali Mohammad Tarif, 2020. "Real-Time Prediction of BITCOIN Price using Machine Learning Techniques and Public Sentiment Analysis," Papers 2006.14473, arXiv.org.
    3. Fama, Eugene F, 1970. "Efficient Capital Markets: A Review of Theory and Empirical Work," Journal of Finance, American Finance Association, vol. 25(2), pages 383-417, May.
    4. Fan Fang & Carmine Ventre & Michail Basios & Leslie Kanthan & David Martinez-Rego & Fan Wu & Lingbo Li, 2022. "Cryptocurrency trading: a comprehensive survey," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 8(1), pages 1-59, December.
    5. Kate Murray & Andrea Rossi & Diego Carraro & Andrea Visentin, 2023. "On Forecasting Cryptocurrency Prices: A Comparison of Machine Learning, Deep Learning, and Ensembles," Forecasting, MDPI, vol. 5(1), pages 1-14, January.
    6. Chen, Cathy Yi-Hsuan & Després, Roméo & Guo, Li & Renault, Thomas, 2019. "What makes cryptocurrencies special? Investor sentiment and return predictability during the bubble," IRTG 1792 Discussion Papers 2019-016, Humboldt University of Berlin, International Research Training Group 1792 "High Dimensional Nonstationary Time Series".
    7. Leung, Mark T. & Daouk, Hazem & Chen, An-Sing, 2000. "Forecasting stock indices: a comparison of classification and level estimation models," International Journal of Forecasting, Elsevier, vol. 16(2), pages 173-190.
    8. Vytautas Karalevicius & Niels Degrande & Jochen De Weerdt, 2018. "Using sentiment analysis to predict interday Bitcoin price movements," Journal of Risk Finance, Emerald Group Publishing Limited, vol. 19(1), pages 56-75, December.
    9. Zhang, Guoqiang & Eddy Patuwo, B. & Y. Hu, Michael, 1998. "Forecasting with artificial neural networks:: The state of the art," International Journal of Forecasting, Elsevier, vol. 14(1), pages 35-62, March.
    10. Fan Fang & Carmine Ventre & Michail Basios & Leslie Kanthan & Lingbo Li & David Martinez-Regoband & Fan Wu, 2020. "Cryptocurrency Trading: A Comprehensive Survey," Papers 2003.11352, arXiv.org, revised Jan 2022.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Przemysław Grądzki & Piotr Wójcik & Stefan Lessmann, 2025. "Algorithmic crypto trading using information-driven bars, triple barrier labeling and deep learning," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 11(1), pages 1-43, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Muhammad Anas & Syed Jawad Hussain Shahzad & Larisa Yarovaya, 2024. "The use of high-frequency data in cryptocurrency research: a meta-review of literature with bibliometric analysis," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 10(1), pages 1-31, December.
    2. Khaled Mokni & Ghassen El Montasser & Ahdi Noomen Ajmi & Elie Bouri, 2025. "On the Efficiency and Its Drivers in the Cryptocurrency Market: The Case of Bitcoin and Ethereum," Springer Books, in: Gang Kou & Yongqiang Li & Zongyi Zhang & J. Leon Zhao & Zhi Zhuo (ed.), Blockchain, Crypto Assets, and Financial Innovation, pages 162-191, Springer.
    3. Mitja Steinbacher & Matej Steinbacher & Matjaz Steinbacher, 2025. "Using CNN to Model Stock Prices," Computational Economics, Springer;Society for Computational Economics, vol. 66(6), pages 5299-5340, December.
    4. Haji Suleman Ali & Feiyan Jia & Zhiyuan Lou & Jingui Xie, 2023. "Effect of blockchain technology initiatives on firms’ market value," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 9(1), pages 1-35, December.
    5. Przemysław Grądzki & Piotr Wójcik & Stefan Lessmann, 2025. "Algorithmic crypto trading using information-driven bars, triple barrier labeling and deep learning," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 11(1), pages 1-43, December.
    6. Osman, Myriam Ben & Urom, Christian & Guesmi, Khaled & Benkraiem, Ramzi, 2024. "Economic sentiment and the cryptocurrency market in the post-COVID-19 era," International Review of Financial Analysis, Elsevier, vol. 91(C).
    7. Hongshen Yang & Avinash Malik, 2024. "Optimal market-neutral currency trading on the cryptocurrency platform," Papers 2405.15461, arXiv.org, revised Aug 2024.
    8. Vincent Gurgul & Stefan Lessmann & Wolfgang Karl Hardle, 2023. "Deep Learning and NLP in Cryptocurrency Forecasting: Integrating Financial, Blockchain, and Social Media Data," Papers 2311.14759, arXiv.org, revised Oct 2024.
    9. Dushmanta Kumar Padhi & Neelamadhab Padhy & Akash Kumar Bhoi & Jana Shafi & Muhammad Fazal Ijaz, 2021. "A Fusion Framework for Forecasting Financial Market Direction Using Enhanced Ensemble Models and Technical Indicators," Mathematics, MDPI, vol. 9(21), pages 1-31, October.
    10. Hassan Javed & Naveed Khan, 2025. "Do Bitcoin Shocks Dominate Other Cryptocurrencies? An Examination Through GARCH Based Dynamic Models," Asia-Pacific Financial Markets, Springer;Japanese Association of Financial Economics and Engineering, vol. 32(4), pages 1431-1457, December.
    11. Assaf, Ata & Mokni, Khaled & Yousaf, Imran & Bhandari, Avishek, 2023. "Long memory in the high frequency cryptocurrency markets using fractal connectivity analysis: The impact of COVID-19," Research in International Business and Finance, Elsevier, vol. 64(C).
    12. Shen, Dehua & Wu, Yize, 2025. "The role of Guru investor in Bitcoin: Evidence from Kolmogorov-Arnold Networks," Research in International Business and Finance, Elsevier, vol. 75(C).
    13. Fallah, Mir Feiz & Pourmansouri, Rezvan & Ahmadpour, Bahador, 2024. "Presenting a new deep learning-based method with the incorporation of error effects to predict certain cryptocurrencies," International Review of Financial Analysis, Elsevier, vol. 95(PC).
    14. Yousaf, Imran & Youssef, Manel & Goodell, John W., 2022. "Quantile connectedness between sentiment and financial markets: Evidence from the S&P 500 twitter sentiment index," International Review of Financial Analysis, Elsevier, vol. 83(C).
    15. Bouteska, Ahmed & Sharif, Taimur & Isskandarani, Layal & Abedin, Mohammad Zoynul, 2025. "Market efficiency and its determinants: Macro-level dynamics and micro-level characteristics of cryptocurrencies," International Review of Economics & Finance, Elsevier, vol. 98(C).
    16. Pedro Reis & Ana Paula Serra & Jo~ao Gama, 2025. "The Role of Deep Learning in Financial Asset Management: A Systematic Review," Papers 2503.01591, arXiv.org.
    17. Shuozhe Li & Du Cheng & Leqi Liu, 2026. "A Learnable Wavelet Transformer for Long-Short Equity Trading and Risk-Adjusted Return Optimization," Papers 2601.13435, arXiv.org, revised Mar 2026.
    18. Bartosz Bieganowski & Robert 'Slepaczuk, 2024. "Supervised Autoencoders with Fractionally Differentiated Features and Triple Barrier Labelling Enhance Predictions on Noisy Data," Papers 2411.12753, arXiv.org, revised Nov 2024.
    19. Soria, Jorge & Moya, Jorge & Mohazab, Amin, 2023. "Optimal mining in proof-of-work blockchain protocols," Finance Research Letters, Elsevier, vol. 53(C).
    20. Siu Hin Tang & Mathieu Rosenbaum & Chao Zhou, 2023. "Forecasting Volatility with Machine Learning and Rough Volatility: Example from the Crypto-Winter," Papers 2311.04727, arXiv.org, revised Feb 2024.

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:intfor:v:41:y:2025:i:4:p:1666-1695. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/ijforecast .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.