IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2204.05781.html
   My bibliography  Save this paper

Forecasting Cryptocurrency Returns from Sentiment Signals: An Analysis of BERT Classifiers and Weak Supervision

Author

Listed:
  • Duygu Ider
  • Stefan Lessmann

Abstract

Anticipating price developments in financial markets is a topic of continued interest in forecasting. Funneled by advancements in deep learning and natural language processing (NLP) together with the availability of vast amounts of textual data in form of news articles, social media postings, etc., an increasing number of studies incorporate text-based predictors in forecasting models. We contribute to this literature by introducing weak learning, a recently proposed NLP approach to address the problem that text data is unlabeled. Without a dependent variable, it is not possible to finetune pretrained NLP models on a custom corpus. We confirm that finetuning using weak labels enhances the predictive value of text-based features and raises forecast accuracy in the context of predicting cryptocurrency returns. More fundamentally, the modeling paradigm we present, weak labeling domain-specific text and finetuning pretrained NLP models, is universally applicable in (financial) forecasting and unlocks new ways to leverage text data.

Suggested Citation

  • Duygu Ider & Stefan Lessmann, 2022. "Forecasting Cryptocurrency Returns from Sentiment Signals: An Analysis of BERT Classifiers and Weak Supervision," Papers 2204.05781, arXiv.org, revised Mar 2023.
  • Handle: RePEc:arx:papers:2204.05781
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2204.05781
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Fernandez-Rodriguez, Fernando & Gonzalez-Martel, Christian & Sosvilla-Rivero, Simon, 2000. "On the profitability of technical trading rules based on artificial neural networks:: Evidence from the Madrid stock market," Economics Letters, Elsevier, vol. 69(1), pages 89-94, October.
    2. Timmermann, Allan & Granger, Clive W. J., 2004. "Efficient market hypothesis and forecasting," International Journal of Forecasting, Elsevier, vol. 20(1), pages 15-27.
    3. Young Bin Kim & Jun Gi Kim & Wook Kim & Jae Ho Im & Tae Hyeong Kim & Shin Jin Kang & Chang Hun Kim, 2016. "Predicting Fluctuations in Cryptocurrency Transactions Based on User Comments and Replies," PLOS ONE, Public Library of Science, vol. 11(8), pages 1-17, August.
    4. Jordan, J S, 1983. "On the Efficient Markets Hypothesis," Econometrica, Econometric Society, vol. 51(5), pages 1325-1343, September.
    5. Stavros Stavroyiannis, 2018. "Value-at-risk and related measures for the Bitcoin," Journal of Risk Finance, Emerald Group Publishing Limited, vol. 19(2), pages 127-136, March.
    6. Werner Antweiler & Murray Z. Frank, 2004. "Is All That Talk Just Noise? The Information Content of Internet Stock Message Boards," Journal of Finance, American Finance Association, vol. 59(3), pages 1259-1294, June.
    7. Priyank Sonkiya & Vikas Bajpai & Anukriti Bansal, 2021. "Stock price prediction using BERT and GAN," Papers 2107.09055, arXiv.org.
    8. Kearney, Colm & Liu, Sha, 2014. "Textual sentiment in finance: A survey of methods and models," International Review of Financial Analysis, Elsevier, vol. 33(C), pages 171-185.
    9. Shapiro, Adam Hale & Sudhof, Moritz & Wilson, Daniel J., 2022. "Measuring news sentiment," Journal of Econometrics, Elsevier, vol. 228(2), pages 221-243.
    10. Fama, Eugene F, 1970. "Efficient Capital Markets: A Review of Theory and Empirical Work," Journal of Finance, American Finance Association, vol. 25(2), pages 383-417, May.
    11. Suhwan Ji & Jongmin Kim & Hyeonseung Im, 2019. "A Comparative Study of Bitcoin Price Prediction Using Deep Learning," Mathematics, MDPI, vol. 7(10), pages 1-20, September.
    12. Sergey Nasekin & Cathy Yi-Hsuan Chen, 2020. "Deep learning-based cryptocurrency sentiment construction," Digital Finance, Springer, vol. 2(1), pages 39-67, September.
    13. Laura Alessandretti & Abeer ElBahrawy & Luca Maria Aiello & Andrea Baronchelli, 2018. "Anticipating cryptocurrency prices using machine learning," Papers 1805.08550, arXiv.org, revised Nov 2018.
    14. Laura Alessandretti & Abeer ElBahrawy & Luca Maria Aiello & Andrea Baronchelli, 2018. "Anticipating Cryptocurrency Prices Using Machine Learning," Complexity, Hindawi, vol. 2018, pages 1-16, November.
    15. Teo Jasic & Douglas Wood, 2004. "The profitability of daily stock market indices trades based on neural network predictions: case study for the S&P 500, the DAX, the TOPIX and the FTSE in the period 1965-1999," Applied Financial Economics, Taylor & Francis Journals, vol. 14(4), pages 285-297.
    16. Pekka Malo & Ankur Sinha & Pekka Korhonen & Jyrki Wallenius & Pyry Takala, 2014. "Good debt or bad debt: Detecting semantic orientations in economic texts," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 65(4), pages 782-796, April.
    17. Kim, Thomas, 2017. "On the transaction cost of Bitcoin," Finance Research Letters, Elsevier, vol. 23(C), pages 300-305.
    18. Helder Sebastião & Pedro Godinho, 2021. "Forecasting and trading cryptocurrencies with machine learning under changing market conditions," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 7(1), pages 1-30, December.
    19. Chen, Cathy Yi-Hsuan & Després, Roméo & Guo, Li & Renault, Thomas, 2019. "What makes cryptocurrencies special? Investor sentiment and return predictability during the bubble," IRTG 1792 Discussion Papers 2019-016, Humboldt University of Berlin, International Research Training Group 1792 "High Dimensional Nonstationary Time Series".
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Yutong Quan & Xintong Wu & Wanlin Deng & Luyao Zhang, 2023. "Decoding Social Sentiment in DAO: A Comparative Analysis of Blockchain Governance Communities," Papers 2311.14676, arXiv.org.
    2. Quan, Yutong & Wu, Xintong & Deng, Wanlin & Zhang, Luyao, 2023. "Decoding Social Sentiment in DAO: A Comparative Analysis of Blockchain Governance Communities," OSF Preprints bq6tu, Center for Open Science.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ren, Yi-Shuai & Ma, Chao-Qun & Kong, Xiao-Lin & Baltas, Konstantinos & Zureigat, Qasim, 2022. "Past, present, and future of the application of machine learning in cryptocurrency research," Research in International Business and Finance, Elsevier, vol. 63(C).
    2. Bowden, James & Gemayel, Roland, 2022. "Sentiment and trading decisions in an ambiguous environment: A study on cryptocurrency traders," Journal of International Financial Markets, Institutions and Money, Elsevier, vol. 80(C).
    3. Chowdhury, Reaz & Rahman, M. Arifur & Rahman, M. Sohel & Mahdy, M.R.C., 2020. "An approach to predict and forecast the price of constituents and index of cryptocurrency using machine learning," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 551(C).
    4. Flori, Andrea, 2019. "News and subjective beliefs: A Bayesian approach to Bitcoin investments," Research in International Business and Finance, Elsevier, vol. 50(C), pages 336-356.
    5. Bekiros, Stelios D., 2010. "Heterogeneous trading strategies with adaptive fuzzy Actor-Critic reinforcement learning: A behavioral approach," Journal of Economic Dynamics and Control, Elsevier, vol. 34(6), pages 1153-1170, June.
    6. Ingrid E. Fisher & Margaret R. Garnsey & Mark E. Hughes, 2016. "Natural Language Processing in Accounting, Auditing and Finance: A Synthesis of the Literature with a Roadmap for Future Research," Intelligent Systems in Accounting, Finance and Management, John Wiley & Sons, Ltd., vol. 23(3), pages 157-214, July.
    7. Suhwan Ji & Jongmin Kim & Hyeonseung Im, 2019. "A Comparative Study of Bitcoin Price Prediction Using Deep Learning," Mathematics, MDPI, vol. 7(10), pages 1-20, September.
    8. Kizilaslan, Recep & Freund, Steven & Iseri, Ali, 2016. "A data analytic approach to forecasting daily stock returns in an emerging marketAuthor-Name: Oztekin, Asil," European Journal of Operational Research, Elsevier, vol. 253(3), pages 697-710.
    9. Sangram Keshari Jena & Aviral Kumar Tiwari & Buhari Doğan & Shawkat Hammoudeh, 2022. "Are the top six cryptocurrencies efficient? Evidence from time‐varying long memory," International Journal of Finance & Economics, John Wiley & Sons, Ltd., vol. 27(3), pages 3730-3740, July.
    10. Ankur Sinha & Satishwar Kedas & Rishu Kumar & Pekka Malo, 2022. "SEntFiN 1.0: Entity‐aware sentiment analysis for financial news," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 73(9), pages 1314-1335, September.
    11. Gil Cohen, 2022. "Algorithmic Trading and Financial Forecasting Using Advanced Artificial Intelligence Methodologies," Mathematics, MDPI, vol. 10(18), pages 1-13, September.
    12. Ahmed M. Khedr & Ifra Arif & Pravija Raj P V & Magdi El‐Bannany & Saadat M. Alhashmi & Meenu Sreedharan, 2021. "Cryptocurrency price prediction using traditional statistical and machine‐learning techniques: A survey," Intelligent Systems in Accounting, Finance and Management, John Wiley & Sons, Ltd., vol. 28(1), pages 3-34, January.
    13. Nuray Tosunoğlu & Hilal Abacı & Gizem Ateş & Neslihan Saygılı Akkaya, 2023. "Artificial neural network analysis of the day of the week anomaly in cryptocurrencies," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 9(1), pages 1-24, December.
    14. Andrea Flori, 2019. "Cryptocurrencies In Finance: Review And Applications," International Journal of Theoretical and Applied Finance (IJTAF), World Scientific Publishing Co. Pte. Ltd., vol. 22(05), pages 1-22, August.
    15. Kraaijeveld, Olivier & De Smedt, Johannes, 2020. "The predictive power of public Twitter sentiment for forecasting cryptocurrency prices," Journal of International Financial Markets, Institutions and Money, Elsevier, vol. 65(C).
    16. Silvia Bartolucci & Fabio Caccioli & Pierpaolo Vivo, 2019. "A percolation model for the emergence of the Bitcoin Lightning Network," Papers 1912.03556, arXiv.org.
    17. Abeer ElBahrawy & Laura Alessandretti & Andrea Baronchelli, 2019. "Wikipedia and Digital Currencies: Interplay Between Collective Attention and Market Performance," Papers 1902.04517, arXiv.org, revised Mar 2019.
    18. Fan Fang & Carmine Ventre & Michail Basios & Leslie Kanthan & Lingbo Li & David Martinez-Regoband & Fan Wu, 2020. "Cryptocurrency Trading: A Comprehensive Survey," Papers 2003.11352, arXiv.org, revised Jan 2022.
    19. Cheol‐Ho Park & Scott H. Irwin, 2007. "What Do We Know About The Profitability Of Technical Analysis?," Journal of Economic Surveys, Wiley Blackwell, vol. 21(4), pages 786-826, September.
    20. Wang, Yaqi & Wang, Chunfeng & Sensoy, Ahmet & Yao, Shouyu & Cheng, Feiyang, 2022. "Can investors’ informed trading predict cryptocurrency returns? Evidence from machine learning," Research in International Business and Finance, Elsevier, vol. 62(C).

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2204.05781. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.