IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2006.09723.html
   My bibliography  Save this paper

A Tweet-based Dataset for Company-Level Stock Return Prediction

Author

Listed:
  • Karolina Sowinska
  • Pranava Madhyastha

Abstract

Public opinion influences events, especially related to stock market movement, in which a subtle hint can influence the local outcome of the market. In this paper, we present a dataset that allows for company-level analysis of tweet based impact on one-, two-, three-, and seven-day stock returns. Our dataset consists of 862, 231 labelled instances from twitter in English, we also release a cleaned subset of 85, 176 labelled instances to the community. We also provide baselines using standard machine learning algorithms and a multi-view learning based approach that makes use of different types of features. Our dataset, scripts and models are publicly available at: https://github.com/ImperialNLP/stockreturnpred.

Suggested Citation

  • Karolina Sowinska & Pranava Madhyastha, 2020. "A Tweet-based Dataset for Company-Level Stock Return Prediction," Papers 2006.09723, arXiv.org.
  • Handle: RePEc:arx:papers:2006.09723
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2006.09723
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Xi Zhang & Yunjia Zhang & Senzhang Wang & Yuntao Yao & Binxing Fang & Philip S. Yu, 2018. "Improving Stock Market Prediction via Heterogeneous Information Fusion," Papers 1801.00588, arXiv.org.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Kamaladdin Fataliyev & Aneesh Chivukula & Mukesh Prasad & Wei Liu, 2021. "Stock Market Analysis with Text Data: A Review," Papers 2106.12985, arXiv.org, revised Jul 2021.
    2. Linyi Yang & Yingpeng Ma & Yue Zhang, 2023. "Measuring Consistency in Text-based Financial Forecasting Models," Papers 2305.08524, arXiv.org, revised Jun 2023.
    3. Zhou, Zhongbao & Gao, Meng & Liu, Qing & Xiao, Helu, 2020. "Forecasting stock price movements with multiple data sources: Evidence from stock market in China," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 542(C).
    4. Ashwini Saini & Anoop Sharma, 2022. "Predicting the Unpredictable: An Application of Machine Learning Algorithms in Indian Stock Market," Annals of Data Science, Springer, vol. 9(4), pages 791-799, August.
    5. Li-Chen Cheng & Wei-Ting Lu & Benjamin Yeo, 2023. "Predicting abnormal trading behavior from internet rumor propagation: a machine learning approach," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 9(1), pages 1-23, December.
    6. Zhou, Zhongbao & Gao, Meng & Xiao, Helu & Wang, Rui & Liu, Wenbin, 2021. "Big data and portfolio optimization: A novel approach integrating DEA with multiple data sources," Omega, Elsevier, vol. 104(C).
    7. Francisco de Arriba-P'erez & Silvia Garc'ia-M'endez & Jos'e A. Regueiro-Janeiro & Francisco J. Gonz'alez-Casta~no, 2024. "Detection of financial opportunities in micro-blogging data with a stacked classification system," Papers 2404.07224, arXiv.org.
    8. Omer Berat Sezer & Mehmet Ugur Gudelek & Ahmet Murat Ozbayoglu, 2019. "Financial Time Series Forecasting with Deep Learning : A Systematic Literature Review: 2005-2019," Papers 1911.13288, arXiv.org.
    9. Wen, Danyan & Ma, Chaoqun & Wang, Gang-Jin & Wang, Senzhang, 2018. "Investigating the features of pairs trading strategy: A network perspective on the Chinese stock market," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 505(C), pages 903-918.
    10. Rosdyana Mangir Irawan Kusuma & Trang-Thi Ho & Wei-Chun Kao & Yu-Yen Ou & Kai-Lung Hua, 2019. "Using Deep Learning Neural Networks and Candlestick Chart Representation to Predict Stock Market," Papers 1903.12258, arXiv.org.
    11. Qiao Zhou & Ningning Liu, 2020. "A Stock Prediction Model Based on DCNN," Papers 2009.03239, arXiv.org.
    12. Junran Wu & Ke Xu & Jichang Zhao, 2019. "Online reviews can predict long-term returns of individual stocks," Papers 1905.03189, arXiv.org.
    13. Amit Milstein & Haoran Deng & Guy Revach & Hai Morgenstern & Nir Shlezinger, 2022. "Neural Augmented Kalman Filtering with Bollinger Bands for Pairs Trading," Papers 2210.15448, arXiv.org, revised Sep 2023.
    14. Mohammad Sahabuddin & Md. Aminul Islam & Mosab I. Tabash & Suhaib Anagreh & Rozina Akter & Md. Mizanur Rahman, 2022. "Co-Movement, Portfolio Diversification, Investors’ Behavior and Psychology: Evidence from Developed and Emerging Countries’ Stock Markets," JRFM, MDPI, vol. 15(8), pages 1-15, July.
    15. Xi Zhang & Yixuan Li & Senzhang Wang & Binxing Fang & Philip S. Yu, 2018. "Enhancing Stock Market Prediction with Extended Coupled Hidden Markov Model over Multi-Sourced Data," Papers 1809.00306, arXiv.org.
    16. Ahmet Murat Ozbayoglu & Mehmet Ugur Gudelek & Omer Berat Sezer, 2020. "Deep Learning for Financial Applications : A Survey," Papers 2002.05786, arXiv.org.
    17. Marian Pompiliu Cristescu & Raluca Andreea Nerisanu & Dumitru Alexandru Mara & Simona-Vasilica Oprea, 2022. "Using Market News Sentiment Analysis for Stock Market Prediction," Mathematics, MDPI, vol. 10(22), pages 1-12, November.
    18. Tasnim M. A. Zayet & Maizatul Akmar Ismail & Kasturi Dewi Varathan & Rafidah M. D. Noor & Hui Na Chua & Angela Lee & Yeh Ching Low & Sheena Kaur Jaswant Singh, 2021. "Investigating transportation research based on social media analysis: a systematic mapping review," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(8), pages 6383-6421, August.
    19. Fuli Feng & Huimin Chen & Xiangnan He & Ji Ding & Maosong Sun & Tat-Seng Chua, 2018. "Enhancing Stock Movement Prediction with Adversarial Training," Papers 1810.09936, arXiv.org, revised Jun 2019.
    20. Yu Zhao & Huaming Du & Ying Liu & Shaopeng Wei & Xingyan Chen & Fuzhen Zhuang & Qing Li & Ji Liu & Gang Kou, 2022. "Stock Movement Prediction Based on Bi-typed Hybrid-relational Market Knowledge Graph via Dual Attention Networks," Papers 2201.04965, arXiv.org, revised Jan 2022.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2006.09723. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.