IDEAS home Printed from https://ideas.repec.org/p/arx/papers/1801.00588.html
   My bibliography  Save this paper

Improving Stock Market Prediction via Heterogeneous Information Fusion

Author

Listed:
  • Xi Zhang
  • Yunjia Zhang
  • Senzhang Wang
  • Yuntao Yao
  • Binxing Fang
  • Philip S. Yu

Abstract

Traditional stock market prediction approaches commonly utilize the historical price-related data of the stocks to forecast their future trends. As the Web information grows, recently some works try to explore financial news to improve the prediction. Effective indicators, e.g., the events related to the stocks and the people's sentiments towards the market and stocks, have been proved to play important roles in the stocks' volatility, and are extracted to feed into the prediction models for improving the prediction accuracy. However, a major limitation of previous methods is that the indicators are obtained from only a single source whose reliability might be low, or from several data sources but their interactions and correlations among the multi-sourced data are largely ignored. In this work, we extract the events from Web news and the users' sentiments from social media, and investigate their joint impacts on the stock price movements via a coupled matrix and tensor factorization framework. Specifically, a tensor is firstly constructed to fuse heterogeneous data and capture the intrinsic relations among the events and the investors' sentiments. Due to the sparsity of the tensor, two auxiliary matrices, the stock quantitative feature matrix and the stock correlation matrix, are constructed and incorporated to assist the tensor decomposition. The intuition behind is that stocks that are highly correlated with each other tend to be affected by the same event. Thus, instead of conducting each stock prediction task separately and independently, we predict multiple correlated stocks simultaneously through their commonalities, which are enabled via sharing the collaboratively factorized low rank matrices between matrices and the tensor. Evaluations on the China A-share stock data and the HK stock data in the year 2015 demonstrate the effectiveness of the proposed model.

Suggested Citation

  • Xi Zhang & Yunjia Zhang & Senzhang Wang & Yuntao Yao & Binxing Fang & Philip S. Yu, 2018. "Improving Stock Market Prediction via Heterogeneous Information Fusion," Papers 1801.00588, arXiv.org.
  • Handle: RePEc:arx:papers:1801.00588
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/1801.00588
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Paul C. Tetlock & Maytal Saar‐Tsechansky & Sofus Macskassy, 2008. "More Than Words: Quantifying Language to Measure Firms' Fundamentals," Journal of Finance, American Finance Association, vol. 63(3), pages 1437-1467, June.
    2. Lun‐Wei Ku & Hsin‐Hsi Chen, 2007. "Mining opinions from the Web: Beyond relevance retrieval," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 58(12), pages 1838-1850, October.
    3. Fama, Eugene F. & French, Kenneth R., 1993. "Common risk factors in the returns on stocks and bonds," Journal of Financial Economics, Elsevier, vol. 33(1), pages 3-56, February.
    4. Ronny Luss & Alexandre D'Aspremont, 2015. "Predicting abnormal returns from news using text classification," Quantitative Finance, Taylor & Francis Journals, vol. 15(6), pages 999-1012, June.
    5. Chuang, Wen-I & Lee, Bong-Soo, 2006. "An empirical evaluation of the overconfidence hypothesis," Journal of Banking & Finance, Elsevier, vol. 30(9), pages 2489-2515, September.
    6. Mario A Bertella & Felipe R Pires & Ling Feng & Harry Eugene Stanley, 2014. "Confidence and the Stock Market: An Agent-Based Approach," PLOS ONE, Public Library of Science, vol. 9(1), pages 1-9, January.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Qiao Zhou & Ningning Liu, 2020. "A Stock Prediction Model Based on DCNN," Papers 2009.03239, arXiv.org.
    2. Junran Wu & Ke Xu & Jichang Zhao, 2019. "Online reviews can predict long-term returns of individual stocks," Papers 1905.03189, arXiv.org.
    3. Anna Marszal, 2022. "What news can really tell us? Evidence from a news-based sentiment index for financial markets analysis," NBP Working Papers 349, Narodowy Bank Polski.
    4. Ashwini Saini & Anoop Sharma, 2022. "Predicting the Unpredictable: An Application of Machine Learning Algorithms in Indian Stock Market," Annals of Data Science, Springer, vol. 9(4), pages 791-799, August.
    5. Fuli Feng & Huimin Chen & Xiangnan He & Ji Ding & Maosong Sun & Tat-Seng Chua, 2018. "Enhancing Stock Movement Prediction with Adversarial Training," Papers 1810.09936, arXiv.org, revised Jun 2019.
    6. Ahmet Murat Ozbayoglu & Mehmet Ugur Gudelek & Omer Berat Sezer, 2020. "Deep Learning for Financial Applications : A Survey," Papers 2002.05786, arXiv.org.
    7. Mohammad Sahabuddin & Md. Aminul Islam & Mosab I. Tabash & Suhaib Anagreh & Rozina Akter & Md. Mizanur Rahman, 2022. "Co-Movement, Portfolio Diversification, Investors’ Behavior and Psychology: Evidence from Developed and Emerging Countries’ Stock Markets," JRFM, MDPI, vol. 15(8), pages 1-15, July.
    8. Jinan Zou & Qingying Zhao & Yang Jiao & Haiyao Cao & Yanxi Liu & Qingsen Yan & Ehsan Abbasnejad & Lingqiao Liu & Javen Qinfeng Shi, 2022. "Stock Market Prediction via Deep Learning Techniques: A Survey," Papers 2212.12717, arXiv.org, revised Feb 2023.
    9. Kamaladdin Fataliyev & Aneesh Chivukula & Mukesh Prasad & Wei Liu, 2021. "Stock Market Analysis with Text Data: A Review," Papers 2106.12985, arXiv.org, revised Jul 2021.
    10. Cristescu Marian Pompiliu & Nerişanu Raluca Andreea & Mara Dumitru Alexandru, 2022. "Using Data Mining in the Sentiment Analysis Process on the Financial Market," Journal of Social and Economic Statistics, Sciendo, vol. 11(1-2), pages 36-58, December.
    11. Amit Milstein & Haoran Deng & Guy Revach & Hai Morgenstern & Nir Shlezinger, 2022. "Neural Augmented Kalman Filtering with Bollinger Bands for Pairs Trading," Papers 2210.15448, arXiv.org, revised Sep 2023.
    12. Zhou, Zhongbao & Gao, Meng & Xiao, Helu & Wang, Rui & Liu, Wenbin, 2021. "Big data and portfolio optimization: A novel approach integrating DEA with multiple data sources," Omega, Elsevier, vol. 104(C).
    13. Marian Pompiliu Cristescu & Raluca Andreea Nerisanu & Dumitru Alexandru Mara & Simona-Vasilica Oprea, 2022. "Using Market News Sentiment Analysis for Stock Market Prediction," Mathematics, MDPI, vol. 10(22), pages 1-12, November.
    14. Xi Zhang & Yixuan Li & Senzhang Wang & Binxing Fang & Philip S. Yu, 2018. "Enhancing Stock Market Prediction with Extended Coupled Hidden Markov Model over Multi-Sourced Data," Papers 1809.00306, arXiv.org.
    15. Wen, Danyan & Ma, Chaoqun & Wang, Gang-Jin & Wang, Senzhang, 2018. "Investigating the features of pairs trading strategy: A network perspective on the Chinese stock market," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 505(C), pages 903-918.
    16. Zhou, Zhongbao & Gao, Meng & Liu, Qing & Xiao, Helu, 2020. "Forecasting stock price movements with multiple data sources: Evidence from stock market in China," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 542(C).
    17. Linyi Yang & Yingpeng Ma & Yue Zhang, 2023. "Measuring Consistency in Text-based Financial Forecasting Models," Papers 2305.08524, arXiv.org, revised Jun 2023.
    18. Li-Chen Cheng & Wei-Ting Lu & Benjamin Yeo, 2023. "Predicting abnormal trading behavior from internet rumor propagation: a machine learning approach," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 9(1), pages 1-23, December.
    19. Tasnim M. A. Zayet & Maizatul Akmar Ismail & Kasturi Dewi Varathan & Rafidah M. D. Noor & Hui Na Chua & Angela Lee & Yeh Ching Low & Sheena Kaur Jaswant Singh, 2021. "Investigating transportation research based on social media analysis: a systematic mapping review," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(8), pages 6383-6421, August.
    20. Omer Berat Sezer & Mehmet Ugur Gudelek & Ahmet Murat Ozbayoglu, 2019. "Financial Time Series Forecasting with Deep Learning : A Systematic Literature Review: 2005-2019," Papers 1911.13288, arXiv.org.
    21. Karolina Sowinska & Pranava Madhyastha, 2020. "A Tweet-based Dataset for Company-Level Stock Return Prediction," Papers 2006.09723, arXiv.org.
    22. Yu Zhao & Huaming Du & Ying Liu & Shaopeng Wei & Xingyan Chen & Fuzhen Zhuang & Qing Li & Ji Liu & Gang Kou, 2022. "Stock Movement Prediction Based on Bi-typed Hybrid-relational Market Knowledge Graph via Dual Attention Networks," Papers 2201.04965, arXiv.org, revised Jan 2022.
    23. Francisco de Arriba-P'erez & Silvia Garc'ia-M'endez & Jos'e A. Regueiro-Janeiro & Francisco J. Gonz'alez-Casta~no, 2024. "Detection of financial opportunities in micro-blogging data with a stacked classification system," Papers 2404.07224, arXiv.org.
    24. Rosdyana Mangir Irawan Kusuma & Trang-Thi Ho & Wei-Chun Kao & Yu-Yen Ou & Kai-Lung Hua, 2019. "Using Deep Learning Neural Networks and Candlestick Chart Representation to Predict Stock Market," Papers 1903.12258, arXiv.org.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. David E. Allen & Michael McAleer & Abhay K. Singh, 2019. "Daily market news sentiment and stock prices," Applied Economics, Taylor & Francis Journals, vol. 51(30), pages 3212-3235, June.
    2. Ferdinand Graf, 2011. "Mechanically Extracted Company Signals and their Impact on Stock and Credit Markets," Working Paper Series of the Department of Economics, University of Konstanz 2011-18, Department of Economics, University of Konstanz.
    3. Xiaofei Zhao, 2017. "Does Information Intensity Matter for Stock Returns? Evidence from Form 8-K Filings," Management Science, INFORMS, vol. 63(5), pages 1382-1404, May.
    4. Lixing Mei & Yulei Rao & Mei Wang & Jianxin Wang, 2019. "Do investors post messages differently from mobile devices? The correlation between mobile Internet messages posting and stock returns," International Review of Economics, Springer;Happiness Economics and Interpersonal Relations (HEIRS), vol. 66(4), pages 423-452, December.
    5. Ahmad, Khurshid & Han, JingGuang & Hutson, Elaine & Kearney, Colm & Liu, Sha, 2016. "Media-expressed negative tone and firm-level stock returns," Journal of Corporate Finance, Elsevier, vol. 37(C), pages 152-172.
    6. Wu, Chunying & Xiong, Xiong & Gao, Ya, 2022. "The role of different information sources in information spread: Evidence from three media channels in China," International Review of Economics & Finance, Elsevier, vol. 80(C), pages 327-341.
    7. Al-Nasseri, Alya & Menla Ali, Faek & Tucker, Allan, 2021. "Investor sentiment and the dispersion of stock returns: Evidence based on the social network of investors," International Review of Financial Analysis, Elsevier, vol. 78(C).
    8. Chong, Terence Tai Leung & Wu, Zhang & Liu, Yuchen, 2019. "Market Reaction to iPhone Rumors," MPRA Paper 92014, University Library of Munich, Germany.
    9. Vegard Høghaug Larsen & Leif Anders Thorsrud, 2022. "Asset returns, news topics, and media effects," Scandinavian Journal of Economics, Wiley Blackwell, vol. 124(3), pages 838-868, July.
    10. Huang, Tzu-Lun, 2018. "The puzzling media effect in the Chinese stock market," Pacific-Basin Finance Journal, Elsevier, vol. 49(C), pages 129-146.
    11. Asem, Ebenezer, 2009. "Dividends and price momentum," Journal of Banking & Finance, Elsevier, vol. 33(3), pages 486-494, March.
    12. Chen, Zilin & Guo, Li & Tu, Jun, 2021. "Media connection and return comovement," Journal of Economic Dynamics and Control, Elsevier, vol. 130(C).
    13. David F. Larcker & Anastasia A. Zakolyukina, 2012. "Detecting Deceptive Discussions in Conference Calls," Journal of Accounting Research, Wiley Blackwell, vol. 50(2), pages 495-540, May.
    14. Seshadri Tirunillai & Gerard J. Tellis, 2012. "Does Chatter Really Matter? Dynamics of User-Generated Content and Stock Performance," Marketing Science, INFORMS, vol. 31(2), pages 198-215, March.
    15. Xing Wang & Yijun Wang & Bin Weng & Aleksandr Vinel, 2020. "Stock2Vec: A Hybrid Deep Learning Framework for Stock Market Prediction with Representation Learning and Temporal Convolutional Network," Papers 2010.01197, arXiv.org.
    16. Du, Hanyu & Hao, Jing & He, Feng & Xi, Wenze, 2022. "Media sentiment and cross-sectional stock returns in the Chinese stock market," Research in International Business and Finance, Elsevier, vol. 60(C).
    17. Heusel, Nicola & Mager, Ferdinand, 2023. "Pension funding and the cross section of stock returns - The case of Germany," Journal of Banking & Finance, Elsevier, vol. 150(C).
    18. Narayan, Paresh Kumar, 2019. "Can stale oil price news predict stock returns?," Energy Economics, Elsevier, vol. 83(C), pages 430-444.
    19. Patrick Houlihan & Germán G. Creamer, 2021. "Leveraging Social Media to Predict Continuation and Reversal in Asset Prices," Computational Economics, Springer;Society for Computational Economics, vol. 57(2), pages 433-453, February.
    20. Angelos J. Doukas & Jie (Michael) Guo & Herbert Y. T. Lam & Sarah (Hong) Xiao, 2016. "Media Endorsements of New Product Announcements: A New Marketing Strategy," European Financial Management, European Financial Management Association, vol. 22(3), pages 394-426, June.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:1801.00588. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.