IDEAS home Printed from https://ideas.repec.org/p/arx/papers/1809.00306.html
   My bibliography  Save this paper

Enhancing Stock Market Prediction with Extended Coupled Hidden Markov Model over Multi-Sourced Data

Author

Listed:
  • Xi Zhang
  • Yixuan Li
  • Senzhang Wang
  • Binxing Fang
  • Philip S. Yu

Abstract

Traditional stock market prediction methods commonly only utilize the historical trading data, ignoring the fact that stock market fluctuations can be impacted by various other information sources such as stock related events. Although some recent works propose event-driven prediction approaches by considering the event data, how to leverage the joint impacts of multiple data sources still remains an open research problem. In this work, we study how to explore multiple data sources to improve the performance of the stock prediction. We introduce an Extended Coupled Hidden Markov Model incorporating the news events with the historical trading data. To address the data sparsity issue of news events for each single stock, we further study the fluctuation correlations between the stocks and incorporate the correlations into the model to facilitate the prediction task. Evaluations on China A-share market data in 2016 show the superior performance of our model against previous methods.

Suggested Citation

  • Xi Zhang & Yixuan Li & Senzhang Wang & Binxing Fang & Philip S. Yu, 2018. "Enhancing Stock Market Prediction with Extended Coupled Hidden Markov Model over Multi-Sourced Data," Papers 1809.00306, arXiv.org.
  • Handle: RePEc:arx:papers:1809.00306
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/1809.00306
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Paul C. Tetlock & Maytal Saar‐Tsechansky & Sofus Macskassy, 2008. "More Than Words: Quantifying Language to Measure Firms' Fundamentals," Journal of Finance, American Finance Association, vol. 63(3), pages 1437-1467, June.
    2. Sanjiv R. Das & Mike Y. Chen, 2007. "Yahoo! for Amazon: Sentiment Extraction from Small Talk on the Web," Management Science, INFORMS, vol. 53(9), pages 1375-1388, September.
    3. Paul C. Tetlock, 2007. "Giving Content to Investor Sentiment: The Role of Media in the Stock Market," Journal of Finance, American Finance Association, vol. 62(3), pages 1139-1168, June.
    4. Xi Zhang & Yunjia Zhang & Senzhang Wang & Yuntao Yao & Binxing Fang & Philip S. Yu, 2018. "Improving Stock Market Prediction via Heterogeneous Information Fusion," Papers 1801.00588, arXiv.org.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Thomas Dierckx & Jesse Davis & Wim Schoutens, 2020. "Using Machine Learning and Alternative Data to Predict Movements in Market Risk," Papers 2009.07947, arXiv.org.
    2. Ashish Kumar & Abeer Alsadoon & P. W. C. Prasad & Salma Abdullah & Tarik A. Rashid & Duong Thu Hang Pham & Tran Quoc Vinh Nguyen, 2021. "Generative Adversarial Network (GAN) and Enhanced Root Mean Square Error (ERMSE): Deep Learning for Stock Price Movement Prediction," Papers 2112.03946, arXiv.org.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Kamaladdin Fataliyev & Aneesh Chivukula & Mukesh Prasad & Wei Liu, 2021. "Stock Market Analysis with Text Data: A Review," Papers 2106.12985, arXiv.org, revised Jul 2021.
    2. Yang-Cheng Lu & Yu-Chen Wei, 2013. "The Chinese News Sentiment around Earnings Announcements," Journal for Economic Forecasting, Institute for Economic Forecasting, vol. 0(3), pages 44-58, October.
    3. Thomas Renault, 2020. "Sentiment analysis and machine learning in finance: a comparison of methods and models on one million messages," Digital Finance, Springer, vol. 2(1), pages 1-13, September.
    4. Ahmad, Khurshid & Han, JingGuang & Hutson, Elaine & Kearney, Colm & Liu, Sha, 2016. "Media-expressed negative tone and firm-level stock returns," Journal of Corporate Finance, Elsevier, vol. 37(C), pages 152-172.
    5. Paul Brockman & Jim Cicon, 2013. "The Information Content Of Management Earnings Forecasts: An Analysis Of Hard Versus Soft Information," Journal of Financial Research, Southern Finance Association;Southwestern Finance Association, vol. 36(2), pages 147-174, June.
    6. Steven Heston & Nitish R. Sinha, 2016. "News versus Sentiment : Predicting Stock Returns from News Stories," Finance and Economics Discussion Series 2016-048, Board of Governors of the Federal Reserve System (U.S.).
    7. Chong, Terence Tai Leung & Wu, Zhang & Liu, Yuchen, 2019. "Market Reaction to iPhone Rumors," MPRA Paper 92014, University Library of Munich, Germany.
    8. Eric. W. K. See-To & Yang Yang, 2017. "Market sentiment dispersion and its effects on stock return and volatility," Electronic Markets, Springer;IIM University of St. Gallen, vol. 27(3), pages 283-296, August.
    9. Steffen Nauhaus & Johannes Luger & Sebastian Raisch, 2021. "Strategic Decision Making in the Digital Age: Expert Sentiment and Corporate Capital Allocation," Journal of Management Studies, Wiley Blackwell, vol. 58(7), pages 1933-1961, November.
    10. Xin Li & Kun Chen & Sherry X. Sun & Terrance Fung & Huaiqing Wang & Daniel D. Zeng, 2016. "A Commonsense Knowledge-Enabled Textual Analysis Approach for Financial Market Surveillance," INFORMS Journal on Computing, INFORMS, vol. 28(2), pages 278-294, May.
    11. Jacob Boudoukh & Ronen Feldman & Shimon Kogan & Matthew Richardson, 2013. "Which News Moves Stock Prices? A Textual Analysis," NBER Working Papers 18725, National Bureau of Economic Research, Inc.
    12. David F. Larcker & Anastasia A. Zakolyukina, 2012. "Detecting Deceptive Discussions in Conference Calls," Journal of Accounting Research, Wiley Blackwell, vol. 50(2), pages 495-540, May.
    13. Miwa, Kotaro, 2023. "Divergent opinions on social media," International Review of Economics & Finance, Elsevier, vol. 86(C), pages 182-196.
    14. Goodell, John W. & Kumar, Satish & Lim, Weng Marc & Pattnaik, Debidutta, 2021. "Artificial intelligence and machine learning in finance: Identifying foundations, themes, and research clusters from bibliometric analysis," Journal of Behavioral and Experimental Finance, Elsevier, vol. 32(C).
    15. Ingrid E. Fisher & Margaret R. Garnsey & Mark E. Hughes, 2016. "Natural Language Processing in Accounting, Auditing and Finance: A Synthesis of the Literature with a Roadmap for Future Research," Intelligent Systems in Accounting, Finance and Management, John Wiley & Sons, Ltd., vol. 23(3), pages 157-214, July.
    16. Seshadri Tirunillai & Gerard J. Tellis, 2012. "Does Chatter Really Matter? Dynamics of User-Generated Content and Stock Performance," Marketing Science, INFORMS, vol. 31(2), pages 198-215, March.
    17. Mao, Huina & Counts, Scott & Bollen, Johan, 2015. "Quantifying the effects of online bullishness on international financial markets," Statistics Paper Series 09, European Central Bank.
    18. Qingbin Meng & Congyi Ju & Qinghua Huang & Song Wang, 2023. "The informativeness of investor communication with corporate insiders: Evidence from China," International Finance, Wiley Blackwell, vol. 26(2), pages 189-207, August.
    19. Loughran, Tim & McDonald, Bill & Pragidis, Ioannis, 2019. "Assimilation of oil news into prices," International Review of Financial Analysis, Elsevier, vol. 63(C), pages 105-118.
    20. Renault, Thomas, 2017. "Intraday online investor sentiment and return patterns in the U.S. stock market," Journal of Banking & Finance, Elsevier, vol. 84(C), pages 25-40.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:1809.00306. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.