IDEAS home Printed from https://ideas.repec.org/a/wly/jforec/v39y2020i1p104-116.html
   My bibliography  Save this article

Using social media mining technology to improve stock price forecast accuracy

Author

Listed:
  • Jia‐Yen Huang
  • Jin‐Hao Liu

Abstract

Many stock investors make investment decisions based on stock‐price‐related chip indicators. However, in addition to quantified data, financial news often has a nonnegligible impact on stock price. Nowadays, as new reviews are posted daily on social media, there may be value in using web opinions to improve the performance of stock price prediction. To this end, we use logistic regression to screen the chip indicators and establish a basic stock price prediction model. Then, we employ text mining technology to quantify the unstructured data of social media opinions on stock‐related news into sentiment scores, which are found to correlate significantly with the change extent of the stock price. Based on the findings that the higher the sentiment scores, the lower the prediction accuracy of the logistic regression model, we propose an improved prediction approach that integrates sentiment scores into the logistic regression model. Our results show that the proposed model can improve the prediction accuracy for stock prices, and can thus provide a new reference for investment strategies for stock investors.

Suggested Citation

  • Jia‐Yen Huang & Jin‐Hao Liu, 2020. "Using social media mining technology to improve stock price forecast accuracy," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 39(1), pages 104-116, January.
  • Handle: RePEc:wly:jforec:v:39:y:2020:i:1:p:104-116
    DOI: 10.1002/for.2616
    as

    Download full text from publisher

    File URL: https://doi.org/10.1002/for.2616
    Download Restriction: no

    File URL: https://libkey.io/10.1002/for.2616?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Jia-Yen Huang, 2014. "Innovative replenishment management for perishable items using logistic regression and grey analysis," International Journal of Business Performance Management, Inderscience Enterprises Ltd, vol. 15(2), pages 138-157.
    2. Liam A. Gallagher & Mark P. Taylor, 2002. "Permanent and Temporary Components of Stock Prices: Evidence from Assessing Macroeconomic Shocks," Southern Economic Journal, John Wiley & Sons, vol. 69(2), pages 345-362, October.
    3. Sanjiv R. Das & Mike Y. Chen, 2007. "Yahoo! for Amazon: Sentiment Extraction from Small Talk on the Web," Management Science, INFORMS, vol. 53(9), pages 1375-1388, September.
    4. Steve Y. Yang & Sheung Yin Kevin Mo & Anqi Liu, 2015. "Twitter financial community sentiment and its predictive relationship to stock market movement," Quantitative Finance, Taylor & Francis Journals, vol. 15(10), pages 1637-1656, October.
    5. G. Rubbaniy & Robel Asmerom & Syed Kumail Abbas Rizvi & Bushra Naqvi, 2014. "Do fear indices help predict stock returns?," Quantitative Finance, Taylor & Francis Journals, vol. 14(5), pages 831-847, May.
    6. Rechenthin, Michael & Street, W. Nick & Srinivasan, Padmini, 2013. "Stock chatter: Using stock sentiment to predict price direction," Algorithmic Finance, IOS Press, vol. 2(3-4), pages 169-196.
    7. Kanas, Angelos & Yannopoulos, Andreas, 2001. "Comparing linear and nonlinear forecasts for stock returns," International Review of Economics & Finance, Elsevier, vol. 10(4), pages 383-398, December.
    8. Zhang, Guoqiang & Eddy Patuwo, B. & Y. Hu, Michael, 1998. "Forecasting with artificial neural networks:: The state of the art," International Journal of Forecasting, Elsevier, vol. 14(1), pages 35-62, March.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Erdinc Akyildirim & Oguzhan Cepni & Shaen Corbet & Gazi Salah Uddin, 2023. "Forecasting mid-price movement of Bitcoin futures using machine learning," Annals of Operations Research, Springer, vol. 330(1), pages 553-584, November.
    2. Weiguo Zhang & Xue Gong & Chao Wang & Xin Ye, 2021. "Predicting stock market volatility based on textual sentiment: A nonlinear analysis," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 40(8), pages 1479-1500, December.
    3. Steven Buigut and Burcu Kapar, 2022. "Do COVID-19 Incidence and Government Intervention Influence Media Indices?," Bulletin of Applied Economics, Risk Market Journals, vol. 9(2), pages 79-100.
    4. Chorong Youn & Hye Jung Jung, 2021. "Semantic Network Analysis to Explore the Concept of Sustainability in the Apparel and Textile Industry," Sustainability, MDPI, vol. 13(7), pages 1-17, March.
    5. M. Eren Akbiyik & Mert Erkul & Killian Kaempf & Vaiva Vasiliauskaite & Nino Antulov-Fantulin, 2021. "Ask "Who", Not "What": Bitcoin Volatility Forecasting with Twitter Data," Papers 2110.14317, arXiv.org, revised Dec 2022.
    6. Farnoush Ronaghi & Mohammad Salimibeni & Farnoosh Naderkhani & Arash Mohammadi, 2021. "COVID19-HPSMP: COVID-19 Adopted Hybrid and Parallel Deep Information Fusion Framework for Stock Price Movement Prediction," Papers 2101.02287, arXiv.org, revised Jul 2021.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Wang, Xinjie & Xiang, Zhiqiang & Xu, Weike & Yuan, Peixuan, 2022. "The causal relationship between social media sentiment and stock return: Experimental evidence from an online message forum," Economics Letters, Elsevier, vol. 216(C).
    2. Petropoulos, Fotios & Apiletti, Daniele & Assimakopoulos, Vassilios & Babai, Mohamed Zied & Barrow, Devon K. & Ben Taieb, Souhaib & Bergmeir, Christoph & Bessa, Ricardo J. & Bijak, Jakub & Boylan, Joh, 2022. "Forecasting: theory and practice," International Journal of Forecasting, Elsevier, vol. 38(3), pages 705-871.
      • Fotios Petropoulos & Daniele Apiletti & Vassilios Assimakopoulos & Mohamed Zied Babai & Devon K. Barrow & Souhaib Ben Taieb & Christoph Bergmeir & Ricardo J. Bessa & Jakub Bijak & John E. Boylan & Jet, 2020. "Forecasting: theory and practice," Papers 2012.03854, arXiv.org, revised Jan 2022.
    3. Daniele Ballinari & Simon Behrendt, 2021. "How to gauge investor behavior? A comparison of online investor sentiment measures," Digital Finance, Springer, vol. 3(2), pages 169-204, June.
    4. Balkin, Sandy, 2001. "On Forecasting Exchange Rates Using Neural Networks: P.H. Franses and P.V. Homelen, 1998, Applied Financial Economics, 8, 589-596," International Journal of Forecasting, Elsevier, vol. 17(1), pages 139-140.
    5. Papapostolou, Nikos C. & Pouliasis, Panos K. & Nomikos, Nikos K. & Kyriakou, Ioannis, 2016. "Shipping investor sentiment and international stock return predictability," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 96(C), pages 81-94.
    6. Barrow, Devon & Kourentzes, Nikolaos, 2018. "The impact of special days in call arrivals forecasting: A neural network approach to modelling special days," European Journal of Operational Research, Elsevier, vol. 264(3), pages 967-977.
    7. Daniel Buncic, 2012. "Understanding forecast failure of ESTAR models of real exchange rates," Empirical Economics, Springer, vol. 43(1), pages 399-426, August.
    8. Tsukioka, Yasutomo & Yanagi, Junya & Takada, Teruko, 2018. "Investor sentiment extracted from internet stock message boards and IPO puzzles," International Review of Economics & Finance, Elsevier, vol. 56(C), pages 205-217.
    9. Jiao Ji & Oleksandr Talavera & Shuxing Yin, 2018. "The Hidden Information Content: Evidence from the Tone of Independent Director Reports," Working Papers 2018-28, Swansea University, School of Management.
    10. Apostolos Ampountolas & Titus Nyarko Nde & Paresh Date & Corina Constantinescu, 2021. "A Machine Learning Approach for Micro-Credit Scoring," Risks, MDPI, vol. 9(3), pages 1-20, March.
    11. Christopher N. Avery & Judith A. Chevalier & Richard J. Zeckhauser, 2016. "The "CAPS" Prediction System and Stock Market Returns," Review of Finance, European Finance Association, vol. 20(4), pages 1363-1381.
    12. Ebrahimpour, Reza & Nikoo, Hossein & Masoudnia, Saeed & Yousefi, Mohammad Reza & Ghaemi, Mohammad Sajjad, 2011. "Mixture of MLP-experts for trend forecasting of time series: A case study of the Tehran stock exchange," International Journal of Forecasting, Elsevier, vol. 27(3), pages 804-816, July.
    13. Hewamalage, Hansika & Bergmeir, Christoph & Bandara, Kasun, 2021. "Recurrent Neural Networks for Time Series Forecasting: Current status and future directions," International Journal of Forecasting, Elsevier, vol. 37(1), pages 388-427.
    14. Qingshuo Song & Qing Zhang, 2013. "An Optimal Pairs-Trading Rule," Papers 1302.6120, arXiv.org.
    15. Chen, Cathy Yi-Hsuan & Fengler, Matthias R. & Härdle, Wolfgang Karl & Liu, Yanchu, 2022. "Media-expressed tone, option characteristics, and stock return predictability," Journal of Economic Dynamics and Control, Elsevier, vol. 134(C).
    16. Leung, Philip C.M. & Lee, Eric W.M., 2013. "Estimation of electrical power consumption in subway station design by intelligent approach," Applied Energy, Elsevier, vol. 101(C), pages 634-643.
    17. Yang-Cheng Lu & Yu-Chen Wei, 2013. "The Chinese News Sentiment around Earnings Announcements," Journal for Economic Forecasting, Institute for Economic Forecasting, vol. 0(3), pages 44-58, October.
    18. Donya Rahmani & Saeed Heravi & Hossein Hassani & Mansi Ghodsi, 2016. "Forecasting time series with structural breaks with Singular Spectrum Analysis, using a general form of recurrent formula," Papers 1605.02188, arXiv.org.
    19. Khim-Yong Goh & Cheng-Suang Heng & Zhijie Lin, 2013. "Social Media Brand Community and Consumer Behavior: Quantifying the Relative Impact of User- and Marketer-Generated Content," Information Systems Research, INFORMS, vol. 24(1), pages 88-107, March.
    20. Wei Sun & Yujun He & Hong Chang, 2015. "Forecasting Fossil Fuel Energy Consumption for Power Generation Using QHSA-Based LSSVM Model," Energies, MDPI, vol. 8(2), pages 1-21, January.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:wly:jforec:v:39:y:2020:i:1:p:104-116. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www3.interscience.wiley.com/cgi-bin/jhome/2966 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.