IDEAS home Printed from https://ideas.repec.org/p/iim/iimawp/14607.html
   My bibliography  Save this paper

Buy, Sell or Hold: Entity-Aware Classification of Business News

Author

Listed:
  • Sinha, Ankur
  • Kedas, Satishwar
  • Kumar, Rishu
  • Malo, Pekka

Abstract

Financial sector is expected to be at the forefront of the adoption of machine learning methods, driven by the superior performance of the data-driven approaches over traditional modelling approaches. There has been a widespread interest in automatically extracting information from financial news flow as the signals might be useful for investment decisions. While quantitative finance focuses on analysis of structured financial data for investment decisions, the potential of utilizing unstructured news flow in decision making is not fully tapped. Research in financial news analytics tries to address this gap by detecting events and aspects that provide buy, sell or hold information in news, commonly interpreted as financial sentiments. In this paper, we develop a framework utilizing information theoretic concepts and machine learning methods that understands the context and is capable of extracting buy, sell or hold information contained within news headlines. The proposed framework is also capable of detecting conflicting sentiments on multiple companies within the same news headline, which to our best knowledge has not been studied earlier. Further, we develop an information system which analyzes the news flow in real-time, allowing users to track financial sentiments by company, sector and index via a dashboard. Through this study we make three dataset related contributions - firstly, a training dataset consisting of more than 12,000 news headlines annotated for entities and their relevant financial sentiments by multiple annotators, secondly, an entity database of over 1,000 financial and economic entities relevant to Indian economy and their forms of appearance in news media amounting to over 5,000 phrases and thirdly, make improvements in existing financial dictionaries. Using the proposed system, we study the effect of the information derived from daily news flow in the years 2012 to 2017, over the Indian broad market equity index NSE 500, and show that the information has predictive value.

Suggested Citation

  • Sinha, Ankur & Kedas, Satishwar & Kumar, Rishu & Malo, Pekka, 2019. "Buy, Sell or Hold: Entity-Aware Classification of Business News," IIMA Working Papers WP 2019-04-02, Indian Institute of Management Ahmedabad, Research and Publication Department.
  • Handle: RePEc:iim:iimawp:14607
    as

    Download full text from publisher

    File URL: https://www.iima.ac.in/sites/default/files/rnpfiles/18844866432019-04-02.pdf
    File Function: English Version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Warner, Jerold B. & Watts, Ross L. & Wruck, Karen H., 1988. "Stock prices and top management changes," Journal of Financial Economics, Elsevier, vol. 20(1-2), pages 461-492, January.
    2. Plakandaras, Vasilios & Gupta, Rangan & Gogas, Periklis & Papadimitriou, Theophilos, 2015. "Forecasting the U.S. real house price index," Economic Modelling, Elsevier, vol. 45(C), pages 259-267.
    3. Chambers, Ae & Penman, Sh, 1984. "Timeliness Of Reporting And The Stock-Price Reaction To Earnings Announcements," Journal of Accounting Research, Wiley Blackwell, vol. 22(1), pages 21-47.
    4. Khandani, Amir E. & Kim, Adlar J. & Lo, Andrew W., 2010. "Consumer credit-risk models via machine-learning algorithms," Journal of Banking & Finance, Elsevier, vol. 34(11), pages 2767-2787, November.
    5. Rangan Gupta & Anandamayee Majumdar, 2015. "Forecasting US real house price returns over 1831-2013: evidence from copula models," Applied Economics, Taylor & Francis Journals, vol. 47(48), pages 5204-5213, October.
    6. Gabriele Ranco & Darko Aleksovski & Guido Caldarelli & Miha Grčar & Igor Mozetič, 2015. "The Effects of Twitter Sentiment on Stock Price Returns," PLOS ONE, Public Library of Science, vol. 10(9), pages 1-21, September.
    7. Pekka Malo & Ankur Sinha & Pekka Korhonen & Jyrki Wallenius & Pyry Takala, 2014. "Good debt or bad debt: Detecting semantic orientations in economic texts," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 65(4), pages 782-796, April.
    8. Paul C. Tetlock, 2007. "Giving Content to Investor Sentiment: The Role of Media in the Stock Market," Journal of Finance, American Finance Association, vol. 62(3), pages 1139-1168, June.
    9. Werner Antweiler & Murray Z. Frank, 2004. "Is All That Talk Just Noise? The Information Content of Internet Stock Message Boards," Journal of Finance, American Finance Association, vol. 59(3), pages 1259-1294, June.
    10. JaeHong Park & Prabhudev Konana & Bin Gu & Alok Kumar & Rajagopal Raghunathan, 2013. "Information Valuation and Confirmation Bias in Virtual Communities: Evidence from Stock Message Boards," Information Systems Research, INFORMS, vol. 24(4), pages 1050-1067, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ankur Sinha & Satishwar Kedas & Rishu Kumar & Pekka Malo, 2022. "SEntFiN 1.0: Entity‐aware sentiment analysis for financial news," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 73(9), pages 1314-1335, September.
    2. Femg, Xunan & Johansson, Anders C., 2019. "News or Noise? The Information Content of Social Media in China," Stockholm School of Economics Asia Working Paper Series 2019-52, Stockholm School of Economics, Stockholm China Economic Research Institute.
    3. Chen, Cathy Yi-Hsuan & Fengler, Matthias R. & Härdle, Wolfgang Karl & Liu, Yanchu, 2022. "Media-expressed tone, option characteristics, and stock return predictability," Journal of Economic Dynamics and Control, Elsevier, vol. 134(C).
    4. Thomas Renault, 2020. "Sentiment analysis and machine learning in finance: a comparison of methods and models on one million messages," Digital Finance, Springer, vol. 2(1), pages 1-13, September.
    5. Marlene Amstad & Leonardo Gambacorta & Chao He & Dora Xia, 2021. "Trade sentiment and the stock market: new evidence based on big data textual analysis of Chinese media," BIS Working Papers 917, Bank for International Settlements.
    6. Steven Heston & Nitish R. Sinha, 2016. "News versus Sentiment : Predicting Stock Returns from News Stories," Finance and Economics Discussion Series 2016-048, Board of Governors of the Federal Reserve System (U.S.).
    7. Arcuri, Maria Cristina & Gandolfi, Gino & Russo, Ivan, 2023. "Does fake news impact stock returns? Evidence from US and EU stock markets," Journal of Economics and Business, Elsevier, vol. 125.
    8. Ingrid E. Fisher & Margaret R. Garnsey & Mark E. Hughes, 2016. "Natural Language Processing in Accounting, Auditing and Finance: A Synthesis of the Literature with a Roadmap for Future Research," Intelligent Systems in Accounting, Finance and Management, John Wiley & Sons, Ltd., vol. 23(3), pages 157-214, July.
    9. Costola, Michele & Hinz, Oliver & Nofer, Michael & Pelizzon, Loriana, 2023. "Machine learning sentiment analysis, COVID-19 news and stock market reactions," Research in International Business and Finance, Elsevier, vol. 64(C).
    10. Renault, Thomas, 2017. "Intraday online investor sentiment and return patterns in the U.S. stock market," Journal of Banking & Finance, Elsevier, vol. 84(C), pages 25-40.
    11. Fiordelisi, Franco & Ricci, Ornella, 2014. "Corporate culture and CEO turnover," Journal of Corporate Finance, Elsevier, vol. 28(C), pages 66-82.
    12. Jiang, Zhe & Zhang, Lin & Zhang, Lingling & Wen, Bo, 2022. "Investor sentiment and machine learning: Predicting the price of China's crude oil futures market," Energy, Elsevier, vol. 247(C).
    13. Szymon Lis, 2022. "Investor Sentiment in Asset Pricing Models: A Review," Working Papers 2022-14, Faculty of Economic Sciences, University of Warsaw.
    14. Li, Xiao, 2020. "When financial literacy meets textual analysis: A conceptual review," Journal of Behavioral and Experimental Finance, Elsevier, vol. 28(C).
    15. Yi-Hsuan Chen, Cathy & Fengler, Matthias & Härdle, Wolfgang Karl & Liu, Yanchu, 2018. "Textual Sentiment, Option Characteristics, and Stock Return Predictability," Economics Working Paper Series 1808, University of St. Gallen, School of Economics and Political Science.
    16. Jozef Barunik & Cathy Yi-Hsuan Chen & Jan Vecer, 2019. "Sentiment-Driven Stochastic Volatility Model: A High-Frequency Textual Tool for Economists," Papers 1906.00059, arXiv.org.
    17. Jie Ren & Hang Dong & Balaji Padmanabhan & Jeffrey V. Nickerson, 2021. "How does social media sentiment impact mass media sentiment? A study of news in the financial markets," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 72(9), pages 1183-1197, September.
    18. Enwei Zhu & Jing Wu & Hongyu Liu & Keyang Li, 2023. "A Sentiment Index of the Housing Market in China: Text Mining of Narratives on Social Media," The Journal of Real Estate Finance and Economics, Springer, vol. 66(1), pages 77-118, January.
    19. Arezoo Hatefi Ghahfarrokhi & Mehrnoush Shamsfard, 2019. "Tehran Stock Exchange Prediction Using Sentiment Analysis of Online Textual Opinions," Papers 1909.03792, arXiv.org, revised Sep 2019.
    20. Rui Fan & Oleksandr Talavera & Vu Tran, 2020. "Social media bots and stock markets," European Financial Management, European Financial Management Association, vol. 26(3), pages 753-777, June.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:iim:iimawp:14607. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: the person in charge (email available below). General contact details of provider: https://edirc.repec.org/data/eciimin.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.