
BizPro: Extracting and categorizing business intelligence factors from textual news articles

Author

Listed:
  • Chung, Wingyan

Abstract

Company movements and market changes often are headlines of the news, providing managers with important business intelligence (BI). While existing corporate analyses are often based on numerical financial figures, relatively little work has been done to reveal from textual news articles factors that represent BI. In this research, we developed BizPro, an intelligent system for extracting and categorizing BI factors from news articles. BizPro consists of novel text mining procedures and BI factor modeling and categorization. Expert guidance and human knowledge (with high inter-rater reliability) were used to inform system development and profiling of BI factors. We conducted a case study of using the system to profile BI factors of four major IT companies based on 6859 sentences extracted from 231 news articles published in major news sources. The results show that the chosen techniques used in BizPro – Naïve Bayes (NB) and Logistic Regression (LR) – significantly outperformed a benchmark technique. NB was found to outperform LR in terms of precision, recall, F-measure, and area under ROC curve. This research contributes to developing a new system for profiling company BI factors from news articles, to providing new empirical findings to enhance understanding in BI factor extraction and categorization, and to addressing an important yet under-explored concern of BI analysis.
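
As a rough illustration of the kind of sentence-level categorization the abstract describes, the sketch below trains a multinomial Naive Bayes classifier on a handful of made-up sentences and scores it with precision, recall, and F-measure. It is not the BizPro implementation: the category names (`product`, `finance`), the example sentences, the regex tokenizer, and the add-one smoothing are all assumptions chosen for illustration, since this record does not give the paper's features or BI factor categories.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(sentence):
    # Hypothetical tokenizer: lowercase alphabetic words only.
    return re.findall(r"[a-z]+", sentence.lower())


class NaiveBayes:
    """Multinomial Naive Bayes with add-one (Laplace) smoothing."""

    def fit(self, sentences, labels):
        self.class_counts = Counter(labels)
        self.word_counts = defaultdict(Counter)
        self.vocab = set()
        for sentence, label in zip(sentences, labels):
            for word in tokenize(sentence):
                self.word_counts[label][word] += 1
                self.vocab.add(word)
        return self

    def predict(self, sentence):
        total = sum(self.class_counts.values())
        best_label, best_score = None, -math.inf
        for label, count in self.class_counts.items():
            # Log prior plus the sum of smoothed log likelihoods.
            score = math.log(count / total)
            denom = sum(self.word_counts[label].values()) + len(self.vocab)
            for word in tokenize(sentence):
                if word in self.vocab:  # words unseen in training are skipped
                    score += math.log((self.word_counts[label][word] + 1) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


def precision_recall_f1(gold, predicted, positive):
    # One-vs-rest metrics for a single category.
    pairs = list(zip(gold, predicted))
    tp = sum(g == positive and p == positive for g, p in pairs)
    fp = sum(g != positive and p == positive for g, p in pairs)
    fn = sum(g == positive and p != positive for g, p in pairs)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return precision, recall, f1


# Made-up training sentences with hypothetical category labels.
train_sentences = [
    "The company launched a new tablet product",
    "The firm released a new phone and software update",
    "Quarterly revenue and profit rose sharply",
    "The company reported lower profit and revenue",
]
train_labels = ["product", "product", "finance", "finance"]

test_sentences = ["New software product launched", "Profit rose this quarter"]
test_labels = ["product", "finance"]

model = NaiveBayes().fit(train_sentences, train_labels)
predictions = [model.predict(s) for s in test_sentences]
print(predictions)
print(precision_recall_f1(test_labels, predictions, "product"))
```

With only four training sentences the scores say nothing about real performance; the point is the shape of the pipeline (tokenize, estimate per-category word probabilities, pick the most probable category, then score against labeled sentences).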

Suggested Citation

  • Chung, Wingyan, 2014. "BizPro: Extracting and categorizing business intelligence factors from textual news articles," International Journal of Information Management, Elsevier, vol. 34(2), pages 272-284.
  • Handle: RePEc:eee:ininma:v:34:y:2014:i:2:p:272-284
    DOI: 10.1016/j.ijinfomgt.2014.01.001

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0268401214000024
    Download Restriction: Full text for ScienceDirect subscribers only

    References listed on IDEAS

    1. Chung, Wingyan, 2012. "Managing web repositories in emerging economies: Case studies of browsing web directories," International Journal of Information Management, Elsevier, vol. 32(3), pages 232-238.
    2. Abraham, Santhosh & Cox, Paul, 2007. "Analysing the determinants of narrative risk information in UK FTSE 100 annual reports," The British Accounting Review, Elsevier, vol. 39(3), pages 227-248.
    3. Li, Feng, 2008. "Annual report readability, current earnings, and earnings persistence," Journal of Accounting and Economics, Elsevier, vol. 45(2-3), pages 221-247, August.
    4. Paul C. Tetlock & Maytal Saar‐Tsechansky & Sofus Macskassy, 2008. "More Than Words: Quantifying Language to Measure Firms' Fundamentals," Journal of Finance, American Finance Association, vol. 63(3), pages 1437-1467, June.
    5. Linsley, Philip M. & Shrives, Philip J., 2006. "Risk reporting: A study of risk disclosures in the annual reports of UK companies," The British Accounting Review, Elsevier, vol. 38(4), pages 387-404.
    6. G. Salton & C. S. Yang & C. T. Yu, 1975. "A theory of term importance in automatic text analysis," Journal of the American Society for Information Science, Association for Information Science & Technology, vol. 26(1), pages 33-44, January.
    7. Paul C. Tetlock, 2007. "Giving Content to Investor Sentiment: The Role of Media in the Stock Market," Journal of Finance, American Finance Association, vol. 62(3), pages 1139-1168, June.
    8. Antonina Kloptchenko & Tomas Eklund & Jonas Karlsson & Barbro Back & Hannu Vanharanta & Ari Visa, 2004. "Combining data and text mining techniques for analysing financial reports," Intelligent Systems in Accounting, Finance and Management, John Wiley & Sons, Ltd., vol. 12(1), pages 29-41, January.
    9. S. le Cessie & J. C. van Houwelingen, 1992. "Ridge Estimators in Logistic Regression," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 41(1), pages 191-201, March.
    10. Balakrishnan, Ramji & Qiu, Xin Ying & Srinivasan, Padmini, 2010. "On the predictive ability of narrative disclosures in annual reports," European Journal of Operational Research, Elsevier, vol. 202(3), pages 789-801, May.

    Citations


    Cited by:

    1. Ransome Epie Bawack & Samuel Fosso Wamba & Kevin Daniel André Carillo & Shahriar Akter, 2022. "Artificial intelligence in E-Commerce: a bibliometric study and literature review," Electronic Markets, Springer;IIM University of St. Gallen, vol. 32(1), pages 297-338, March.
    2. Tanzeela AQIF & Abdul WAHAB, 2022. "Reshaping The Future Of Retail Marketing Through Big Data: A Review From 2009 To 2022," Management Research and Practice, Research Centre in Public Administration and Public Services, Bucharest, Romania, vol. 14(3), pages 5-24, September.
    3. Agarwal, Shweta & Kumar, Shailendra & Goel, Utkarsh, 2019. "Stock market response to information diffusion through internet sources: A literature review," International Journal of Information Management, Elsevier, vol. 45(C), pages 118-131.
    4. Gandomi, Amir & Haider, Murtaza, 2015. "Beyond the hype: Big data concepts, methods, and analytics," International Journal of Information Management, Elsevier, vol. 35(2), pages 137-144.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ingrid E. Fisher & Margaret R. Garnsey & Mark E. Hughes, 2016. "Natural Language Processing in Accounting, Auditing and Finance: A Synthesis of the Literature with a Roadmap for Future Research," Intelligent Systems in Accounting, Finance and Management, John Wiley & Sons, Ltd., vol. 23(3), pages 157-214, July.
    2. David F. Larcker & Anastasia A. Zakolyukina, 2012. "Detecting Deceptive Discussions in Conference Calls," Journal of Accounting Research, Wiley Blackwell, vol. 50(2), pages 495-540, May.
    3. Nadine Gatzert & Dinah Heidinger, 2020. "An Empirical Analysis of Market Reactions to the First Solvency and Financial Condition Reports in the European Insurance Industry," Journal of Risk & Insurance, The American Risk and Insurance Association, vol. 87(2), pages 407-436, June.
    4. Jia, Jing & Li, Zhongtian, 2022. "Risk management committees and readability of risk management disclosure," Journal of Contemporary Accounting and Economics, Elsevier, vol. 18(3).
    5. KIM, Hyonok & YASUDA, Yukihiro & 安田, 行宏, 2016. "A new approach to identify the economic effects of disclosure: Information content of business risk disclosures in Japanese firms," Working Paper Series G-1-13, Hitotsubashi University Center for Financial Research.
    6. Sagarika Mishra & Michael T. Ewing & Holly B. Cooper, 2022. "Artificial intelligence focus and firm performance," Journal of the Academy of Marketing Science, Springer, vol. 50(6), pages 1176-1197, November.
    7. Kamaladdin Fataliyev & Aneesh Chivukula & Mukesh Prasad & Wei Liu, 2021. "Stock Market Analysis with Text Data: A Review," Papers 2106.12985, arXiv.org, revised Jul 2021.
    8. Paul Brockman & Jim Cicon, 2013. "The Information Content Of Management Earnings Forecasts: An Analysis Of Hard Versus Soft Information," Journal of Financial Research, Southern Finance Association;Southwestern Finance Association, vol. 36(2), pages 147-174, June.
    9. Kumar, Rahul & Deb, Soumya Guha & Mukherjee, Shubhadeep, 2020. "Do words reveal the latent truth? Identifying communication patterns of corporate losers," Journal of Behavioral and Experimental Finance, Elsevier, vol. 26(C).
    10. An, Suwei, 2023. "Essays on incentive contracts, M&As, and firm risk," Other publications TiSEM dd97d2f5-1c9d-47c5-ba62-f, Tilburg University, School of Economics and Management.
    11. Renato Camodeca & Alex Almici & Umberto Sagliaschi, 2018. "Sustainability Disclosure in Integrated Reporting: Does It Matter to Investors? A Cheap Talk Approach," Sustainability, MDPI, vol. 10(12), pages 1-34, November.
    12. Tsai, Feng-Tse & Lu, Hsin-Min & Hung, Mao-Wei, 2016. "The impact of news articles and corporate disclosure on credit risk valuation," Journal of Banking & Finance, Elsevier, vol. 68(C), pages 100-116.
    13. Tim Loughran & Bill McDonald, 2014. "Regulation and financial disclosure: The impact of plain English," Journal of Regulatory Economics, Springer, vol. 45(1), pages 94-113, February.
    14. Cookson, J. Anthony & Moon, S. Katie & Noh, Joonki, 2020. "Imprecise and Informative: Lessons from Market Reactions to Imprecise Disclosure," SocArXiv akt2c, Center for Open Science.
    15. Qingbin Meng & Congyi Ju & Qinghua Huang & Song Wang, 2023. "The informativeness of investor communication with corporate insiders: Evidence from China," International Finance, Wiley Blackwell, vol. 26(2), pages 189-207, August.
    16. Anand, Abhinav & Basu, Sankarshan & Pathak, Jalaj & Thampy, Ashok, 2021. "The impact of sentiment on emerging stock markets," International Review of Economics & Finance, Elsevier, vol. 75(C), pages 161-177.
    17. Yan, Shan, 2015. "Managerial attitudes and takeover outcomes: Evidence from corporate filings," Journal of International Financial Markets, Institutions and Money, Elsevier, vol. 35(C), pages 30-44.
    18. Fiordelisi, Franco & Ricci, Ornella, 2014. "Corporate culture and CEO turnover," Journal of Corporate Finance, Elsevier, vol. 28(C), pages 66-82.
    19. Price, S. McKay & Doran, James S. & Peterson, David R. & Bliss, Barbara A., 2012. "Earnings conference calls and stock returns: The incremental informativeness of textual tone," Journal of Banking & Finance, Elsevier, vol. 36(4), pages 992-1011.
    20. Joon Mahn Lee & Byoung‐Hyoun Hwang & Hailiang Chen, 2017. "Are founder CEOs more overconfident than professional CEOs? Evidence from S&P 1500 companies," Strategic Management Journal, Wiley Blackwell, vol. 38(3), pages 751-769, March.