IDEAS home Printed from https://ideas.repec.org/p/hal/journl/hal-03511405.html
   My bibliography  Save this paper

Big data, news diversity and financial market crash

Author

Listed:
  • Sabri Boubaker

    (Métis Lab EM Normandie - EM Normandie - École de Management de Normandie, VNU - Vietnam National University [Hanoï])

  • Zhenya Liu

    (CERGAM - Centre d'Études et de Recherche en Gestion d'Aix-Marseille - AMU - Aix Marseille Université - UTLN - Université de Toulon, Renmin University of China = Université Renmin de Chine)

  • Ling Zhai

    (Renmin University of China = Université Renmin de Chine)

Abstract

A vast quantity of high-dimensional, unstructured textual news data is produced every day, more than two decades after the launch of the global Internet. These big data have a significant influence on the way that decisions are made in business and finance, due to the cost, scalability, and transparency benefits that they bring. However, limited studies have fully exploited big data to analyze changes in news diversity or to predict financial market movements, specifically stock market crashes. Based on modern methods of textual analysis, this paper investigates the relationship between news diversity and financial market crashes by applying the change-point detection approach. The empirical analysis shows that (1) big data is a relatively new and useful tool for assessing financial market movements, (2) there is a relationship between news diversity and financial market movements. News diversity tends to decline when the market falls and volatility soars, and increases when the market is on an upward trend and in recovery, and (3) the multiple structural breaks detected improve the ability to forecast stock price movements. Therefore, changes to news diversity, embedded in big data, can be a useful indicator of financial market crashes and recoveries.

Suggested Citation

  • Sabri Boubaker & Zhenya Liu & Ling Zhai, 2021. "Big data, news diversity and financial market crash," Post-Print hal-03511405, HAL.
  • Handle: RePEc:hal:journl:hal-03511405
    DOI: 10.1016/j.techfore.2021.120755
    Note: View the original document on HAL open archive server: https://hal.science/hal-03511405
    as

    Download full text from publisher

    File URL: https://hal.science/hal-03511405/document
    Download Restriction: no

    File URL: https://libkey.io/10.1016/j.techfore.2021.120755?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    Other versions of this item:

    References listed on IDEAS

    as
    1. Alexander Aue & Lajos Horváth, 2013. "Structural breaks in time series," Journal of Time Series Analysis, Wiley Blackwell, vol. 34(1), pages 1-16, January.
    2. Haroon, Omair & Rizvi, Syed Aun R., 2020. "COVID-19: Media coverage and financial markets behavior—A sectoral inquiry," Journal of Behavioral and Experimental Finance, Elsevier, vol. 27(C).
    3. Scott R. Baker & Nicholas Bloom & Steven J. Davis, 2016. "Measuring Economic Policy Uncertainty," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 131(4), pages 1593-1636.
    4. Kenneth R. French & Martin N. Baily & John Y. Campbell & John H. Cochrane & Douglas W. Diamond & Darrell Duffie & Anil K Kashyap & Frederic S. Mishkin & Raghuram G. Rajan & David S. Scharfstein & Robe, 2010. "The Squam Lake Report: Fixing the Financial System," Economics Books, Princeton University Press, edition 1, number 9261.
    5. Birz, Gene & Lott Jr., John R., 2011. "The effect of macroeconomic news on stock returns: New evidence from newspaper coverage," Journal of Banking & Finance, Elsevier, vol. 35(11), pages 2791-2800, November.
    6. Ryohei Hisano & Didier Sornette & Takayuki Mizuno & Takaaki Ohnishi & Tsutomu Watanabe, 2013. "High Quality Topic Extraction from Business News Explains Abnormal Financial Market Volatility," PLOS ONE, Public Library of Science, vol. 8(6), pages 1-12, June.
    7. Jegadeesh, Narasimhan & Wu, Di, 2013. "Word power: A new approach for content analysis," Journal of Financial Economics, Elsevier, vol. 110(3), pages 712-729.
    8. Herbert A. Simon, 1955. "A Behavioral Model of Rational Choice," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 69(1), pages 99-118.
    9. John M. Griffin & Nicholas H. Hirschey & Patrick J. Kelly, 2011. "How Important Is the Financial Media in Global Markets?," The Review of Financial Studies, Society for Financial Studies, vol. 24(12), pages 3941-3992.
    10. Leif Anders Thorsrud, 2020. "Words are the New Numbers: A Newsy Coincident Index of the Business Cycle," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 38(2), pages 393-409, April.
    11. Kayser, Victoria & Blind, Knut, 2017. "Extending the knowledge base of foresight: The contribution of text mining," Technological Forecasting and Social Change, Elsevier, vol. 116(C), pages 208-215.
    12. Fama, Eugene F, 1970. "Efficient Capital Markets: A Review of Theory and Empirical Work," Journal of Finance, American Finance Association, vol. 25(2), pages 383-417, May.
    13. Lily Fang & Joel Peress, 2009. "Media Coverage and the Cross‐section of Stock Returns," Journal of Finance, American Finance Association, vol. 64(5), pages 2023-2052, October.
    14. Granger, C W J, 1969. "Investigating Causal Relations by Econometric Models and Cross-Spectral Methods," Econometrica, Econometric Society, vol. 37(3), pages 424-438, July.
    15. Ryohei Hisano & Didier Sornette & Takayuki Mizuno & Takaaki Ohnishi & Tsutomu Watanabe, 2012. "High quality topic extraction from business news explains abnormal financial market volatility," Papers 1210.6321, arXiv.org, revised Mar 2013.
    16. Simeon Vosen & Torsten Schmidt, 2011. "Forecasting private consumption: survey‐based indicators vs. Google trends," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 30(6), pages 565-578, September.
    17. Joseph E. Engelberg & Christopher A. Parsons, 2011. "The Causal Impact of Media in Financial Markets," Journal of Finance, American Finance Association, vol. 66(1), pages 67-97, February.
    18. Zhang, Yongjie & Zhang, Zuochao & Liu, Lanbiao & Shen, Dehua, 2017. "The interaction of financial news between mass media and new media: Evidence from news on Chinese stock market," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 486(C), pages 535-541.
    19. Joel Peress & Daniel Schmidt, 2020. "Glued to the TV: Distracted Noise Traders and Stock Market Liquidity," Journal of Finance, American Finance Association, vol. 75(2), pages 1083-1133, April.
    20. Lajos Horváth & William Pouliot & Shixuan Wang, 2017. "Detecting at-Most-m Changes in Linear Regression Models," Journal of Time Series Analysis, Wiley Blackwell, vol. 38(4), pages 552-590, July.
    21. Paul C. Tetlock, 2007. "Giving Content to Investor Sentiment: The Role of Media in the Stock Market," Journal of Finance, American Finance Association, vol. 62(3), pages 1139-1168, June.
    22. Werner Antweiler & Murray Z. Frank, 2004. "Is All That Talk Just Noise? The Information Content of Internet Stock Message Boards," Journal of Finance, American Finance Association, vol. 59(3), pages 1259-1294, June.
    23. Liao, Rose & Wang, Xinjie & Wu, Ge, 2021. "The role of media in mergers and acquisitions," Journal of International Financial Markets, Institutions and Money, Elsevier, vol. 74(C).
    24. Zhang, Yi & Zhang, Guangquan & Chen, Hongshu & Porter, Alan L. & Zhu, Donghua & Lu, Jie, 2016. "Topic analysis and forecasting for science, technology and innovation: Methodology with a case study focusing on big data research," Technological Forecasting and Social Change, Elsevier, vol. 105(C), pages 179-191.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Boubaker, Sabri & Liu, Zhenya & Sui, Tianqing & Zhai, Ling, 2022. "The mirror of history: How to statistically identify stock market bubble bursts," Journal of Economic Behavior & Organization, Elsevier, vol. 204(C), pages 128-147.
    2. Shi, Tao & Li, Chongyang & Wanyan, Hong & Xu, Ying & Zhang, Wei, 2022. "The lending risk predicting of the folk informal financial organization from big data using the deep learning hybrid model," Finance Research Letters, Elsevier, vol. 50(C).
    3. Kanzari, Dalel & Nakhli, Mohamed Sahbi & Gaies, Brahim & Sahut, Jean-Michel, 2023. "Predicting macro-financial instability – How relevant is sentiment? Evidence from long short-term memory networks," Research in International Business and Finance, Elsevier, vol. 65(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Zuochao Zhang & Yongjie Zhang & Dehua Shen & Wei Zhang, 2018. "The Dynamic Cross-Correlations between Mass Media News, New Media News, and Stock Returns," Complexity, Hindawi, vol. 2018, pages 1-11, February.
    2. Yen-Ju Hsu & Yang-Cheng Lu & J. Jimmy Yang, 2021. "News sentiment and stock market volatility," Review of Quantitative Finance and Accounting, Springer, vol. 57(3), pages 1093-1122, October.
    3. Ammann, Manuel & Frey, Roman & Verhofen, Michael, 2012. "Do Newspaper Articles Predict Aggregate Stock Returns?," Working Papers on Finance 1204, University of St. Gallen, School of Finance.
    4. Buehlmaier, Matthias M. M. & Zechner, Josef, 2016. "Financial media, price discovery, and merger arbitrage," CFS Working Paper Series 551, Center for Financial Studies (CFS).
    5. Caporale, Guglielmo Maria & Spagnolo, Fabio & Spagnolo, Nicola, 2016. "Macro news and stock returns in the Euro area: A VAR-GARCH-in-mean analysis," International Review of Financial Analysis, Elsevier, vol. 45(C), pages 180-188.
    6. Ahmad, Khurshid & Han, JingGuang & Hutson, Elaine & Kearney, Colm & Liu, Sha, 2016. "Media-expressed negative tone and firm-level stock returns," Journal of Corporate Finance, Elsevier, vol. 37(C), pages 152-172.
    7. Guglielmo Maria Caporale & Fabio Spagnolo & Nicola Spagnolo, 2017. "Macro News and Commodity Returns," International Journal of Finance & Economics, John Wiley & Sons, Ltd., vol. 22(1), pages 68-80, January.
    8. Donald B. Keim & Massimo Massa & Bastian von Beschwitz, 2018. "First to \"Read\" the News: New Analytics and Algorithmic Trading," International Finance Discussion Papers 1233, Board of Governors of the Federal Reserve System (U.S.).
    9. Atri, Hanen & Kouki, Saoussen & Gallali, Mohamed imen, 2021. "The impact of COVID-19 news, panic and media coverage on the oil and gold prices: An ARDL approach," Resources Policy, Elsevier, vol. 72(C).
    10. Massa, Massimo & von Beschwitz, Bastian & Keim, Donald B, 2015. "First to ?Read? the News: News Analytics and Institutional Trading," CEPR Discussion Papers 10534, C.E.P.R. Discussion Papers.
    11. Feipeng Zhang & Yun Hong & Yanhui Jiang & Jiayi Yu, 2022. "Impact of national media reporting concerning COVID-19 on stock market in China: empirical evidence from a quantile regression," Applied Economics, Taylor & Francis Journals, vol. 54(33), pages 3861-3881, July.
    12. Prajwal Eachempati & Praveen Ranjan Srivastava, 2021. "Accounting for unadjusted news sentiment for asset pricing," Qualitative Research in Financial Markets, Emerald Group Publishing Limited, vol. 13(3), pages 383-422, May.
    13. Wei, Yu-Chen & Lu, Yang-Cheng & Chen, Jen-Nan & Hsu, Yen-Ju, 2017. "Informativeness of the market news sentiment in the Taiwan stock market," The North American Journal of Economics and Finance, Elsevier, vol. 39(C), pages 158-181.
    14. Call, Andrew C. & Emett, Scott A. & Maksymov, Eldar & Sharp, Nathan Y., 2022. "Meet the press: Survey evidence on financial journalists as information intermediaries," Journal of Accounting and Economics, Elsevier, vol. 73(2).
    15. Chouliaras, Andreas, 2016. "The Effect of Infomation on Financial Markets: A Survey," MPRA Paper 71396, University Library of Munich, Germany.
    16. Rambaccussing, Dooruj & Kwiatkowski, Andrzej, 2020. "Forecasting with news sentiment: Evidence with UK newspapers," International Journal of Forecasting, Elsevier, vol. 36(4), pages 1501-1516.
    17. Matthew Gentzkow & Bryan T. Kelly & Matt Taddy, 2017. "Text as Data," NBER Working Papers 23276, National Bureau of Economic Research, Inc.
    18. Roberto Casarin & Flaminio Squazzoni, 2013. "Being on the Field When the Game Is Still Under Way. The Financial Press and Stock Markets in Times of Crisis," PLOS ONE, Public Library of Science, vol. 8(7), pages 1-14, July.
    19. Caporale, Guglielmo Maria & Menla Ali, Faek & Spagnolo, Fabio & Spagnolo, Nicola, 2022. "Cross-border portfolio flows and news media coverage," Journal of International Money and Finance, Elsevier, vol. 126(C).
    20. Takayuki Mizuno & Takaaki Ohnishi & Tsutomu Watanabe, 2015. "Novel and topical business news and their impact on stock market activities," CARF F-Series CARF-F-366, Center for Advanced Research in Finance, Faculty of Economics, The University of Tokyo.

    More about this item

    Keywords

    Big data; News diversity; Textual analysis; Change-point; Financial crisis;
    All these keywords.

    JEL classification:

    • C81 - Mathematical and Quantitative Methods - - Data Collection and Data Estimation Methodology; Computer Programs - - - Methodology for Collecting, Estimating, and Organizing Microeconomic Data; Data Access
    • G01 - Financial Economics - - General - - - Financial Crises
    • G14 - Financial Economics - - General Financial Markets - - - Information and Market Efficiency; Event Studies; Insider Trading
    • O16 - Economic Development, Innovation, Technological Change, and Growth - - Economic Development - - - Financial Markets; Saving and Capital Investment; Corporate Finance and Governance

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:hal:journl:hal-03511405. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: CCSD (email available below). General contact details of provider: https://hal.archives-ouvertes.fr/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.