IDEAS home Printed from https://ideas.repec.org/a/eee/finlet/v34y2020ics1544612319304064.html
   My bibliography  Save this article

The forecasting power of the multi-language narrative of sell-side research: A machine learning evaluation

Author

Listed:
  • Rybinski, Krzysztof

Abstract

This is probably the first ever analysis of sell-side daily economic research to use Natural Language Processing, and it shows that the narrative of such reports can be used to predict economic time series. The NLP indexes are based on Polish and English language reports released at the same time and exhibit predictive power for different sets of economic variables. VAR models with the NLP indexes generate smaller forecast errors than ARIMA. The wordscores scaling model uses Monetary Policy Council statements to generate scores and allows NLP indexes to be created with better forecasting power than the sentiment-based ones.

Suggested Citation

  • Rybinski, Krzysztof, 2020. "The forecasting power of the multi-language narrative of sell-side research: A machine learning evaluation," Finance Research Letters, Elsevier, vol. 34(C).
  • Handle: RePEc:eee:finlet:v:34:y:2020:i:c:s1544612319304064
    DOI: 10.1016/j.frl.2019.08.009
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S1544612319304064
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.frl.2019.08.009?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Grimmer, Justin & Stewart, Brandon M., 2013. "Text as Data: The Promise and Pitfalls of Automatic Content Analysis Methods for Political Texts," Political Analysis, Cambridge University Press, vol. 21(3), pages 267-297, July.
    2. David-Jan Jansen & Jakob de Haan, 2010. "An Assessment of the Consistency of ECB Communication using Wordscores," DNB Working Papers 259, Netherlands Central Bank, Research Department.
    3. Ingrid E. Fisher & Margaret R. Garnsey & Mark E. Hughes, 2016. "Natural Language Processing in Accounting, Auditing and Finance: A Synthesis of the Literature with a Roadmap for Future Research," Intelligent Systems in Accounting, Finance and Management, John Wiley & Sons, Ltd., vol. 23(3), pages 157-214, July.
    4. Mirjana Pejić Bach & Živko Krstić & Sanja Seljan & Lejla Turulja, 2019. "Text Mining for Big Data Analysis in Financial Sector: A Literature Review," Sustainability, MDPI, vol. 11(5), pages 1-27, February.
    5. Tim Loughran & Bill Mcdonald, 2016. "Textual Analysis in Accounting and Finance: A Survey," Journal of Accounting Research, Wiley Blackwell, vol. 54(4), pages 1187-1230, September.
    6. Yuan Luo & William K. Thompson & Timothy M. Herr & Zexian Zeng & Mark A. Berendsen & Siddhartha R. Jonnalagadda & Matthew B. Carson & Justin Starren, 2017. "Natural Language Processing for EHR-Based Pharmacovigilance: A Structured Review," Drug Safety, Springer, vol. 40(11), pages 1075-1089, November.
    7. Laver, Michael & Benoit, Kenneth & Garry, John, 2003. "Extracting Policy Positions from Political Texts Using Words as Data," American Political Science Review, Cambridge University Press, vol. 97(2), pages 311-331, May.
    8. Granger, C W J, 1969. "Investigating Causal Relations by Econometric Models and Cross-Spectral Methods," Econometrica, Econometric Society, vol. 37(3), pages 424-438, July.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Rybinski, Krzysztof, 2021. "Ranking professional forecasters by the predictive power of their narratives," International Journal of Forecasting, Elsevier, vol. 37(1), pages 186-204.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Rybinski, Krzysztof, 2021. "Ranking professional forecasters by the predictive power of their narratives," International Journal of Forecasting, Elsevier, vol. 37(1), pages 186-204.
    2. Senave, Elseline & Jans, Mieke J. & Srivastava, Rajendra P., 2023. "The application of text mining in accounting," International Journal of Accounting Information Systems, Elsevier, vol. 50(C).
    3. Berkin, Anil & Aerts, Walter & Van Caneghem, Tom, 2023. "Feasibility analysis of machine learning for performance-related attributional statements," International Journal of Accounting Information Systems, Elsevier, vol. 48(C).
    4. Diaf, Sami & Döpke, Jörg & Fritsche, Ulrich & Rockenbach, Ida, 2022. "Sharks and minnows in a shoal of words: Measuring latent ideological positions based on text mining techniques," European Journal of Political Economy, Elsevier, vol. 75(C).
    5. Aaryan Gupta & Vinya Dengre & Hamza Abubakar Kheruwala & Manan Shah, 2020. "Comprehensive review of text-mining applications in finance," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 6(1), pages 1-25, December.
    6. Pastwa, Anna M. & Shrestha, Prabal & Thewissen, James & Torsin, Wouter, 2021. "Unpacking the black box of ICO white papers: a topic modeling approach," LIDAM Discussion Papers LFIN 2021018, Université catholique de Louvain, Louvain Finance (LFIN).
    7. Shuangyan Li & Guangrui Wang & Yongli Luo, 2022. "Tone of language, financial disclosure, and earnings management: a textual analysis of form 20-F," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 8(1), pages 1-24, December.
    8. Pierre-Marc Daigneault & Dominic Duval & Louis M. Imbeau, 2018. "Supervised scaling of semi-structured interview transcripts to characterize the ideology of a social policy reform," Quality & Quantity: International Journal of Methodology, Springer, vol. 52(5), pages 2151-2162, September.
    9. Murat Aydogdu & Hakan Saraoglu & David Louton, 2019. "Using long short‐term memory neural networks to analyze SEC 13D filings: A recipe for human and machine interaction," Intelligent Systems in Accounting, Finance and Management, John Wiley & Sons, Ltd., vol. 26(4), pages 153-163, October.
    10. Hayo, Bernd & Henseler, Kai & Steffen Rapp, Marc & Zahner, Johannes, 2022. "Complexity of ECB communication and financial market trading," Journal of International Money and Finance, Elsevier, vol. 128(C).
    11. Ulrich Fritsche & Johannes Puckelwald, 2018. "Deciphering Professional Forecasters’ Stories - Analyzing a Corpus of Textual Predictions for the German Economy," Macroeconomics and Finance Series 201804, University of Hamburg, Department of Socioeconomics.
    12. Seraphine F. Maerz & Carsten Q. Schneider, 2020. "Comparing public communication in democracies and autocracies: automated text analyses of speeches by heads of government," Quality & Quantity: International Journal of Methodology, Springer, vol. 54(2), pages 517-545, April.
    13. Hayo, Bernd & Henseler, Kai & Rapp, Marc Steffen, 2019. "Estimating the monetary policy interest-rate-to-performance sensitivity of the European banking sector at the zero lower bound," Finance Research Letters, Elsevier, vol. 31(C).
    14. Rebecca Cordell & Kristian Skrede Gleditsch & Florian G Kern & Laura Saavedra-Lux, 2020. "Measuring institutional variation across American Indian constitutions using automated content analysis," Journal of Peace Research, Peace Research Institute Oslo, vol. 57(6), pages 777-788, November.
    15. Christina Bannier & Thomas Pauls & Andreas Walter, 2019. "Content analysis of business communication: introducing a German dictionary," Journal of Business Economics, Springer, vol. 89(1), pages 79-123, February.
    16. Barth, Andreas & Radev, Deyan, 2022. "Integration culture of global banks and the transmission of lending shocks," Journal of Banking & Finance, Elsevier, vol. 134(C).
    17. Born, Andreas & Janssen, Aljoscha, 2020. "Does a District-Vote Matter for the Behavior of Politicians? A Textual Analysis of Parliamentary Speeches," Working Paper Series 1320, Research Institute of Industrial Economics.
    18. Sanders James & Lisi Giulio & Schonhardt-Bailey Cheryl, 2017. "Themes and Topics in Parliamentary Oversight Hearings: A New Direction in Textual Data Analysis," Statistics, Politics and Policy, De Gruyter, vol. 8(2), pages 153-194, December.
    19. Bennani, Hamza, 2014. "The art of central banks' forward guidance at the zero lower bound," MPRA Paper 57043, University Library of Munich, Germany.

    More about this item

    Keywords

    Economic research; Forecasting; Text mining; NLP; Sentiment analysis; Wordscores;
    All these keywords.

    JEL classification:

    • C54 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Quantitative Policy Modeling
    • E47 - Macroeconomics and Monetary Economics - - Money and Interest Rates - - - Forecasting and Simulation: Models and Applications
    • E52 - Macroeconomics and Monetary Economics - - Monetary Policy, Central Banking, and the Supply of Money and Credit - - - Monetary Policy

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:finlet:v:34:y:2020:i:c:s1544612319304064. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/frl .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.