IDEAS home Printed from https://ideas.repec.org/a/bla/jecsur/v34y2020i3p512-547.html
   My bibliography  Save this article

Econometrics Meets Sentiment: An Overview Of Methodology And Applications

Author

Listed:
  • Andres Algaba
  • David Ardia
  • Keven Bluteau
  • Samuel Borms
  • Kris Boudt

Abstract

The advent of massive amounts of textual, audio, and visual data has spurred the development of econometric methodology to transform qualitative sentiment data into quantitative sentiment variables, and to use those variables in an econometric analysis of the relationships between sentiment and other variables. We survey this emerging research field and refer to it as sentometrics, which is a portmanteau of sentiment and econometrics. We provide a synthesis of the relevant methodological approaches, illustrate with empirical results, and discuss useful software.

Suggested Citation

  • Andres Algaba & David Ardia & Keven Bluteau & Samuel Borms & Kris Boudt, 2020. "Econometrics Meets Sentiment: An Overview Of Methodology And Applications," Journal of Economic Surveys, Wiley Blackwell, vol. 34(3), pages 512-547, July.
  • Handle: RePEc:bla:jecsur:v:34:y:2020:i:3:p:512-547
    DOI: 10.1111/joes.12370
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/joes.12370
    Download Restriction: no

    File URL: https://libkey.io/10.1111/joes.12370?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Cindy K Soo, 2018. "Quantifying Sentiment with News Media across Local Housing Markets," The Review of Financial Studies, Society for Financial Studies, vol. 31(10), pages 3689-3719.
    2. Robert F Engle & Stefano Giglio & Bryan Kelly & Heebum Lee & Johannes Stroebel, 2020. "Hedging Climate Change News," The Review of Financial Studies, Society for Financial Studies, vol. 33(3), pages 1184-1216.
    3. Larsen, Vegard H. & Thorsrud, Leif Anders & Zhulanova, Julia, 2021. "News-driven inflation expectations and information rigidities," Journal of Monetary Economics, Elsevier, vol. 117(C), pages 507-520.
    4. Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney Newey & James Robins, 2018. "Double/debiased machine learning for treatment and structural parameters," Econometrics Journal, Royal Economic Society, vol. 21(1), pages 1-68, February.
    5. Grimmer, Justin & Stewart, Brandon M., 2013. "Text as Data: The Promise and Pitfalls of Automatic Content Analysis Methods for Political Texts," Political Analysis, Cambridge University Press, vol. 21(3), pages 267-297, July.
    6. Sanjiv R. Das & Mike Y. Chen, 2007. "Yahoo! for Amazon: Sentiment Extraction from Small Talk on the Web," Management Science, INFORMS, vol. 53(9), pages 1375-1388, September.
    7. Lesley Chiou & Catherine Tucker, 2017. "Content aggregation by platforms: The case of the news media," Journal of Economics & Management Strategy, Wiley Blackwell, vol. 26(4), pages 782-805, December.
    8. Denny, Matthew J. & Spirling, Arthur, 2018. "Text Preprocessing For Unsupervised Learning: Why It Matters, When It Misleads, And What To Do About It," Political Analysis, Cambridge University Press, vol. 26(2), pages 168-189, April.
    9. Nimark, Kristoffer P. & Pitschner, Stefan, 2019. "News media and delegated information choice," Journal of Economic Theory, Elsevier, vol. 181(C), pages 160-196.
    10. Picault, Matthieu & Renault, Thomas, 2017. "Words are not all created equal: A new measure of ECB communication," Journal of International Money and Finance, Elsevier, vol. 79(C), pages 136-156.
    11. George‐Marios Angeletos & Fabrice Collard & Harris Dellas, 2018. "Quantifying Confidence," Econometrica, Econometric Society, vol. 86(5), pages 1689-1726, September.
    12. Arslan-Ayaydin, Özgür & Boudt, Kris & Thewissen, James, 2016. "Managers set the tone: Equity incentives and the tone of earnings press releases," Journal of Banking & Finance, Elsevier, vol. 72(S), pages 132-147.
    13. Tim Loughran & Bill Mcdonald, 2014. "Measuring Readability in Financial Disclosures," Journal of Finance, American Finance Association, vol. 69(4), pages 1643-1671, August.
    14. Diamantis Petropoulos Petalas & Hein van Schie & Paul Hendriks Vettehen, 2017. "Forecasted economic change and the self-fulfilling prophecy in economic decision-making," PLOS ONE, Public Library of Science, vol. 12(3), pages 1-18, March.
    15. Glasserman, Paul & Mamaysky, Harry, 2019. "Does Unusual News Forecast Market Stress?," Journal of Financial and Quantitative Analysis, Cambridge University Press, vol. 54(5), pages 1937-1974, October.
    16. Svetlana Borovkova & Evgeny Garmaev & Philip Lammers & Jordi Rustige, 2017. "SenSR: A sentiment-based systemic risk indicator," DNB Working Papers 553, Netherlands Central Bank, Research Department.
    17. Bajo, Emanuele & Raimondo, Carlo, 2017. "Media sentiment and IPO underpricing," Journal of Corporate Finance, Elsevier, vol. 46(C), pages 139-153.
    18. Friedman, Jerome H. & Hastie, Trevor & Tibshirani, Rob, 2010. "Regularization Paths for Generalized Linear Models via Coordinate Descent," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 33(i01).
    19. Douglas W. Diamond & Philip H. Dybvig, 2000. "Bank runs, deposit insurance, and liquidity," Quarterly Review, Federal Reserve Bank of Minneapolis, vol. 24(Win), pages 14-23.
    20. Massimiliano Caporin & Francesco Poli, 2017. "Building News Measures from Textual Data and an Application to Volatility Forecasting," Econometrics, MDPI, vol. 5(3), pages 1-46, August.
    21. E. Andreou & P. Gagliardini & E. Ghysels & M. Rubin, 2019. "Inference in Group Factor Models With an Application to Mixed‐Frequency Data," Econometrica, Econometric Society, vol. 87(4), pages 1267-1305, July.
    22. Peter R. Hansen & Asger Lunde & James M. Nason, 2011. "The Model Confidence Set," Econometrica, Econometric Society, vol. 79(2), pages 453-497, March.
    23. Malcolm Baker & Jeffrey Wurgler, 2007. "Investor Sentiment in the Stock Market," Journal of Economic Perspectives, American Economic Association, vol. 21(2), pages 129-152, Spring.
    24. Calomiris, Charles W. & Mamaysky, Harry, 2019. "How news and its context drive risk and returns around the world," Journal of Financial Economics, Elsevier, vol. 133(2), pages 299-336.
    25. Kearney, Colm & Liu, Sha, 2014. "Textual sentiment in finance: A survey of methods and models," International Review of Financial Analysis, Elsevier, vol. 33(C), pages 171-185.
    26. De Long, J Bradford & Andrei Shleifer & Lawrence H. Summers & Robert J. Waldmann, 1990. "Noise Trader Risk in Financial Markets," Journal of Political Economy, University of Chicago Press, vol. 98(4), pages 703-738, August.
    27. Matthieu Picault & Thomas Renault, 2017. "Words are not all created equal: A new measure of ECB communication," Université Paris1 Panthéon-Sorbonne (Post-Print and Working Papers) hal-03205121, HAL.
    28. David Bholat & Stephen Hans & Pedro Santos & Cheryl Schonhardt-Bailey, 2015. "Text mining for central banks," Handbooks, Centre for Central Banking Studies, Bank of England, number 33, April.
    29. Chang, Chuang-Chang & Hsieh, Pei-Fang & Wang, Yaw-Huei, 2015. "Sophistication, Sentiment, and Misreaction," Journal of Financial and Quantitative Analysis, Cambridge University Press, vol. 50(4), pages 903-928, August.
    30. Arina Wischnewsky & David‐Jan Jansen & Matthias Neuenkirch, 2021. "Financial stability and the Fed: Evidence from congressional hearings," Economic Inquiry, Western Economic Association International, vol. 59(3), pages 1192-1214, July.
    31. Scott R. Baker & Nicholas Bloom & Steven J. Davis, 2016. "Measuring Economic Policy Uncertainty," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 131(4), pages 1593-1636.
    32. Sarah Gelper & Christophe Croux, 2010. "On the Construction of the European Economic Sentiment Indicator," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 72(1), pages 47-62, February.
    33. Vegard H�ghaug Larsen & Leif Anders Thorsrud, 2018. "Business cycle narratives," Working Papers No 6/2018, Centre for Applied Macro- and Petroleum economics (CAMP), BI Norwegian Business School.
    34. Robert B. Barsky & Eric R. Sims, 2012. "Information, Animal Spirits, and the Meaning of Innovations in Consumer Confidence," American Economic Review, American Economic Association, vol. 102(4), pages 1343-1377, June.
    35. Jegadeesh, Narasimhan & Wu, Di, 2013. "Word power: A new approach for content analysis," Journal of Financial Economics, Elsevier, vol. 110(3), pages 712-729.
    36. Margaret E. Roberts & Brandon M. Stewart & Edoardo M. Airoldi, 2016. "A Model of Text for Experimentation in the Social Sciences," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(515), pages 988-1003, July.
    37. Manela, Asaf & Moreira, Alan, 2017. "News implied volatility and disaster concerns," Journal of Financial Economics, Elsevier, vol. 123(1), pages 137-162.
    38. Vasilev, Aleksandar & Maksumov, Rashid, 2010. "Critical analysis of Chapter 23 of Keynes’s Notes on Mercantilism in The General Theory of Employment, Interest and Money (1936)," EconStor Research Reports 155318, ZBW - Leibniz Information Centre for Economics.
    39. Marcel Garz, 2014. "Good news and bad news: evidence of media bias in unemployment reports," Public Choice, Springer, vol. 161(3), pages 499-515, December.
    40. Larsen, Vegard H. & Thorsrud, Leif A., 2019. "The value of news for economic developments," Journal of Econometrics, Elsevier, vol. 210(1), pages 203-218.
    41. Paul C. Tetlock & Maytal Saar‐Tsechansky & Sofus Macskassy, 2008. "More Than Words: Quantifying Language to Measure Firms' Fundamentals," Journal of Finance, American Finance Association, vol. 63(3), pages 1437-1467, June.
    42. Sydney C. Ludvigson, 2004. "Consumer Confidence and Consumer Spending," Journal of Economic Perspectives, American Economic Association, vol. 18(2), pages 29-50, Spring.
    43. Arina Wischnewsky & David‐Jan Jansen & Matthias Neuenkirch, 2021. "Financial stability and the Fed: Evidence from congressional hearings," Economic Inquiry, Western Economic Association International, vol. 59(3), pages 1192-1214, July.
    44. Feinerer, Ingo & Hornik, Kurt & Meyer, David, 2008. "Text Mining Infrastructure in R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 25(i05).
    45. Tim Loughran & Bill Mcdonald, 2016. "Textual Analysis in Accounting and Finance: A Survey," Journal of Accounting Research, Wiley Blackwell, vol. 54(4), pages 1187-1230, September.
    46. Boudt, Kris & Thewissen, James & Torsin, Wouter, 2018. "When does the tone of earnings press releases matter?," International Review of Financial Analysis, Elsevier, vol. 57(C), pages 231-245.
    47. Jess Benhabib & Mark M Spiegel, 2019. "Sentiments and Economic Activity: Evidence from US States," The Economic Journal, Royal Economic Society, vol. 129(618), pages 715-733.
    48. Roman Kräussl & Elizaveta Mirgorodskaya, 2017. "Media, sentiment and market performance in the long run," The European Journal of Finance, Taylor & Francis Journals, vol. 23(11), pages 1059-1082, September.
    49. Kristian D. Allee & Matthew D. Deangelis, 2015. "The Structure of Voluntary Disclosure Narratives: Evidence from Tone Dispersion," Journal of Accounting Research, Wiley Blackwell, vol. 53(2), pages 241-274, May.
    50. Gregory P. Casey & Ann L. Owen, 2013. "Good News, Bad News, and Consumer Confidence," Social Science Quarterly, Southwestern Social Science Association, vol. 94(1), pages 292-315, March.
    51. Adam Nowak & Patrick Smith, 2017. "Textual Analysis in Real Estate," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 32(4), pages 896-918, June.
    52. Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney Newey, 2017. "Double/Debiased/Neyman Machine Learning of Treatment Effects," American Economic Review, American Economic Association, vol. 107(5), pages 261-265, May.
    53. Paul C. Tetlock, 2007. "Giving Content to Investor Sentiment: The Role of Media in the Stock Market," Journal of Finance, American Finance Association, vol. 62(3), pages 1139-1168, June.
    54. Werner Antweiler & Murray Z. Frank, 2004. "Is All That Talk Just Noise? The Information Content of Internet Stock Message Boards," Journal of Finance, American Finance Association, vol. 59(3), pages 1259-1294, June.
    55. Matthew Gentzkow & Jesse M. Shapiro, 2010. "What Drives Media Slant? Evidence From U.S. Daily Newspapers," Econometrica, Econometric Society, vol. 78(1), pages 35-71, January.
    56. Kris Boudt & James Thewissen, 2019. "Jockeying for Position in CEO Letters: Impression Management and Sentiment Analytics," Financial Management, Financial Management Association International, vol. 48(1), pages 77-115, March.
    57. Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney Newey & James Robins, 2016. "Double/Debiased Machine Learning for Treatment and Causal Parameters," Papers 1608.00060, arXiv.org, revised Dec 2017.
    58. Hui Zou & Trevor Hastie, 2005. "Addendum: Regularization and variable selection via the elastic net," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 67(5), pages 768-768, November.
    59. Hal R. Varian, 2014. "Big Data: New Tricks for Econometrics," Journal of Economic Perspectives, American Economic Association, vol. 28(2), pages 3-28, Spring.
    60. Matthew Gentzkow & Jesse M. Shapiro & Matt Taddy, 2019. "Measuring Group Differences in High‐Dimensional Choices: Method and Application to Congressional Speech," Econometrica, Econometric Society, vol. 87(4), pages 1307-1340, July.
    61. Tim Loughran & Bill Mcdonald, 2011. "When Is a Liability Not a Liability? Textual Analysis, Dictionaries, and 10‐Ks," Journal of Finance, American Finance Association, vol. 66(1), pages 35-65, February.
    62. Duo Qin, 2011. "Rise Of Var Modelling Approach," Journal of Economic Surveys, Wiley Blackwell, vol. 25(1), pages 156-174, February.
    63. William J. Mayew & Mohan Venkatachalam, 2012. "The Power of Voice: Managerial Affective States and Future Firm Performance," Journal of Finance, American Finance Association, vol. 67(1), pages 1-44, February.
    64. Hui Zou & Trevor Hastie, 2005. "Regularization and variable selection via the elastic net," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 67(2), pages 301-320, April.
    65. Christina Bannier & Thomas Pauls & Andreas Walter, 2019. "Content analysis of business communication: introducing a German dictionary," Journal of Business Economics, Springer, vol. 89(1), pages 79-123, February.
    66. Craig Lewis & Steven Young, 2019. "Fad or future? Automated analysis of financial text and its implications for corporate reporting," Accounting and Business Research, Taylor & Francis Journals, vol. 49(5), pages 587-615, July.
    67. Ardia, David & Bluteau, Keven & Boudt, Kris, 2019. "Questioning the news about economic growth: Sparse forecasting using thousands of news-based sentiment values," International Journal of Forecasting, Elsevier, vol. 35(4), pages 1370-1386.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Simionescu, Mihaela, 2022. "Econometrics of sentiments- sentometrics and machine learning: The improvement of inflation predictions in Romania using sentiment analysis," Technological Forecasting and Social Change, Elsevier, vol. 182(C).
    2. Matthew Gentzkow & Bryan T. Kelly & Matt Taddy, 2017. "Text as Data," NBER Working Papers 23276, National Bureau of Economic Research, Inc.
    3. Tim Loughran & Bill Mcdonald, 2016. "Textual Analysis in Accounting and Finance: A Survey," Journal of Accounting Research, Wiley Blackwell, vol. 54(4), pages 1187-1230, September.
    4. Hoang, Daniel & Wiegratz, Kevin, 2022. "Machine learning methods in finance: Recent applications and prospects," Working Paper Series in Economics 158, Karlsruhe Institute of Technology (KIT), Department of Economics and Management.
    5. Renault, Thomas, 2017. "Intraday online investor sentiment and return patterns in the U.S. stock market," Journal of Banking & Finance, Elsevier, vol. 84(C), pages 25-40.
    6. Eleni Kalamara & Arthur Turrell & Chris Redl & George Kapetanios & Sujit Kapadia, 2022. "Making text count: Economic forecasting using newspaper text," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 37(5), pages 896-919, August.
    7. Christina Bannier & Thomas Pauls & Andreas Walter, 2019. "Content analysis of business communication: introducing a German dictionary," Journal of Business Economics, Springer, vol. 89(1), pages 79-123, February.
    8. Tom Marty & Bruce Vanstone & Tobias Hahn, 2020. "News media analytics in finance: a survey," Accounting and Finance, Accounting and Finance Association of Australia and New Zealand, vol. 60(2), pages 1385-1434, June.
    9. Ardia, David & Bluteau, Keven & Boudt, Kris, 2022. "Media abnormal tone, earnings announcements, and the stock market," Journal of Financial Markets, Elsevier, vol. 61(C).
    10. Ahmad, Khurshid & Han, JingGuang & Hutson, Elaine & Kearney, Colm & Liu, Sha, 2016. "Media-expressed negative tone and firm-level stock returns," Journal of Corporate Finance, Elsevier, vol. 37(C), pages 152-172.
    11. García, Diego & Hu, Xiaowen & Rohrer, Maximilian, 2023. "The colour of finance words," Journal of Financial Economics, Elsevier, vol. 147(3), pages 525-549.
    12. Anand, Abhinav & Basu, Sankarshan & Pathak, Jalaj & Thampy, Ashok, 2021. "The impact of sentiment on emerging stock markets," International Review of Economics & Finance, Elsevier, vol. 75(C), pages 161-177.
    13. Nyman, Rickard & Kapadia, Sujit & Tuckett, David, 2021. "News and narratives in financial systems: Exploiting big data for systemic risk assessment," Journal of Economic Dynamics and Control, Elsevier, vol. 127(C).
    14. Ardia, David & Bluteau, Keven & Boudt, Kris, 2019. "Questioning the news about economic growth: Sparse forecasting using thousands of news-based sentiment values," International Journal of Forecasting, Elsevier, vol. 35(4), pages 1370-1386.
    15. Daniele Ballinari & Simon Behrendt, 2021. "How to gauge investor behavior? A comparison of online investor sentiment measures," Digital Finance, Springer, vol. 3(2), pages 169-204, June.
    16. Senave, Elseline & Jans, Mieke J. & Srivastava, Rajendra P., 2023. "The application of text mining in accounting," International Journal of Accounting Information Systems, Elsevier, vol. 50(C).
    17. Yuting Chen & Don Bredin & Valerio Potì & Roman Matkovskyy, 2022. "COVID risk narratives: a computational linguistic approach to the econometric identification of narrative risk during a pandemic," Digital Finance, Springer, vol. 4(1), pages 17-61, March.
    18. Vegard Høghaug Larsen & Leif Anders Thorsrud, 2022. "Asset returns, news topics, and media effects," Scandinavian Journal of Economics, Wiley Blackwell, vol. 124(3), pages 838-868, July.
    19. Bai, Xiwen & Lam, Jasmine Siu Lee & Jakher, Astha, 2021. "Shipping sentiment and the dry bulk shipping freight market: New evidence from newspaper coverage," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 155(C).
    20. Jon Ellingsen & Vegard H. Larsen & Leif Anders Thorsrud, 2020. "News Media vs. FRED-MD for Macroeconomic Forecasting," CESifo Working Paper Series 8639, CESifo.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:jecsur:v:34:y:2020:i:3:p:512-547. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www.blackwellpublishing.com/journal.asp?ref=0950-0804 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.