IDEAS home Printed from https://ideas.repec.org/a/eee/jfinec/v147y2023i3p525-549.html
   My bibliography  Save this article

The colour of finance words

Author

Listed:
  • García, Diego
  • Hu, Xiaowen
  • Rohrer, Maximilian

Abstract

Our paper relies on stock price reactions to colour words, in order to provide new dictionaries of positive and negative words in a finance context. We extend the machine learning algorithm of Taddy (2013), adding a cross-validation layer to avoid over-fitting. In head-to-head comparisons, our dictionaries outperform the standard bag-of-words approach (Loughran and McDonald, 2011) when predicting stock price movements out-of-sample. By comparing their composition, word-by-word, our method refines and expands the sentiment dictionaries in the literature. The breadth of our dictionaries and their ability to disambiguate words using bigrams both help to colour finance discourse better.

Suggested Citation

  • García, Diego & Hu, Xiaowen & Rohrer, Maximilian, 2023. "The colour of finance words," Journal of Financial Economics, Elsevier, vol. 147(3), pages 525-549.
  • Handle: RePEc:eee:jfinec:v:147:y:2023:i:3:p:525-549
    DOI: 10.1016/j.jfineco.2022.11.006
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0304405X22002422
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.jfineco.2022.11.006?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. David F. Larcker & Anastasia A. Zakolyukina, 2012. "Detecting Deceptive Discussions in Conference Calls," Journal of Accounting Research, Wiley Blackwell, vol. 50(2), pages 495-540, May.
    2. Hanley, Kathleen Weiss & Hoberg, Gerard, 2012. "Litigation risk, strategic disclosure and the underpricing of initial public offerings," Journal of Financial Economics, Elsevier, vol. 103(2), pages 235-254.
    3. Eugene F. Fama & Kenneth R. French, 2001. "Disappearing Dividends: Changing Firm Characteristics Or Lower Propensity To Pay?," Journal of Applied Corporate Finance, Morgan Stanley, vol. 14(1), pages 67-79, March.
    4. Sanjiv R. Das & Mike Y. Chen, 2007. "Yahoo! for Amazon: Sentiment Extraction from Small Talk on the Web," Management Science, INFORMS, vol. 53(9), pages 1375-1388, September.
    5. Stephen Hansen & Michael McMahon & Andrea Prat, 2018. "Transparency and Deliberation Within the FOMC: A Computational Linguistics Approach," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 133(2), pages 801-870.
    6. Glasserman, Paul & Mamaysky, Harry, 2019. "Does Unusual News Forecast Market Stress?," Journal of Financial and Quantitative Analysis, Cambridge University Press, vol. 54(5), pages 1937-1974, October.
    7. Scott R. Baker & Nicholas Bloom & Steven J. Davis, 2016. "Measuring Economic Policy Uncertainty," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 131(4), pages 1593-1636.
    8. Kathleen Weiss Hanley & Gerard Hoberg, 2019. "Dynamic Interpretation of Emerging Risks in the Financial Sector," Review of Financial Studies, Society for Financial Studies, vol. 32(12), pages 4543-4603.
    9. Tarek A Hassan & Stephan Hollander & Laurence van Lent & Ahmed Tahoun, 2019. "Firm-Level Political Risk: Measurement and Effects," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 134(4), pages 2135-2202.
    10. Jegadeesh, Narasimhan & Wu, Di, 2013. "Word power: A new approach for content analysis," Journal of Financial Economics, Elsevier, vol. 110(3), pages 712-729.
    11. Gerard Hoberg & Gordon Phillips, 2016. "Text-Based Network Industries and Endogenous Product Differentiation," Journal of Political Economy, University of Chicago Press, vol. 124(5), pages 1423-1465.
    12. Tim Loughran & Bill McDonald, 2020. "Textual Analysis in Finance," Annual Review of Financial Economics, Annual Reviews, vol. 12(1), pages 357-375, December.
    13. Jason V. Chen & Venky Nagar & Jordan Schoenfeld, 2018. "Manager-analyst conversations in earnings conference calls," Review of Accounting Studies, Springer, vol. 23(4), pages 1315-1354, December.
    14. Fama, Eugene F & MacBeth, James D, 1973. "Risk, Return, and Equilibrium: Empirical Tests," Journal of Political Economy, University of Chicago Press, vol. 81(3), pages 607-636, May-June.
    15. Matt Taddy, 2013. "Multinomial Inverse Regression for Text Analysis," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 108(503), pages 755-770, September.
    16. David H. Solomon, 2012. "Selective Publicity and Stock Prices," Journal of Finance, American Finance Association, vol. 67(2), pages 599-638, April.
    17. Pierre Jinghong Liang & Vitaly Meursault & Bryan B. Routledge & Madeline Marco Scanlon, 2021. "PEAD.txt: Post-Earnings-Announcement Drift Using Text," Working Papers 21-07, Federal Reserve Bank of Philadelphia.
    18. Margaret Roberts & Brandon Stewart & Tingley, Dustin & Edoardo Airoldi, 2013. "The structural topic model and applied social science," Working Paper 132666, Harvard University OpenScholar.
    19. Frankel, R & Johnson, M & Skinner, DJ, 1999. "An empirical examination of conference calls as a voluntary disclosure medium," Journal of Accounting Research, Wiley Blackwell, vol. 37(1), pages 133-150.
    20. Volkan Muslu & Suresh Radhakrishnan & K. R. Subramanyam & Dongkuk Lim, 2015. "Forward-Looking MD&A Disclosures and the Information Environment," Management Science, INFORMS, vol. 61(5), pages 931-948, May.
    21. Manela, Asaf & Moreira, Alan, 2017. "News implied volatility and disaster concerns," Journal of Financial Economics, Elsevier, vol. 123(1), pages 137-162.
    22. Tim Loughran & Bill Mcdonald, 2016. "Textual Analysis in Accounting and Finance: A Survey," Journal of Accounting Research, Wiley Blackwell, vol. 54(4), pages 1187-1230, September.
    23. Hoberg, Gerard & Lewis, Craig, 2017. "Do fraudulent firms produce abnormal disclosure?," Journal of Corporate Finance, Elsevier, vol. 43(C), pages 58-85.
    24. Zheng Tracy Ke & Bryan T. Kelly & Dacheng Xiu, 2019. "Predicting Returns With Text Data," NBER Working Papers 26186, National Bureau of Economic Research, Inc.
    25. Paul C. Tetlock, 2007. "Giving Content to Investor Sentiment: The Role of Media in the Stock Market," Journal of Finance, American Finance Association, vol. 62(3), pages 1139-1168, June.
    26. Werner Antweiler & Murray Z. Frank, 2004. "Is All That Talk Just Noise? The Information Content of Internet Stock Message Boards," Journal of Finance, American Finance Association, vol. 59(3), pages 1259-1294, June.
    27. García, Diego & Norli, Øyvind, 2012. "Geographic dispersion and stock returns," Journal of Financial Economics, Elsevier, vol. 106(3), pages 547-565.
    28. J. Anthony Cookson & Marina Niessner, 2020. "Why Don't We Agree? Evidence from a Social Network of Investors," Journal of Finance, American Finance Association, vol. 75(1), pages 173-228, February.
    29. Tim Loughran & Bill Mcdonald, 2011. "When Is a Liability Not a Liability? Textual Analysis, Dictionaries, and 10‐Ks," Journal of Finance, American Finance Association, vol. 66(1), pages 35-65, February.
    30. Anastassia Fedyk, 2021. "Disagreement after News: Gradual Information Diffusion or Differences of Opinion?," The Review of Asset Pricing Studies, Society for Financial Studies, vol. 11(3), pages 465-501.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Sudarshan Kumar & Sobhesh Kumar Agarwalla & Jayanth R. Varma & Vineet Virmani, 2023. "Harvesting the volatility smile in a large emerging market: A Dynamic Nelson–Siegel approach," Journal of Futures Markets, John Wiley & Sons, Ltd., vol. 43(11), pages 1615-1644, November.
    2. Agam Shah & Sudheer Chava, 2023. "Zero is Not Hero Yet: Benchmarking Zero-Shot Performance of LLMs for Financial Tasks," Papers 2305.16633, arXiv.org.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Simon Fritzsch & Philipp Scharner & Gregor Weiß, 2021. "Estimating the relation between digitalization and the market value of insurers," Journal of Risk & Insurance, The American Risk and Insurance Association, vol. 88(3), pages 529-567, September.
    2. Matthew Gentzkow & Bryan T. Kelly & Matt Taddy, 2017. "Text as Data," NBER Working Papers 23276, National Bureau of Economic Research, Inc.
    3. Andres Algaba & David Ardia & Keven Bluteau & Samuel Borms & Kris Boudt, 2020. "Econometrics Meets Sentiment: An Overview Of Methodology And Applications," Journal of Economic Surveys, Wiley Blackwell, vol. 34(3), pages 512-547, July.
    4. Faccini, Renato & Matin, Rastin & Skiadopoulos, George, 2023. "Dissecting climate risks: Are they reflected in stock prices?," Journal of Banking & Finance, Elsevier, vol. 155(C).
    5. Christina Bannier & Thomas Pauls & Andreas Walter, 2019. "Content analysis of business communication: introducing a German dictionary," Journal of Business Economics, Springer, vol. 89(1), pages 79-123, February.
    6. Li, Ken, 2022. "Textual fundamentals in earnings press releases," Advances in accounting, Elsevier, vol. 57(C).
    7. Jeon, Yoontae & McCurdy, Thomas H. & Zhao, Xiaofei, 2022. "News as sources of jumps in stock returns: Evidence from 21 million news articles for 9000 companies," Journal of Financial Economics, Elsevier, vol. 145(2), pages 1-17.
    8. Tim Loughran & Bill Mcdonald, 2016. "Textual Analysis in Accounting and Finance: A Survey," Journal of Accounting Research, Wiley Blackwell, vol. 54(4), pages 1187-1230, September.
    9. Daniele Ballinari & Simon Behrendt, 2021. "How to gauge investor behavior? A comparison of online investor sentiment measures," Digital Finance, Springer, vol. 3(2), pages 169-204, June.
    10. Alejandro Lopez-Lira & Yuehua Tang, 2023. "Can ChatGPT Forecast Stock Price Movements? Return Predictability and Large Language Models," Papers 2304.07619, arXiv.org, revised Sep 2023.
    11. Paul E. Soto, 2021. "Breaking the Word Bank: Measurement and Effects of Bank Level Uncertainty," Journal of Financial Services Research, Springer;Western Finance Association, vol. 59(1), pages 1-45, April.
    12. Bryan T. Kelly & Asaf Manela & Alan Moreira, 2019. "Text Selection," NBER Working Papers 26517, National Bureau of Economic Research, Inc.
    13. Steven Heston & Nitish R. Sinha, 2016. "News versus Sentiment : Predicting Stock Returns from News Stories," Finance and Economics Discussion Series 2016-048, Board of Governors of the Federal Reserve System (U.S.).
    14. Chris Florakis & Christodoulos Louca & Roni Michaely & Michael Weber, 2020. "Cybersecurity Risk," Working Papers 2020-178, Becker Friedman Institute for Research In Economics.
    15. Miwa, Kotaro, 2023. "Divergent opinions on social media," International Review of Economics & Finance, Elsevier, vol. 86(C), pages 182-196.
    16. Loughran, Tim & McDonald, Bill & Pragidis, Ioannis, 2019. "Assimilation of oil news into prices," International Review of Financial Analysis, Elsevier, vol. 63(C), pages 105-118.
    17. Renault, Thomas, 2017. "Intraday online investor sentiment and return patterns in the U.S. stock market," Journal of Banking & Finance, Elsevier, vol. 84(C), pages 25-40.
    18. Hansen, Stephen & Davis, Steven & Seminario-Amez, Cristhian, 2020. "Firm-level Risk Exposures and Stock Returns in the Wake of COVID-19," CEPR Discussion Papers 15314, C.E.P.R. Discussion Papers.
    19. Buehlmaier, Matthias M. M. & Zechner, Josef, 2016. "Financial media, price discovery, and merger arbitrage," CFS Working Paper Series 551, Center for Financial Studies (CFS).
    20. Kothari, Pratik & Chance, Don M. & Ferris, Stephen P., 2021. "Bragging rights: Does corporate boasting imply value creation?," Journal of Corporate Finance, Elsevier, vol. 67(C).

    More about this item

    Keywords

    Measuring sentiment; Machine learning; Earnings calls; 10-Ks; WSJ;
    All these keywords.

    JEL classification:

    • D82 - Microeconomics - - Information, Knowledge, and Uncertainty - - - Asymmetric and Private Information; Mechanism Design
    • G14 - Financial Economics - - General Financial Markets - - - Information and Market Efficiency; Event Studies; Insider Trading

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:jfinec:v:147:y:2023:i:3:p:525-549. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/inca/505576 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.