IDEAS home Printed from https://ideas.repec.org/p/kyo/wpaper/1082.html
   My bibliography  Save this paper

Efficient Market Hypothesis Test with Stock Tweets and Natural Language Processing Models

Author

Listed:
  • Bolin Mao

    (Kyoto Institute of Economic Research, Kyoto University)

  • Chenhui Chu

    (Graduate School of Informatics, Kyoto University)

  • Yuta Nakashima

    (Institute for Datability Science, Osaka University)

  • Hajime Nagahara

    (Institute for Datability Science, Osaka University)

Abstract

The efficient market hypothesis (EMH) plays a fundamental role in modern financial theory. Previous empirical studies have tested the weak and semi-strong forms of EMH with typical financial data, such as historical stock prices and annual earnings. However, few tests have been extended to include alternative data such as tweets. In this study, we use 1) two stock tweet datasets that have different features and 2) nine natural language processing (NLP)-based deep learning models to test the semi-strong form EMH in the United States stock market. None of our experimental results show that stock tweets with NLP-based models can prominently improve the daily stock price prediction accuracy compared with random guesses. Our experiment provides evidence that the semi-strong form of EMH holds in the United States stock market on a daily basis when considering stock tweet information with the NLP-based models.

Suggested Citation

  • Bolin Mao & Chenhui Chu & Yuta Nakashima & Hajime Nagahara, 2022. "Efficient Market Hypothesis Test with Stock Tweets and Natural Language Processing Models," KIER Working Papers 1082, Kyoto University, Institute of Economic Research.
  • Handle: RePEc:kyo:wpaper:1082
    as

    Download full text from publisher

    File URL: https://www.kier.kyoto-u.ac.jp/wp/wp-content/uploads/2022/09/DP1082.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. John Y. Campbell, Robert J. Shiller, 1988. "The Dividend-Price Ratio and Expectations of Future Dividends and Discount Factors," The Review of Financial Studies, Society for Financial Studies, vol. 1(3), pages 195-228.
    2. Shihao Gu & Bryan Kelly & Dacheng Xiu, 2020. "Empirical Asset Pricing via Machine Learning," The Review of Financial Studies, Society for Financial Studies, vol. 33(5), pages 2223-2273.
    3. Ball, R & Brown, P, 1968. "Empirical Evaluation Of Accounting Income Numbers," Journal of Accounting Research, Wiley Blackwell, vol. 6(2), pages 159-178.
    4. Fama, Eugene F, 1970. "Efficient Capital Markets: A Review of Theory and Empirical Work," Journal of Finance, American Finance Association, vol. 25(2), pages 383-417, May.
    5. Jegadeesh, Narasimhan & Titman, Sheridan, 1993. "Returns to Buying Winners and Selling Losers: Implications for Stock Market Efficiency," Journal of Finance, American Finance Association, vol. 48(1), pages 65-91, March.
    6. Shihao Gu & Bryan Kelly & Dacheng Xiu, 2020. "Empirical Asset Pricing via Machine Learning," Review of Finance, European Finance Association, vol. 33(5), pages 2223-2273.
    7. Fama, Eugene F, et al, 1969. "The Adjustment of Stock Prices to New Information," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 10(1), pages 1-21, February.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jiang, Hao & Li, Sophia Zhengzi & Wang, Hao, 2021. "Pervasive underreaction: Evidence from high-frequency data," Journal of Financial Economics, Elsevier, vol. 141(2), pages 573-599.
    2. Daniel, Kent & Hirshleifer, David & Teoh, Siew Hong, 2002. "Investor psychology in capital markets: evidence and policy implications," Journal of Monetary Economics, Elsevier, vol. 49(1), pages 139-209, January.
    3. Godfrey, Keith R.L., 2017. "Toward a model-free measure of market efficiency," Pacific-Basin Finance Journal, Elsevier, vol. 44(C), pages 97-112.
    4. Doron Avramov & Guy Kaplanski & Avanidhar Subrahmanyam, 2022. "Postfundamentals Price Drift in Capital Markets: A Regression Regularization Perspective," Management Science, INFORMS, vol. 68(10), pages 7658-7681, October.
    5. Shariq Ahmad Bhat, 2018. "Informational efficiency of sovereign bond markets of India and China: evidence from Toda and Yamamoto Granger causality (1995)," DECISION: Official Journal of the Indian Institute of Management Calcutta, Springer;Indian Institute of Management Calcutta, vol. 45(4), pages 313-323, December.
    6. Na, Haejung & Kim, Soonho, 2021. "Predicting stock prices based on informed traders’ activities using deep neural networks," Economics Letters, Elsevier, vol. 204(C).
    7. Rácz, Dávid Andor & Huszár, Gergely, 2019. "The Effects of Earnings Surprises in Quarterly Reports on S&P 500 Components," Public Finance Quarterly, Corvinus University of Budapest, vol. 64(2), pages 239-259.
    8. Fernando Rubio, 2005. "Eficiencia De Mercado, Administracion De Carteras De Fondos Y Behavioural Finance," Finance 0503028, University Library of Munich, Germany, revised 23 Jul 2005.
    9. I. Marta Miranda García & María‐Jesús Segovia‐Vargas & Usue Mori & José A. Lozano, 2023. "Early prediction of Ibex 35 movements," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 42(5), pages 1150-1166, August.
    10. Asiya Sohail & Attiya Yasmin Javid, 2014. "The Global Financial Crisis and Investors’ Behaviour; Evidence from the Karachi Stock Exchange," PIDE-Working Papers 2014:106, Pakistan Institute of Development Economics.
    11. Committee, Nobel Prize, 2013. "Understanding Asset Prices," Nobel Prize in Economics documents 2013-1, Nobel Prize Committee.
    12. Antonio Marsi, 2023. "Predicting European stock returns using machine learning," SN Business & Economics, Springer, vol. 3(7), pages 1-25, July.
    13. Azevedo, Vitor, 2023. "Analysts’ underreaction and momentum strategies," Journal of Economic Dynamics and Control, Elsevier, vol. 146(C).
    14. Thanh Huong Nguyen, 2019. "Information and Noise in Stock Markets: Evidence on the Determinants and Effects Using New Empirical Measures," PhD Thesis, Finance Discipline Group, UTS Business School, University of Technology, Sydney, number 7-2019.
    15. Marquardt, Philipp & Noussair, Charles N & Weber, Martin, 2019. "Rational expectations in an experimental asset market with shocks to market trends," European Economic Review, Elsevier, vol. 114(C), pages 116-140.
    16. Dionysia Dionysiou, 2015. "Choosing Among Alternative Long-Run Event-Study Techniques," Journal of Economic Surveys, Wiley Blackwell, vol. 29(1), pages 158-198, February.
    17. Alexander S. Sangare, 2005. "Efficience des marchés : un siècle après Bachelier," Revue d'Économie Financière, Programme National Persée, vol. 81(4), pages 107-132.
    18. Laura Alfaro & Anusha Chari & Andrew N. Greenland & Peter K. Schott, 2020. "Aggregate and Firm-Level Stock Returns During Pandemics, in Real Time," NBER Working Papers 26950, National Bureau of Economic Research, Inc.
    19. Taufiq Choudhry & Ranadeva Jayasekera, 2015. "Level of efficiency in the UK equity market: empirical study of the effects of the global financial crisis," Review of Quantitative Finance and Accounting, Springer, vol. 44(2), pages 213-242, February.
    20. Meredith Beechey & David Gruen & James Vickery, 2000. "The Efficient Market Hypothesis: A Survey," RBA Research Discussion Papers rdp2000-01, Reserve Bank of Australia.

    More about this item

    Keywords

    Efficient Market Hypothesis Test; Daily Stock Price Prediction; Stock Tweet; Natural Language Processing;
    All these keywords.

    JEL classification:

    • C4 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods: Special Topics
    • C5 - Mathematical and Quantitative Methods - - Econometric Modeling
    • G1 - Financial Economics - - General Financial Markets

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:kyo:wpaper:1082. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Makoto Watanabe (email available below). General contact details of provider: https://edirc.repec.org/data/iekyojp.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.