IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0260132.html
   My bibliography  Save this article

Predicting standardized absolute returns using rolling-sample textual modelling

Author

Listed:
  • Ka Kit Tang
  • Ka Ching Li
  • Mike K P So

Abstract

Understanding how textual information impacts financial market volatility has been one of the growing topics in financial econometric research. In this paper, we aim to examine the relationship between the volatility measure that is extracted from GARCH modelling and textual news information both publicly available and from subscription, and the performances of the two datasets are compared. We utilize a latent Dirichlet allocation method to capture the dynamic features of the textual data overtime by summarizing their statistical outputs, such as topic distributions in documents and word distributions in topics. In addition, we transform various measures representing the popularity and diversity of topics to form predictors for a rolling regression model to assess the usefulness of textual information. The proposed method captures the statistical properties of textual information over different time periods and its performance is evaluated in an out-of-sample analysis. Our results show that the topic measures are more useful for predicting our volatility proxy, the unexplained variance from the GARCH model than the simple moving average. The finding indicates that our method is helpful in extracting significant textual information to improve the prediction of stock market volatility.

Suggested Citation

  • Ka Kit Tang & Ka Ching Li & Mike K P So, 2021. "Predicting standardized absolute returns using rolling-sample textual modelling," PLOS ONE, Public Library of Science, vol. 16(12), pages 1-28, December.
  • Handle: RePEc:plo:pone00:0260132
    DOI: 10.1371/journal.pone.0260132
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0260132
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0260132&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0260132?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Takayuki Morimoto & Yoshinori Kawasaki, 2017. "Forecasting Financial Market Volatility Using a Dynamic Topic Model," Asia-Pacific Financial Markets, Springer;Japanese Association of Financial Economics and Engineering, vol. 24(3), pages 149-167, September.
    2. Calomiris, Charles W. & Mamaysky, Harry, 2019. "How news and its context drive risk and returns around the world," Journal of Financial Economics, Elsevier, vol. 133(2), pages 299-336.
    3. Ryohei Hisano & Didier Sornette & Takayuki Mizuno & Takaaki Ohnishi & Tsutomu Watanabe, 2013. "High Quality Topic Extraction from Business News Explains Abnormal Financial Market Volatility," PLOS ONE, Public Library of Science, vol. 8(6), pages 1-12, June.
    4. Tim Loughran & Bill Mcdonald, 2011. "When Is a Liability Not a Liability? Textual Analysis, Dictionaries, and 10‐Ks," Journal of Finance, American Finance Association, vol. 66(1), pages 35-65, February.
    5. Ryohei Hisano & Didier Sornette & Takayuki Mizuno & Takaaki Ohnishi & Tsutomu Watanabe, 2012. "High quality topic extraction from business news explains abnormal financial market volatility," Papers 1210.6321, arXiv.org, revised Mar 2013.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Shinya Kawata & Yoshi Fujiwara, 2016. "Constructing of network from topics and their temporal change in the Nikkei newspaper articles," Evolutionary and Institutional Economics Review, Springer, vol. 13(2), pages 423-436, December.
    2. Nida Çakır Melek & Charles W. Calomiris & Harry Mamaysky, 2020. "Mining for Oil Forecasts," Research Working Paper RWP 20-20, Federal Reserve Bank of Kansas City.
    3. Obaid, Khaled & Pukthuanthong, Kuntara, 2022. "A picture is worth a thousand words: Measuring investor sentiment by combining machine learning and photos from news," Journal of Financial Economics, Elsevier, vol. 144(1), pages 273-297.
    4. Fraiberger, Samuel P. & Lee, Do & Puy, Damien & Ranciere, Romain, 2021. "Media sentiment and international asset prices," Journal of International Economics, Elsevier, vol. 133(C).
    5. Alejandro Bernales & Marcela Valenzuela & Ilknur Zer, 2023. "Effects of Information Overload on Financial Markets: How Much Is Too Much?," International Finance Discussion Papers 1372, Board of Governors of the Federal Reserve System (U.S.).
    6. T. T. Chen & B. Zheng & Y. Li & X. F. Jiang, 2017. "New approaches in agent-based modeling of complex financial systems," Papers 1703.06840, arXiv.org.
    7. Ali Kabiri & Harold James & John Landon‐Lane & David Tuckett & Rickard Nyman, 2023. "The role of sentiment in the US economy: 1920 to 1934," Economic History Review, Economic History Society, vol. 76(1), pages 3-30, February.
    8. Kirtac, Kemal & Germano, Guido, 2024. "Sentiment trading with large language models," LSE Research Online Documents on Economics 122592, London School of Economics and Political Science, LSE Library.
    9. Boubaker, Sabri & Liu, Zhenya & Zhai, Ling, 2021. "Big data, news diversity and financial market crash," Technological Forecasting and Social Change, Elsevier, vol. 168(C).
    10. Möller, Rouven & Reichmann, Doron, 2023. "COVID-19 related TV news and stock returns: Evidence from major US TV stations," The Quarterly Review of Economics and Finance, Elsevier, vol. 87(C), pages 95-109.
    11. Ali Kabiri & Harold James & John Landon-Lane & David Tuckett & Rickard Nyman, 2020. "The Role of Sentiment in the Economy: 1920 to 1934," CESifo Working Paper Series 8336, CESifo.
    12. Thomas J Hwang, 2013. "Stock Market Returns and Clinical Trial Results of Investigational Compounds: An Event Study Analysis of Large Biopharmaceutical Companies," PLOS ONE, Public Library of Science, vol. 8(8), pages 1-8, August.
    13. David Lenz & Peter Winker, 2020. "Measuring the diffusion of innovations with paragraph vector topic models," PLOS ONE, Public Library of Science, vol. 15(1), pages 1-18, January.
    14. Andreas Dibiasi & David Iselin, 2021. "Measuring Knightian uncertainty," Empirical Economics, Springer, vol. 61(4), pages 2113-2141, October.
    15. Yoshifumi Tahira & Takayuki Mizuno, 2016. "Trading strategy of a stock index based on the frequency of news releases for listed companies," Evolutionary and Institutional Economics Review, Springer, vol. 13(2), pages 437-444, December.
    16. Chen, Ting-Ting & Zheng, Bo & Li, Yan & Jiang, Xiong-Fei, 2018. "Information driving force and its application in agent-based modeling," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 496(C), pages 593-601.
    17. Paul E. Soto, 2021. "Breaking the Word Bank: Measurement and Effects of Bank Level Uncertainty," Journal of Financial Services Research, Springer;Western Finance Association, vol. 59(1), pages 1-45, April.
    18. Sharpe, Steven A. & Sinha, Nitish R. & Hollrah, Christopher A., 2023. "The power of narrative sentiment in economic forecasts," International Journal of Forecasting, Elsevier, vol. 39(3), pages 1097-1121.
    19. Daniel Borup & Jorge Wolfgang Hansen & Benjamin Dybro Liengaard & Erik Christian Montes Schütte, 2023. "Quantifying investor narratives and their role during COVID‐19," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 38(4), pages 512-532, June.
    20. Poza, Carlos & Monge, Manuel, 2020. "A real time leading economic indicator based on text mining for the Spanish economy. Fractional cointegration VAR and Continuous Wavelet Transform analysis," International Economics, Elsevier, vol. 163(C), pages 163-175.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0260132. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.