IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2405.10449.html

Optimal Text-Based Time-Series Indices

Author

Listed:
  • David Ardia
  • Keven Bluteau

Abstract

We propose an approach to construct text-based time-series indices in an optimal way--typically, indices that maximize the contemporaneous relation or the predictive performance with respect to a target variable, such as inflation. We illustrate our methodology with a corpus of news articles from the Wall Street Journal by optimizing text-based indices focusing on tracking the VIX index and inflation expectations. Our results highlight the superior performance of our approach compared to existing indices.

Suggested Citation

  • David Ardia & Keven Bluteau, 2024. "Optimal Text-Based Time-Series Indices," Papers 2405.10449, arXiv.org.
  • Handle: RePEc:arx:papers:2405.10449
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2405.10449
    File Function: Latest version
    Download Restriction: no
    ---><---

    Other versions of this item:

    References listed on IDEAS

    as
    1. Dario Caldara & Matteo Iacoviello, 2022. "Measuring Geopolitical Risk," American Economic Review, American Economic Association, vol. 112(4), pages 1194-1225, April.
    2. Eugster, Patrick & Uhl, Matthias W., 2024. "Forecasting inflation using sentiment," Economics Letters, Elsevier, vol. 236(C).
    3. Luca Barbaglia & Sergio Consoli & Sebastiano Manzan, 2023. "Forecasting with Economic News," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 41(3), pages 708-719, July.
    4. Massimiliano Marcellino & Dalibor Stevanovic, 2022. "The demand and supply of information about inflation," CIRANO Working Papers 2022s-27, CIRANO.
    5. Andres Algaba & David Ardia & Keven Bluteau & Samuel Borms & Kris Boudt, 2020. "Econometrics Meets Sentiment: An Overview Of Methodology And Applications," Journal of Economic Surveys, Wiley Blackwell, vol. 34(3), pages 512-547, July.
    6. Scrucca, Luca, 2013. "GA: A Package for Genetic Algorithms in R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 53(i04).
    7. Baker, Scott R. & Bloom, Nicholas & Davis, Steven J. & Kost, Kyle, 2026. "Policy news and stock market volatility," Journal of Financial Economics, Elsevier, vol. 175(C).
    8. Ardia, David & Bluteau, Keven & Boudt, Kris, 2022. "Media abnormal tone, earnings announcements, and the stock market," Journal of Financial Markets, Elsevier, vol. 61(C).
    9. Uhlig, Harald, 2005. "What are the effects of monetary policy on output? Results from an agnostic identification procedure," Journal of Monetary Economics, Elsevier, vol. 52(2), pages 381-419, March.
    10. Ardia, David & Bluteau, Keven & Kassem, Alaa, 2021. "A century of Economic Policy Uncertainty through the French–Canadian lens," Economics Letters, Elsevier, vol. 205(C).
    11. Stephen Hansen & Michael McMahon & Andrea Prat, 2018. "Transparency and Deliberation Within the FOMC: A Computational Linguistics Approach," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 133(2), pages 801-870.
    12. Diebold, Francis X & Mariano, Roberto S, 2002. "Comparing Predictive Accuracy," Journal of Business & Economic Statistics, American Statistical Association, vol. 20(1), pages 134-144, January.
    13. Tim Loughran & Bill Mcdonald, 2014. "Measuring Readability in Financial Disclosures," Journal of Finance, American Finance Association, vol. 69(4), pages 1643-1671, August.
    14. Clark, Todd E. & McCracken, Michael W., 2001. "Tests of equal forecast accuracy and encompassing for nested models," Journal of Econometrics, Elsevier, vol. 105(1), pages 85-110, November.
    15. Damjan Pfajfar & Emiliano Santoro, 2013. "News on Inflation and the Epidemiology of Inflation Expectations," Journal of Money, Credit and Banking, Blackwell Publishing, vol. 45(6), pages 1045-1067, September.
    16. Tim Loughran & Bill Mcdonald, 2011. "When Is a Liability Not a Liability? Textual Analysis, Dictionaries, and 10‐Ks," Journal of Finance, American Finance Association, vol. 66(1), pages 35-65, February.
    17. Andrews, Donald W K, 1991. "Heteroskedasticity and Autocorrelation Consistent Covariance Matrix Estimation," Econometrica, Econometric Society, vol. 59(3), pages 817-858, May.
    18. Kyle Handley & Nuno Limão, 2022. "Trade Policy Uncertainty," Annual Review of Economics, Annual Reviews, vol. 14(1), pages 363-395, August.
    19. Husted, Lucas & Rogers, John & Sun, Bo, 2020. "Monetary policy uncertainty," Journal of Monetary Economics, Elsevier, vol. 115(C), pages 20-36.
    20. David Ardia & Keven Bluteau & Mohammad‐Abbas Meghani, 2024. "Thirty years of academic finance," Journal of Economic Surveys, Wiley Blackwell, vol. 38(3), pages 1008-1042, July.
    21. Jegadeesh, Narasimhan & Wu, Di, 2013. "Word power: A new approach for content analysis," Journal of Financial Economics, Elsevier, vol. 110(3), pages 712-729.
    22. Kim, Ji-Hyun, 2009. "Estimating classification error rate: Repeated cross-validation, repeated hold-out and bootstrap," Computational Statistics & Data Analysis, Elsevier, vol. 53(11), pages 3735-3745, September.
    23. David Ardia & Keven Bluteau & Kris Boudt & Koen Inghelbrecht, 2023. "Climate Change Concerns and the Performance of Green vs. Brown Stocks," Management Science, INFORMS, vol. 69(12), pages 7607-7632, December.
    24. Andrews, Donald W K & Monahan, J Christopher, 1992. "An Improved Heteroskedasticity and Autocorrelation Consistent Covariance Matrix Estimator," Econometrica, Econometric Society, vol. 60(4), pages 953-966, July.
    25. Scott R. Baker & Nicholas Bloom & Steven J. Davis, 2016. "Measuring Economic Policy Uncertainty," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 131(4), pages 1593-1636.
    26. Manela, Asaf & Moreira, Alan, 2017. "News implied volatility and disaster concerns," Journal of Financial Economics, Elsevier, vol. 123(1), pages 137-162.
    27. Christina D. Romer & David H. Romer, 2023. "Presidential Address: Does Monetary Policy Matter? The Narrative Approach after 35 Years," American Economic Review, American Economic Association, vol. 113(6), pages 1395-1423, June.
    28. Canova, Fabio & Nicolo, Gianni De, 2002. "Monetary disturbances matter for business fluctuations in the G-7," Journal of Monetary Economics, Elsevier, vol. 49(6), pages 1131-1159, September.
    29. Ardia, David & Bluteau, Keven & Boudt, Kris, 2019. "Questioning the news about economic growth: Sparse forecasting using thousands of news-based sentiment values," International Journal of Forecasting, Elsevier, vol. 35(4), pages 1370-1386.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Karsten Müller, 2022. "German forecasters’ narratives: How informative are German business cycle forecast reports?," Empirical Economics, Springer, vol. 62(5), pages 2373-2415, May.
    2. Eleni Kalamara & Arthur Turrell & Chris Redl & George Kapetanios & Sujit Kapadia, 2022. "Making text count: Economic forecasting using newspaper text," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 37(5), pages 896-919, August.
    3. Andres Algaba & David Ardia & Keven Bluteau & Samuel Borms & Kris Boudt, 2020. "Econometrics Meets Sentiment: An Overview Of Methodology And Applications," Journal of Economic Surveys, Wiley Blackwell, vol. 34(3), pages 512-547, July.
    4. Ardia, David & Bluteau, Keven & Boudt, Kris, 2019. "Questioning the news about economic growth: Sparse forecasting using thousands of news-based sentiment values," International Journal of Forecasting, Elsevier, vol. 35(4), pages 1370-1386.
    5. Julian Ashwin & Eleni Kalamara & Lorena Saiz, 2024. "Nowcasting Euro area GDP with news sentiment: A tale of two crises," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 39(5), pages 887-905, August.
    6. Kirtac, Kemal & Germano, Guido, 2024. "Sentiment trading with large language models," Finance Research Letters, Elsevier, vol. 62(PB).
    7. Nonejad, Nima, 2022. "Predicting equity premium out-of-sample by conditioning on newspaper-based uncertainty measures: A comparative study," International Review of Financial Analysis, Elsevier, vol. 83(C).
    8. Murad, S.M. Woahid & Pathan, Shams & Durand, Robert B. & Zheng, Chen, 2025. "Understanding Bank-Level Uncertainty: New insights into banking activity and its macroeconomic impacts," International Review of Economics & Finance, Elsevier, vol. 102(C).
    9. García, Diego & Hu, Xiaowen & Rohrer, Maximilian, 2023. "The colour of finance words," Journal of Financial Economics, Elsevier, vol. 147(3), pages 525-549.
    10. Stolbov, Mikhail & Shchepeleva, Maria & Karminsky, Alexander, 2022. "When central bank research meets Google search: A sentiment index of global financial stress," Journal of International Financial Markets, Institutions and Money, Elsevier, vol. 81(C).
    11. Aprigliano, Valentina & Emiliozzi, Simone & Guaitoli, Gabriele & Luciani, Andrea & Marcucci, Juri & Monteforte, Libero, 2023. "The power of text-based indicators in forecasting Italian economic activity," International Journal of Forecasting, Elsevier, vol. 39(2), pages 791-808.
    12. Hubert, Paul & Labondance, Fabien, 2021. "The signaling effects of central bank tone," European Economic Review, Elsevier, vol. 133(C).
    13. Fadda, Pietro & Hanifi, Rayane & Istrefi, Klodiana & Penalver, Adrian, 2025. "Central bank communication of uncertainty," Journal of International Money and Finance, Elsevier, vol. 157(C).
    14. Massimo Ferrari Minesso & Frederik Kurcz & Maria Sole Pagliari, 2022. "Do words hurt more than actions? The impact of trade tensions on financial markets," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 37(6), pages 1138-1159, September.
    15. Shovon Sengupta & Sunny Kumar Singh & Tanujit Chakraborty, 2025. "Macroeconomic Forecasting for the G7 countries under Uncertainty Shocks," Papers 2510.23347, arXiv.org.
    16. Jiang, Fuwei & Liu, Hongkui & Yu, Jiasheng & Zhang, Huajing, 2023. "International stock return predictability: The role of U.S. uncertainty spillover," Pacific-Basin Finance Journal, Elsevier, vol. 82(C).
    17. Dorinth van Dijk & Jasper de Winter, 2023. "Nowcasting GDP using tone-adjusted time varying news topics: Evidence from the financial press," Working Papers 766, DNB.
    18. Himounet, Nicolas, 2022. "Searching the nature of uncertainty: Macroeconomic and financial risks VS geopolitical and pandemic risks," International Economics, Elsevier, vol. 170(C), pages 1-31.
    19. Li, Shuaibing & Ma, Yong, 2025. "News-based equity market uncertainty aligned: An informative predictor for gold market volatility," Journal of Commodity Markets, Elsevier, vol. 40(C).
    20. Tengfei Zhang, 2020. "Manager Uncertainty and Cross-Sectional Stock Returns," 2020 Papers pzh934, Job Market Papers.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2405.10449. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.