IDEAS home Printed from https://ideas.repec.org/p/ime/imedps/26-e-07.html

Forecasting Recessions Using Machine Learning on Text Data and Mixed-Frequency Predictors

Author

Listed:
  • Yusuke Oh

    (Deputy Director, Institute for Monetary and Economic Studies, Bank of Japan (E-mail: yuusuke.ou@boj.or.jp))

  • Mototsugu Shintani

    (The University of Tokyo (E-mail: shintani@e.u-tokyo.ac.jp))

Abstract

We forecast Japanese recessions by integrating machine learning methods, mixed-frequency data, and text-based indicators within an unrestricted mixed data sampling (U-MIDAS) framework. The model combines monthly macroeconomic variables with weekly financial indicators and newspaper-based text indicators. A pseudo-real-time forecasting exercise over three decades shows that machine learning models consistently outperform traditional logit benchmarks. The model confidence set (MCS) suggests horizon dependence: Text indicators are more informative at short horizons, while financial variables are more informative at longer horizons. To improve interpretability, we apply sparse principal component analysis (Sparse PCA) to the text indicators and identify three economic narratives: 'Corporate Distress,' 'Financial Distress,' and 'Deflationary Pressure.' Furthermore, SHAP (SHapley Additive exPlanations) analysis indicates that different recession episodes are associated with different combinations of these narratives, underscoring the heterogeneous nature of economic downturns.

Suggested Citation

  • Yusuke Oh & Mototsugu Shintani, 2026. "Forecasting Recessions Using Machine Learning on Text Data and Mixed-Frequency Predictors," IMES Discussion Paper Series 26-E-07, Institute for Monetary and Economic Studies, Bank of Japan.
  • Handle: RePEc:ime:imedps:26-e-07
    as

    Download full text from publisher

    File URL: https://www.imes.boj.or.jp/research/papers/english/26-E-07.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Okimoto, Tatsuyoshi & Takaoka, Sumiko, 2017. "The term structure of credit spreads and business cycle in Japan," Journal of the Japanese and International Economies, Elsevier, vol. 45(C), pages 27-36.
    2. Bernard, Henri & Gerlach, Stefan, 1998. "Does the Term Structure Predict Recessions? The International Evidence," International Journal of Finance & Economics, John Wiley & Sons, Ltd., vol. 3(3), pages 195-215, July.
    3. Watanabe, Toshiaki, 2003. "Measuring Business Cycle Turning Points in Japan with a Dynamic Markov Switching Factor Model," Monetary and Economic Studies, Institute for Monetary and Economic Studies, Bank of Japan, vol. 21(1), pages 35-68, February.
    4. Vrontos, Spyridon D. & Galakis, John & Vrontos, Ioannis D., 2021. "Modeling and predicting U.S. recessions using machine learning techniques," International Journal of Forecasting, Elsevier, vol. 37(2), pages 647-671.
    5. Scott R. Baker & Nicholas Bloom & Steven J. Davis, 2016. "Measuring Economic Policy Uncertainty," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 131(4), pages 1593-1636.
    6. Peter R. Hansen & Asger Lunde & James M. Nason, 2011. "The Model Confidence Set," Econometrica, Econometric Society, vol. 79(2), pages 453-497, March.
    7. Claudia Foroni & Massimiliano Marcellino & Christian Schumacher, 2015. "Unrestricted mixed data sampling (MIDAS): MIDAS regressions with unrestricted lag polynomials," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 178(1), pages 57-82, January.
    8. Ghysels, Eric & Santa-Clara, Pedro & Valkanov, Rossen, 2005. "There is a risk-return trade-off after all," Journal of Financial Economics, Elsevier, vol. 76(3), pages 509-548, June.
    9. Ana Beatriz Galvão & Michael Owyang, 2022. "Forecasting low‐frequency macroeconomic events with high‐frequency data," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 37(7), pages 1314-1333, November.
    10. Borio, Claudio & Drehmann, Mathias & Xia, Fan Dora, 2020. "Forecasting recessions: the importance of the financial cycle," Journal of Macroeconomics, Elsevier, vol. 66(C).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Degiannakis, Stavros & Filis, George, 2018. "Forecasting oil prices: High-frequency financial data are indeed useful," Energy Economics, Elsevier, vol. 76(C), pages 388-402.
    2. Alain Hecq & Marie Ternes & Ines Wilms, 2025. "Hierarchical Regularizers for Reverse Unrestricted Mixed Data Sampling Regressions," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 44(6), pages 1946-1968, September.
    3. Dhaene, Geert & Wu, Jianbin, 2020. "Incorporating overnight and intraday returns into multivariate GARCH volatility models," Journal of Econometrics, Elsevier, vol. 217(2), pages 471-495.
    4. Degiannakis, Stavros & Filis, George, 2017. "Forecasting oil prices," MPRA Paper 77531, University Library of Munich, Germany.
    5. Li, Dongxin & Zhang, Li & Li, Lihong, 2023. "Forecasting stock volatility with economic policy uncertainty: A smooth transition GARCH-MIDAS model," International Review of Financial Analysis, Elsevier, vol. 88(C).
    6. Zheng, Tingguo & Fan, Xinyue & Jin, Wei & Fang, Kuangnan, 2024. "Words or numbers? Macroeconomic nowcasting with textual and macroeconomic data," International Journal of Forecasting, Elsevier, vol. 40(2), pages 746-761.
    7. Marcellino, Massimiliano & Foroni, Claudia & Casarin, Roberto & Ravazzolo, Francesco, 2017. "Uncertainty Through the Lenses of A Mixed-Frequency Bayesian Panel Markov Switching Model," CEPR Discussion Papers 12339, Centre for Economic Policy Research.
    8. Borup, Daniel & Rapach, David E. & Schütte, Erik Christian Montes, 2023. "Mixed-frequency machine learning: Nowcasting and backcasting weekly initial claims with daily internet search volume data," International Journal of Forecasting, Elsevier, vol. 39(3), pages 1122-1144.
    9. Roberto Casarin & Claudia Foroni & Massimiliano Marcellino & Francesco Ravazzolo, 2016. "Uncertainty Through the Lenses of A Mixed-Frequency Bayesian Panel Markov Switching Model," Working Papers 585, IGIER (Innocenzo Gasparini Institute for Economic Research), Bocconi University.
    10. Daniel Borup & David E. Rapach & Erik Christian Montes Schütte, 2021. "Now- and Backcasting Initial Claims with High-Dimensional Daily Internet Search-Volume Data," CREATES Research Papers 2021-02, Department of Economics and Business Economics, Aarhus University.
    11. Jan Radovan & Igor Masten, 2025. "Nowcasting economic activity in a small open CESEE economy using mixed frequency data," Empirica, Springer;Austrian Institute for Economic Research;Austrian Economic Association, vol. 52(4), pages 721-776, November.
    12. Philippe Goulet Coulombe & Maxime Leroux & Dalibor Stevanovic & Stéphane Surprenant, 2022. "How is machine learning useful for macroeconomic forecasting?," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 37(5), pages 920-964, August.
    13. Denisa Banulescu-Radu & Christophe Hurlin & Bertrand Candelon & Sébastien Laurent, 2016. "Do We Need High Frequency Data to Forecast Variances?," Annals of Economics and Statistics, GENES, issue 123-124, pages 135-174.
    14. Bandi, Federico M. & Bretscher, Lorenzo & Tamoni, Andrea, 2023. "Return predictability with endogenous growth," Journal of Financial Economics, Elsevier, vol. 150(3).
    15. Mei, Dexiang & Ma, Feng & Liao, Yin & Wang, Lu, 2020. "Geopolitical risk uncertainty and oil future volatility: Evidence from MIDAS models," Energy Economics, Elsevier, vol. 86(C).
    16. Chao Liang & Yaojie Zhang & Xiafei Li & Feng Ma, 2022. "Which predictor is more predictive for Bitcoin volatility? And why?," International Journal of Finance & Economics, John Wiley & Sons, Ltd., vol. 27(2), pages 1947-1961, April.
    17. Bellotti, Anthony & Brigo, Damiano & Gambetti, Paolo & Vrins, Frédéric, 2021. "Forecasting recovery rates on non-performing loans with machine learning," International Journal of Forecasting, Elsevier, vol. 37(1), pages 428-444.
    18. Luca Barbaglia & Sergio Consoli & Sebastiano Manzan, 2024. "Forecasting GDP in Europe with textual data," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 39(2), pages 338-355, March.
    19. Santiago Etchegaray Alvarez, 2022. "Proyecciones macroeconómicas con datos en frecuencias mixtas. Modelos ADL-MIDAS, U-MIDAS y TF-MIDAS con aplicaciones para Uruguay," Documentos de trabajo 2022004, Banco Central del Uruguay.
    20. Leila Hedhili Zaier & Khaled Mokni & Ahdi Noomen Ajmi, 2024. "Causality relationships between climate policy uncertainty, renewable energy stocks, and oil prices: a mixed-frequency causality analysis," Future Business Journal, Springer, vol. 10(1), pages 1-11, December.

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;

    JEL classification:

    • C32 - Mathematical and Quantitative Methods - - Multiple or Simultaneous Equation Models; Multiple Variables - - - Time-Series Models; Dynamic Quantile Regressions; Dynamic Treatment Effect Models; Diffusion Processes; State Space Models
    • C53 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Forecasting and Prediction Models; Simulation Methods
    • E37 - Macroeconomics and Monetary Economics - - Prices, Business Fluctuations, and Cycles - - - Forecasting and Simulation: Models and Applications
    • O53 - Economic Development, Innovation, Technological Change, and Growth - - Economywide Country Studies - - - Asia including Middle East

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:ime:imedps:26-e-07. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Kinken (email available below). General contact details of provider: https://edirc.repec.org/data/imegvjp.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.