IDEAS home Printed from https://ideas.repec.org/a/ibn/ijefaa/v16y2024i6p92.html
   My bibliography  Save this article

Improvement in Inflation Forecasting: Ensembling Text Mining with Macro Data in Machine Learning Models

Author

Listed:
  • Pijush Kanti Das
  • Prabir Kumar Das

Abstract

We forecast inflation using a large news corpus and machine learning methods. Over 3.9 million daily newspaper headlines from January 2001 to June, 2023 are decomposed into monthly time series and integrated with machine learning models to predict inflation. The addition of Text mining in models outperformed the numerical predictions based on the machine learning models without text mining as published by the authors earlier in Das and Das (2024). In addition, the variable importance while analyzing the predictors provides further insights into new variables came out from text mining for which structured data was not available earlier. A dictionary of words sentimental to inflation forecasting has been prepared possibly for the first time. The forecasting model that used text words sentimental to inflation as additional inputs in artificial neural network performed better than all the other models in terms of forecast accuracy. Overall, we provide a novel representation of improvements in adding text mining in machine learning models in inflation forecasting.

Suggested Citation

  • Pijush Kanti Das & Prabir Kumar Das, 2024. "Improvement in Inflation Forecasting: Ensembling Text Mining with Macro Data in Machine Learning Models," International Journal of Economics and Finance, Canadian Center of Science and Education, vol. 16(6), pages 1-92, June.
  • Handle: RePEc:ibn:ijefaa:v:16:y:2024:i:6:p:92
    as

    Download full text from publisher

    File URL: https://ccsenet.org/journal/index.php/ijef/article/download/0/0/50265/54408
    Download Restriction: no

    File URL: https://ccsenet.org/journal/index.php/ijef/article/view/0/50265
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. D’Amuri, Francesco & Marcucci, Juri, 2017. "The predictive power of Google searches in forecasting US unemployment," International Journal of Forecasting, Elsevier, vol. 33(4), pages 801-816.
    2. Jan J. J. Groen & Richard Paap & Francesco Ravazzolo, 2013. "Real-Time Inflation Forecasting in a Changing World," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 31(1), pages 29-44, January.
    3. Eleni Kalamara & Arthur Turrell & Chris Redl & George Kapetanios & Sujit Kapadia, 2020. "Making text count: economic forecasting using newspaper text," Bank of England working papers 865, Bank of England.
    4. Olivier Coibion & Yuriy Gorodnichenko, 2015. "Information Rigidity and the Expectations Formation Process: A Simple Framework and New Facts," American Economic Review, American Economic Association, vol. 105(8), pages 2644-2678, August.
    5. Stock, J.H. & Watson, M.W., 2016. "Dynamic Factor Models, Factor-Augmented Vector Autoregressions, and Structural Vector Autoregressions in Macroeconomics," Handbook of Macroeconomics, in: J. B. Taylor & Harald Uhlig (ed.), Handbook of Macroeconomics, edition 1, volume 2, chapter 0, pages 415-525, Elsevier.
    6. Eleni Kalamara & Arthur Turrell & Chris Redl & George Kapetanios & Sujit Kapadia, 2022. "Making text count: Economic forecasting using newspaper text," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 37(5), pages 896-919, August.
    7. Amrendra Pandey & Jagadish Shettigar & Amarnath Bose, 2021. "Evaluation of the Inflation Forecasting Process of the Reserve Bank of India: A Text Analysis Approach," SAGE Open, , vol. 11(3), pages 21582440211, July.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Mikhaylov, Dmitry, 2023. "Macroeconomic Forecasting with the Use of News Data," Working Papers w20220250, Russian Presidential Academy of National Economy and Public Administration.
    2. Francesco Cusano & Giuseppe Marinelli & Stefano Piermattei, 2022. "Learning from revisions: an algorithm to detect errors in banks’ balance sheet statistical reporting," Quality & Quantity: International Journal of Methodology, Springer, vol. 56(6), pages 4025-4059, December.
    3. Kohns, David & Bhattacharjee, Arnab, 2023. "Nowcasting growth using Google Trends data: A Bayesian Structural Time Series model," International Journal of Forecasting, Elsevier, vol. 39(3), pages 1384-1412.
    4. Sebastian Doerr & Leonardo Gambacorta & José María Serena Garralda, 2021. "Big data and machine learning in central banking," BIS Working Papers 930, Bank for International Settlements.
    5. Petropoulos, Fotios & Apiletti, Daniele & Assimakopoulos, Vassilios & Babai, Mohamed Zied & Barrow, Devon K. & Ben Taieb, Souhaib & Bergmeir, Christoph & Bessa, Ricardo J. & Bijak, Jakub & Boylan, Joh, 2022. "Forecasting: theory and practice," International Journal of Forecasting, Elsevier, vol. 38(3), pages 705-871.
      • Fotios Petropoulos & Daniele Apiletti & Vassilios Assimakopoulos & Mohamed Zied Babai & Devon K. Barrow & Souhaib Ben Taieb & Christoph Bergmeir & Ricardo J. Bessa & Jakub Bijak & John E. Boylan & Jet, 2020. "Forecasting: theory and practice," Papers 2012.03854, arXiv.org, revised Jan 2022.
    6. Hongcheng Ding & Xuanze Zhao & Zixiao Jiang & Shamsul Nahar Abdullah & Deshinta Arrova Dewi, 2024. "EUR-USD Exchange Rate Forecasting Based on Information Fusion with Large Language Models and Deep Learning Methods," Papers 2408.13214, arXiv.org.
    7. Luca Barbaglia & Sergio Consoli & Sebastiano Manzan, 2024. "Forecasting GDP in Europe with textual data," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 39(2), pages 338-355, March.
    8. Martin Baumgaertner & Johannes Zahner, 2021. "Whatever it takes to understand a central banker - Embedding their words using neural networks," MAGKS Papers on Economics 202130, Philipps-Universität Marburg, Faculty of Business Administration and Economics, Department of Economics (Volkswirtschaftliche Abteilung).
    9. Berge, Travis J., 2018. "Understanding survey-based inflation expectations," International Journal of Forecasting, Elsevier, vol. 34(4), pages 788-801.
    10. Massimiliano Marcellino & Dalibor Stevanovic, 2022. "The demand and supply of information about inflation," Working Papers 22-06, Chair in macroeconomics and forecasting, University of Quebec in Montreal's School of Management, revised Nov 2022.
    11. Jonathan Benchimol & Lahcen Bounader & Mario Dotta, 2025. "Estimating Behavioral Inattention," CAMA Working Papers 2025-34, Centre for Applied Macroeconomic Analysis, Crawford School of Public Policy, The Australian National University.
    12. Leonardo N. Ferreira, 2021. "Forecasting with VAR-teXt and DFM-teXt Models:exploring the predictive power of central bank communication," Working Papers Series 559, Central Bank of Brazil, Research Department.
    13. Massimo Ferrari Minesso & Laura Lebastard & Helena Mezo, 2023. "Text-Based Recession Probabilities," IMF Economic Review, Palgrave Macmillan;International Monetary Fund, vol. 71(2), pages 415-438, June.
    14. Jon Ellingsen & Vegard H. Larsen & Leif Anders Thorsrud, 2020. "News Media vs. FRED-MD for Macroeconomic Forecasting," CESifo Working Paper Series 8639, CESifo.
    15. Erik Andres-Escayola & Corinna Ghirelli & Luis Molina & Javier J. Pérez & Elena Vidal, 2022. "Using newspapers for textual indicators: which and how many?," Working Papers 2235, Banco de España.
    16. Stolbov, Mikhail & Shchepeleva, Maria & Karminsky, Alexander, 2022. "When central bank research meets Google search: A sentiment index of global financial stress," Journal of International Financial Markets, Institutions and Money, Elsevier, vol. 81(C).
    17. Aprigliano, Valentina & Emiliozzi, Simone & Guaitoli, Gabriele & Luciani, Andrea & Marcucci, Juri & Monteforte, Libero, 2023. "The power of text-based indicators in forecasting Italian economic activity," International Journal of Forecasting, Elsevier, vol. 39(2), pages 791-808.
    18. Adämmer, Philipp & Prüser, Jan & Schüssler, Rainer A., 2025. "Forecasting macroeconomic tail risk in real time: Do textual data add value?," International Journal of Forecasting, Elsevier, vol. 41(1), pages 307-320.
    19. Jon Ellingsen & Vegard H. Larsen & Leif Anders Thorsrud, 2022. "News media versus FRED‐MD for macroeconomic forecasting," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 37(1), pages 63-81, January.
    20. Vegard H ghaug Larsen & Leif Anders Thorsrud, 2018. "Business cycle narratives," Working Papers No 6/2018, Centre for Applied Macro- and Petroleum economics (CAMP), BI Norwegian Business School.

    More about this item

    JEL classification:

    • R00 - Urban, Rural, Regional, Real Estate, and Transportation Economics - - General - - - General
    • Z0 - Other Special Topics - - General

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:ibn:ijefaa:v:16:y:2024:i:6:p:92. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Canadian Center of Science and Education (email available below). General contact details of provider: https://edirc.repec.org/data/cepflch.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.