
Text Mining-based Economic Activity Estimation

Author

  • Ksenia Yakovleva (Bank of Russia)

Abstract

This paper outlines a methodology for constructing a high-frequency indicator of economic activity in Russia. News stories from internet resources serve as the data source. The news data are analyzed using text mining and machine learning methods, which, although developed relatively recently, have quickly found wide application in scientific research, including economic studies. This is because news is not only a key source of information but also a way to gauge the sentiment of journalists and survey respondents about the current situation and convert it into quantitative data.
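
As a rough illustration of how news sentiment can be converted into quantitative data, the sketch below scores each story against a small tone lexicon and averages the scores by publication date. It is a minimal sketch and not the paper's method: the word lists, the function names `tone` and `daily_index`, and the sample headlines are all hypothetical, and a real application would likely rely on a much larger dictionary or a trained classifier.

```python
from collections import defaultdict
from datetime import date

# Hypothetical tone lexicon (assumption); not taken from the paper.
POSITIVE = {"growth", "recovery", "rise", "expansion"}
NEGATIVE = {"decline", "crisis", "fall", "recession"}


def tone(text: str) -> float:
    """Return (positive - negative) / matched words, or 0.0 if nothing matches."""
    words = [w.strip(".,;:!?\"'()").lower() for w in text.split()]
    pos = sum(w in POSITIVE for w in words)
    neg = sum(w in NEGATIVE for w in words)
    total = pos + neg
    return (pos - neg) / total if total else 0.0


def daily_index(stories):
    """Average story tone by date; `stories` is an iterable of (date, text) pairs."""
    by_day = defaultdict(list)
    for day, text in stories:
        by_day[day].append(tone(text))
    return {day: sum(scores) / len(scores) for day, scores in sorted(by_day.items())}


# Made-up headlines, for illustration only.
sample = [
    (date(2018, 1, 1), "Industrial output shows steady growth and recovery"),
    (date(2018, 1, 1), "Retail sales fall amid currency crisis"),
    (date(2018, 1, 2), "Analysts expect expansion in manufacturing"),
]
index = daily_index(sample)
```

Averaging by day is only one possible aggregation; weighting stories by source or by topic would be an equally plausible choice under different assumptions.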

Suggested Citation

  • Ksenia Yakovleva, 2018. "Text Mining-based Economic Activity Estimation," Russian Journal of Money and Finance, Bank of Russia, vol. 77(4), pages 26-41, December.
  • Handle: RePEc:bkr:journl:v:77:y:2018:i:4:p:26-41
    DOI: 10.31477/rjmf.201804.26

    Download full text from publisher

    File URL: https://rjmf.econs.online/upload/iblock/00a/RJMF_77-04_ENG_Yakovleva.pdf
    Download Restriction: no



    Citations



    Cited by:

    1. Petrova, Diana & Trunin, Pavel, 2020. "Revealing the mood of economic agents based on search queries," Applied Econometrics, Russian Presidential Academy of National Economy and Public Administration (RANEPA), vol. 59, pages 71-87.
    2. Mikhaylov, Dmitry, 2023. "Macroeconomic Forecasting with the Use of News Data," Working Papers w20220250, Russian Presidential Academy of National Economy and Public Administration.
    3. Stankevich, Ivan, 2023. "Application of Markov-Switching MIDAS models to nowcasting of GDP and its components," Applied Econometrics, Russian Presidential Academy of National Economy and Public Administration (RANEPA), vol. 70, pages 122-143.
    4. Oleg Semiturkin & Andrey Shevelev, 2023. "Correct Comparison of Predictive Features of Machine Learning Models: The Case of Forecasting Inflation Rates in Siberia," Russian Journal of Money and Finance, Bank of Russia, vol. 82(1), pages 87-103, March.
    5. Stankevich, Ivan, 2020. "Comparison of macroeconomic indicators nowcasting methods: Russian GDP case," Applied Econometrics, Russian Presidential Academy of National Economy and Public Administration (RANEPA), vol. 59, pages 113-127.
    6. Filipp Ulyankin, 2020. "Forecasting Russian Macroeconomic Indicators Based on Information from News and Search Queries," Russian Journal of Money and Finance, Bank of Russia, vol. 79(4), pages 75-97, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Marc Burri & Daniel Kaufmann, 2020. "A daily fever curve for the Swiss economy," Swiss Journal of Economics and Statistics, Springer;Swiss Society of Economics and Statistics, vol. 156(1), pages 1-11, December.
    2. Dorine Boumans & Henrik Müller & Stefan Sauer, 2022. "How Media Content Influences Economic Expectations: Evidence from a Global Expert Survey," ifo Working Paper Series 380, ifo Institute - Leibniz Institute for Economic Research at the University of Munich.
    3. Tingguo Zheng & Xinyue Fan & Wei Jin & Kuangnan Fang, 2024. "Forecasting CPI with multisource data: The value of media and internet information," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 43(3), pages 702-753, April.
    4. Jon Ellingsen & Vegard H. Larsen & Leif Anders Thorsrud, 2020. "News Media vs. FRED-MD for Macroeconomic Forecasting," CESifo Working Paper Series 8639, CESifo.
    5. Erik Andres-Escayola & Corinna Ghirelli & Luis Molina & Javier J. Pérez & Elena Vidal, 2022. "Using newspapers for textual indicators: which and how many?," Working Papers 2235, Banco de España.
    6. Stolbov, Mikhail & Shchepeleva, Maria & Karminsky, Alexander, 2022. "When central bank research meets Google search: A sentiment index of global financial stress," Journal of International Financial Markets, Institutions and Money, Elsevier, vol. 81(C).
    7. Philip ME Garboden, 2019. "Sources and Types of Big Data for Macroeconomic Forecasting," Working Papers 2019-3, University of Hawaii Economic Research Organization, University of Hawaii at Manoa.
    8. Jon Ellingsen & Vegard H. Larsen & Leif Anders Thorsrud, 2022. "News media versus FRED‐MD for macroeconomic forecasting," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 37(1), pages 63-81, January.
    9. Simionescu, Mihaela, 2022. "Econometrics of sentiments- sentometrics and machine learning: The improvement of inflation predictions in Romania using sentiment analysis," Technological Forecasting and Social Change, Elsevier, vol. 182(C).
    10. Filipp Ulyankin, 2020. "Forecasting Russian Macroeconomic Indicators Based on Information from News and Search Queries," Russian Journal of Money and Finance, Bank of Russia, vol. 79(4), pages 75-97, December.
    11. Barbaglia, Luca & Frattarolo, Lorenzo & Onorante, Luca & Pericoli, Filippo Maria & Ratto, Marco & Tiozzo Pezzoli, Luca, 2023. "Testing big data in a big crisis: Nowcasting under Covid-19," International Journal of Forecasting, Elsevier, vol. 39(4), pages 1548-1563.
    12. Wohlfarth, Paul, 2018. "Measuring the impact of monetary policy attention on global asset volatility using search data," Economics Letters, Elsevier, vol. 173(C), pages 15-18.
    13. repec:hal:spmain:info:hdl:2441/3mgbd73vkp9f9oje7utooe7vpg is not listed on IDEAS
    14. Dorinth van Dijk & Jasper de Winter, 2023. "Nowcasting GDP using tone-adjusted time varying news topics: Evidence from the financial press," Working Papers 766, DNB.
    15. Leif Anders Thorsrud, 2016. "Nowcasting using news topics Big Data versus big bank," Working Papers No 6/2016, Centre for Applied Macro- and Petroleum economics (CAMP), BI Norwegian Business School.
    16. Laura Battaglia & Timothy M. Christensen & Stephen Hansen & Szymon Sacher, 2024. "Inference for regression with variables generated from unstructured data," CeMMAP working papers 10/24, Institute for Fiscal Studies.
    17. Paul Hubert & Fabien Labondance, 2019. "Central bank tone and the dispersion of views within monetary policy committees," Sciences Po publications 2019 – 08, Sciences Po.
    18. Ali Kabiri & Harold James & John Landon-Lane & David Tuckett & Rickard Nyman, 2020. "The Role of Sentiment in the Economy: 1920 to 1934," CESifo Working Paper Series 8336, CESifo.
    19. Luca Barbaglia & Sergio Consoli & Sebastiano Manzan, 2024. "Forecasting GDP in Europe with textual data," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 39(2), pages 338-355, March.
    20. Larsen, Vegard H. & Thorsrud, Leif A., 2019. "The value of news for economic developments," Journal of Econometrics, Elsevier, vol. 210(1), pages 203-218.
    21. Algaba, Andres & Borms, Samuel & Boudt, Kris & Verbeken, Brecht, 2023. "Daily news sentiment and monthly surveys: A mixed-frequency dynamic factor model for nowcasting consumer confidence," International Journal of Forecasting, Elsevier, vol. 39(1), pages 266-278.

    More about this item

    Keywords

    economic activity estimates; nowcasting; text mining; machine learning; Big Data; data mining; topic modelling; sentiment analysis;

    JEL classification:

    • C51 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Model Construction and Estimation
    • C81 - Mathematical and Quantitative Methods - - Data Collection and Data Estimation Methodology; Computer Programs - - - Methodology for Collecting, Estimating, and Organizing Microeconomic Data; Data Access
    • E37 - Macroeconomics and Monetary Economics - - Prices, Business Fluctuations, and Cycles - - - Forecasting and Simulation: Models and Applications
