IDEAS home Printed from https://ideas.repec.org/a/bkr/journl/v77y2018i4p26-41.html
   My bibliography  Save this article

Text Mining-based Economic Activity Estimation

Author

Listed:
  • Ksenia Yakovleva

    (Bank of Russia)

Abstract

This paper outlines a methodology for constructing a high-frequency indicator of economic activity in Russia. News stories from internet resources are used as data sources. News data is analyzed using text mining and machine learning methods, which, although developed relatively recently, have quickly found wide application in scientific research, including economic studies. This is because news is not only a key source of information but a way to gauge the sentiment of journalists and survey respondents about the current situation and convert it into quantitative data.

Suggested Citation

  • Ksenia Yakovleva, 2018. "Text Mining-based Economic Activity Estimation," Russian Journal of Money and Finance, Bank of Russia, vol. 77(4), pages 26-41, December.
  • Handle: RePEc:bkr:journl:v:77:y:2018:i:4:p:26-41
    DOI: 10.31477/rjmf.201804.26
    as

    Download full text from publisher

    File URL: https://rjmf.econs.online/upload/iblock/00a/RJMF_77-04_ENG_Yakovleva.pdf
    Download Restriction: no

    File URL: https://libkey.io/10.31477/rjmf.201804.26?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Dirk Ulbricht & Konstantin A. Kholodilin & Tobias Thomas, 2017. "Do Media Data Help to Predict German Industrial Production?," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 36(5), pages 483-496, August.
    2. Scott R. Baker & Nicholas Bloom & Steven J. Davis, 2016. "Measuring Economic Policy Uncertainty," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 131(4), pages 1593-1636.
    3. Leif Anders Thorsrud, 2020. "Words are the New Numbers: A Newsy Coincident Index of the Business Cycle," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 38(2), pages 393-409, April.
    4. Ardia, David & Bluteau, Keven & Boudt, Kris, 2019. "Questioning the news about economic growth: Sparse forecasting using thousands of news-based sentiment values," International Journal of Forecasting, Elsevier, vol. 35(4), pages 1370-1386.
    5. Hyunyoung Choi & Hal Varian, 2012. "Predicting the Present with Google Trends," The Economic Record, The Economic Society of Australia, vol. 88(s1), pages 2-9, June.
    6. Galbraith, John W. & Tkacz, Greg, 2015. "Nowcasting GDP with electronic payments data," Statistics Paper Series 10, European Central Bank.
    7. I. Goloshchapova & M. Andreev., 2017. "Measuring inflation expectations of the Russian population with the help of machine learning," VOPROSY ECONOMIKI, N.P. Redaktsiya zhurnala "Voprosy Economiki", vol. 6.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Filipp Ulyankin, 2020. "Forecasting Russian Macroeconomic Indicators Based on Information from News and Search Queries," Russian Journal of Money and Finance, Bank of Russia, vol. 79(4), pages 75-97, December.
    2. Petrova, Diana & Trunin, Pavel, 2020. "Revealing the mood of economic agents based on search queries," Applied Econometrics, Russian Presidential Academy of National Economy and Public Administration (RANEPA), vol. 59, pages 71-87.
    3. Mikhaylov, Dmitry, 2023. "Macroeconomic Forecasting with the Use of News Data," Working Papers w20220250, Russian Presidential Academy of National Economy and Public Administration.
    4. Stankevich, Ivan, 2023. "Application of Markov-Switching MIDAS models to nowcasting of GDP and its components," Applied Econometrics, Russian Presidential Academy of National Economy and Public Administration (RANEPA), vol. 70, pages 122-143.
    5. Oleg Semiturkin & Andrey Shevelev, 2023. "Correct Comparison of Predictive Features of Machine Learning Models: The Case of Forecasting Inflation Rates in Siberia," Russian Journal of Money and Finance, Bank of Russia, vol. 82(1), pages 87-103, March.
    6. Stankevich, Ivan, 2020. "Comparison of macroeconomic indicators nowcasting methods: Russian GDP case," Applied Econometrics, Russian Presidential Academy of National Economy and Public Administration (RANEPA), vol. 59, pages 113-127.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Marc Burri & Daniel Kaufmann, 2020. "A daily fever curve for the Swiss economy," Swiss Journal of Economics and Statistics, Springer;Swiss Society of Economics and Statistics, vol. 156(1), pages 1-11, December.
    2. Dorine Boumans & Henrik Müller & Stefan Sauer, 2022. "How Media Content Influences Economic Expectations: Evidence from a Global Expert Survey," ifo Working Paper Series 380, ifo Institute - Leibniz Institute for Economic Research at the University of Munich.
    3. Jon Ellingsen & Vegard H. Larsen & Leif Anders Thorsrud, 2020. "News Media vs. FRED-MD for Macroeconomic Forecasting," CESifo Working Paper Series 8639, CESifo.
    4. Erik Andres-Escayola & Corinna Ghirelli & Luis Molina & Javier J. Pérez & Elena Vidal, 2022. "Using newspapers for textual indicators: which and how many?," Working Papers 2235, Banco de España.
    5. Stolbov, Mikhail & Shchepeleva, Maria & Karminsky, Alexander, 2022. "When central bank research meets Google search: A sentiment index of global financial stress," Journal of International Financial Markets, Institutions and Money, Elsevier, vol. 81(C).
    6. Philip ME Garboden, 2019. "Sources and Types of Big Data for Macroeconomic Forecasting," Working Papers 2019-3, University of Hawaii Economic Research Organization, University of Hawaii at Manoa.
    7. Jon Ellingsen & Vegard H. Larsen & Leif Anders Thorsrud, 2022. "News media versus FRED‐MD for macroeconomic forecasting," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 37(1), pages 63-81, January.
    8. Barbaglia, Luca & Frattarolo, Lorenzo & Onorante, Luca & Pericoli, Filippo Maria & Ratto, Marco & Tiozzo Pezzoli, Luca, 2023. "Testing big data in a big crisis: Nowcasting under Covid-19," International Journal of Forecasting, Elsevier, vol. 39(4), pages 1548-1563.
    9. Simionescu, Mihaela, 2022. "Econometrics of sentiments- sentometrics and machine learning: The improvement of inflation predictions in Romania using sentiment analysis," Technological Forecasting and Social Change, Elsevier, vol. 182(C).
    10. Filipp Ulyankin, 2020. "Forecasting Russian Macroeconomic Indicators Based on Information from News and Search Queries," Russian Journal of Money and Finance, Bank of Russia, vol. 79(4), pages 75-97, December.
    11. repec:hal:spmain:info:hdl:2441/3mgbd73vkp9f9oje7utooe7vpg is not listed on IDEAS
    12. Leif Anders Thorsrud, 2016. "Nowcasting using news topics Big Data versus big bank," Working Papers No 6/2016, Centre for Applied Macro- and Petroleum economics (CAMP), BI Norwegian Business School.
    13. Yuting Chen & Don Bredin & Valerio Potì & Roman Matkovskyy, 2022. "COVID risk narratives: a computational linguistic approach to the econometric identification of narrative risk during a pandemic," Digital Finance, Springer, vol. 4(1), pages 17-61, March.
    14. Youngjoon Lee & Soohyon Kim & Ki Young Park, 2018. "Deciphering Monetary Policy Committee Minutes with Text Mining Approach: A Case of South Korea," Working papers 2018rwp-132, Yonsei University, Yonsei Economics Research Institute.
    15. Beckmann, Joscha & Jannsen, Nils, 2023. "Mediendaten für die Konjunkturanalyse," Kiel Insight 2023.11, Kiel Institute for the World Economy (IfW Kiel).
    16. Mikhaylov, Dmitry, 2023. "Macroeconomic Forecasting with the Use of News Data," Working Papers w20220250, Russian Presidential Academy of National Economy and Public Administration.
    17. Paul Hubert & Fabien Labondance, 2019. "Central bank tone and the dispersion of views within monetary policy committees," Sciences Po publications 2019 – 08, Sciences Po.
    18. Massimo Ferrari Minesso & Laura Lebastard & Helena Mezo, 2023. "Text-Based Recession Probabilities," IMF Economic Review, Palgrave Macmillan;International Monetary Fund, vol. 71(2), pages 415-438, June.
    19. Bai, Xiwen & Lam, Jasmine Siu Lee & Jakher, Astha, 2021. "Shipping sentiment and the dry bulk shipping freight market: New evidence from newspaper coverage," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 155(C).
    20. Chi, Tsung-Li & Liu, Hung-Tsen & Chang, Chia-Chien, 2023. "Hedging performance using google Trends–Evidence from the indian forex options market," International Review of Economics & Finance, Elsevier, vol. 85(C), pages 107-123.
    21. David Kohns & Arnab Bhattacharjee, 2020. "Nowcasting Growth using Google Trends Data: A Bayesian Structural Time Series Model," Papers 2011.00938, arXiv.org, revised May 2022.

    More about this item

    Keywords

    economic activity estimates; nowcasting; text mining; machine learning; Big Data; data mining; topic modelling; sentiment analysis;
    All these keywords.

    JEL classification:

    • C51 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Model Construction and Estimation
    • C81 - Mathematical and Quantitative Methods - - Data Collection and Data Estimation Methodology; Computer Programs - - - Methodology for Collecting, Estimating, and Organizing Microeconomic Data; Data Access
    • E37 - Macroeconomics and Monetary Economics - - Prices, Business Fluctuations, and Cycles - - - Forecasting and Simulation: Models and Applications

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bkr:journl:v:77:y:2018:i:4:p:26-41. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Olga Kuvshinova (email available below). General contact details of provider: https://edirc.repec.org/data/cbrgvru.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.