IDEAS home Printed from https://ideas.repec.org/a/spr/empeco/v55y2018i2d10.1007_s00181-017-1286-6.html
   My bibliography  Save this article

Forecasting with large datasets: compressing information before, during or after the estimation?

Author

Listed:
  • Inske Pirschel

    (Swiss National Bank)

  • Maik H. Wolters

    (University of Jena
    Kiel Institute for the World Economy
    Institute for Monetary and Financial Stability at Goethe University Frankfurt)

Abstract

We study the forecasting performance of three alternative large data forecasting approaches. These three approaches handle the dimensionality problem evoked by a large dataset by compressing its informational content, yet at different stages of the forecasting process. We consider different factor models, a large Bayesian vector autoregression and model averaging techniques, where the data compression takes place before, during and after the estimation of the respective forecasting models. We use a quarterly dataset for Germany that consists of 123 variables and find that overall the large Bayesian vector autoregression and the Bayesian factor augmented vector autoregression provide the most precise forecasts for a set of 11 core macroeconomic variables. Further, we find that the performance of these two models is very robust to the exact specification of the forecasting model.

Suggested Citation

  • Inske Pirschel & Maik H. Wolters, 2018. "Forecasting with large datasets: compressing information before, during or after the estimation?," Empirical Economics, Springer, vol. 55(2), pages 573-596, September.
  • Handle: RePEc:spr:empeco:v:55:y:2018:i:2:d:10.1007_s00181-017-1286-6
    DOI: 10.1007/s00181-017-1286-6
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s00181-017-1286-6
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s00181-017-1286-6?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Marta Banbura & Domenico Giannone & Lucrezia Reichlin, 2010. "Large Bayesian vector auto regressions," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 25(1), pages 71-92.
    2. Forni, Mario & Hallin, Marc & Lippi, Marco & Reichlin, Lucrezia, 2005. "The Generalized Dynamic Factor Model: One-Sided Estimation and Forecasting," Journal of the American Statistical Association, American Statistical Association, vol. 100, pages 830-840, September.
    3. Forni, Mario & Hallin, Marc & Lippi, Marco & Reichlin, Lucrezia, 2003. "Do financial variables help forecasting inflation and real activity in the euro area?," Journal of Monetary Economics, Elsevier, vol. 50(6), pages 1243-1255, September.
    4. Klaus Wohlrabe & Teresa Buchen, 2014. "Assessing the Macroeconomic Forecasting Performance of Boosting: Evidence for the United States, the Euro Area and Germany," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 33(4), pages 231-242, July.
    5. Jacob A. Mincer & Victor Zarnowitz, 1969. "The Evaluation of Economic Forecasts," NBER Chapters, in: Economic Forecasts and Expectations: Analysis of Forecasting Behavior and Performance, pages 3-46, National Bureau of Economic Research, Inc.
    6. Giannone, Domenico & Reichlin, Lucrezia & Small, David, 2008. "Nowcasting: The real-time informational content of macroeconomic data," Journal of Monetary Economics, Elsevier, vol. 55(4), pages 665-676, May.
    7. Raffaella Giacomini & Halbert White, 2006. "Tests of Conditional Predictive Ability," Econometrica, Econometric Society, vol. 74(6), pages 1545-1578, November.
    8. Christoffersen, Peter F & Diebold, Francis X, 1998. "Cointegration and Long-Horizon Forecasting," Journal of Business & Economic Statistics, American Statistical Association, vol. 16(4), pages 450-458, October.
    9. Dimitris Korobilis, 2013. "Var Forecasting Using Bayesian Variable Selection," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 28(2), pages 204-230, March.
    10. Jushan Bai & Serena Ng, 2002. "Determining the Number of Factors in Approximate Factor Models," Econometrica, Econometric Society, vol. 70(1), pages 191-221, January.
    11. Negro, Marco Del & Schorfheide, Frank, 2013. "DSGE Model-Based Forecasting," Handbook of Economic Forecasting, in: G. Elliott & C. Granger & A. Timmermann (ed.), Handbook of Economic Forecasting, edition 1, volume 2, chapter 0, pages 57-140, Elsevier.
    12. Christian Schumacher, 2007. "Forecasting German GDP using alternative factor models based on large datasets," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 26(4), pages 271-302.
    13. Litterman, Robert B, 1986. "Forecasting with Bayesian Vector Autoregressions-Five Years of Experience," Journal of Business & Economic Statistics, American Statistical Association, vol. 4(1), pages 25-38, January.
    14. Bai, Jushan & Ng, Serena, 2007. "Determining the Number of Primitive Shocks in Factor Models," Journal of Business & Economic Statistics, American Statistical Association, vol. 25, pages 52-60, January.
    15. Hans Christian Müller-Dröge & Tara M. Sinclair & H.O. Stekler, 2014. "Evaluating Forecasts of a Vector of Variables: a German Forecasting Competition," CAMA Working Papers 2014-55, Centre for Applied Macroeconomic Analysis, Crawford School of Public Policy, The Australian National University.
    16. Kholodilin Konstantin Arkadievich & Siliverstovs Boriss, 2006. "On the Forecasting Properties of the Alternative Leading Indicators for the German GDP: Recent Evidence," Journal of Economics and Statistics (Jahrbuecher fuer Nationaloekonomie und Statistik), De Gruyter, vol. 226(3), pages 234-259, June.
    17. Dreger, Christian & Schumacher, Christian, 2002. "Estimating Large-Scale Factor Models for Economic Activity in Germany: Do They Outperform Simpler Models?," Discussion Paper Series 26321, Hamburg Institute of International Economics.
    18. Bernanke, Ben S. & Boivin, Jean, 2003. "Monetary policy in a data-rich environment," Journal of Monetary Economics, Elsevier, vol. 50(3), pages 525-546, April.
    19. Pirschel, Inske & Wolters, Maik, 2014. "Forecasting German key macroeconomic variables using large dataset methods," VfS Annual Conference 2014 (Hamburg): Evidence-based Economic Policy 100587, Verein für Socialpolitik / German Economic Association.
    20. Marta Banbura & Domenico Giannone & Lucrezia Reichlin, 2010. "Large Bayesian vector auto regressions," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 25(1), pages 71-92.
    21. De Mol, Christine & Giannone, Domenico & Reichlin, Lucrezia, 2008. "Forecasting using a large number of predictors: Is Bayesian shrinkage a valid alternative to principal components?," Journal of Econometrics, Elsevier, vol. 146(2), pages 318-328, October.
    22. Maik H. Wolters, 2015. "Evaluating Point and Density Forecasts of DSGE Models," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 30(1), pages 74-96, January.
    23. Wolters, Maik H., 2011. "Forecasting under Model Uncertainty," VfS Annual Conference 2011 (Frankfurt, Main): The Order of the World Economy - Lessons from the Crisis 48723, Verein für Socialpolitik / German Economic Association.
    24. Schumacher, Christian, 2010. "Factor forecasting using international targeted predictors: The case of German GDP," Economics Letters, Elsevier, vol. 107(2), pages 95-98, May.
    25. Mario Forni & Marc Hallin & Marco Lippi & Lucrezia Reichlin, 2000. "The Generalized Dynamic-Factor Model: Identification And Estimation," The Review of Economics and Statistics, MIT Press, vol. 82(4), pages 540-554, November.
    26. Sims, Christopher A & Zha, Tao, 1998. "Bayesian Methods for Dynamic Multivariate Models," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 39(4), pages 949-968, November.
    27. Marie Diron, 2008. "Short-term forecasts of euro area real GDP growth: an assessment of real-time performance based on vintage data," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 27(5), pages 371-390.
    28. Kadiyala, K Rao & Karlsson, Sune, 1997. "Numerical Methods for Estimation and Inference in Bayesian VAR-Models," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 12(2), pages 99-132, March-Apr.
    29. Vladimir Kuzin & Massimiliano Marcellino & Christian Schumacher, 2013. "Pooling Versus Model Selection For Nowcasting Gdp With Many Predictors: Empirical Evidence For Six Industrialized Countries," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 28(3), pages 392-411, April.
    30. Bai, Jushan & Ng, Serena, 2008. "Forecasting economic time series using targeted predictors," Journal of Econometrics, Elsevier, vol. 146(2), pages 304-317, October.
    31. Domenico Giannone & Michele Lenza & Giorgio E. Primiceri, 2015. "Prior Selection for Vector Autoregressions," The Review of Economics and Statistics, MIT Press, vol. 97(2), pages 436-451, May.
    32. Faust, Jon & Wright, Jonathan H., 2009. "Comparing Greenbook and Reduced Form Forecasts Using a Large Realtime Dataset," Journal of Business & Economic Statistics, American Statistical Association, vol. 27(4), pages 468-479.
    33. Andrea Carriero & George Kapetanios & Massimiliano Marcellino, 2011. "Forecasting large datasets with Bayesian reduced rank multivariate models," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 26(5), pages 735-761, August.
    34. Steffen Henzel & Sebastian Rast, 2013. "Forecasting Properties of Indicators for Predicting GDP Growth in Germany," ifo Schnelldienst, ifo Institute - Leibniz Institute for Economic Research at the University of Munich, vol. 66(17), pages 39-46, September.
    35. James H. Stock & Mark W.Watson, 2003. "Forecasting Output and Inflation: The Role of Asset Prices," Journal of Economic Literature, American Economic Association, vol. 41(3), pages 788-829, September.
    36. Croushore, Dean, 2006. "Forecasting with Real-Time Macroeconomic Data," Handbook of Economic Forecasting, in: G. Elliott & C. Granger & A. Timmermann (ed.), Handbook of Economic Forecasting, edition 1, volume 1, chapter 17, pages 961-982, Elsevier.
    37. Klaus Abberger, 2007. "Forecasting Quarter-on-Quarter Changes of German GDP with Monthly Business Tendency Survey Results," ifo Working Paper Series 40, ifo Institute - Leibniz Institute for Economic Research at the University of Munich.
    38. G. Elliott & C. Granger & A. Timmermann (ed.), 2013. "Handbook of Economic Forecasting," Handbook of Economic Forecasting, Elsevier, edition 1, volume 2, number 2.
    39. Domenico Giannone & Lucrezia Reichlin & David Small, 2008. "Nowcasting: the real time informational content of macroeconomic data releases," ULB Institutional Repository 2013/6409, ULB -- Universite Libre de Bruxelles.
    40. Ben S. Bernanke & Jean Boivin & Piotr Eliasz, 2005. "Measuring the Effects of Monetary Policy: A Factor-Augmented Vector Autoregressive (FAVAR) Approach," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 120(1), pages 387-422.
    41. G. Elliott & C. Granger & A. Timmermann (ed.), 2006. "Handbook of Economic Forecasting," Handbook of Economic Forecasting, Elsevier, edition 1, volume 1, number 1.
    42. Schumacher Christian, 2011. "Forecasting with Factor Models Estimated on Large Datasets: A Review of the Recent Literature and Evidence for German GDP," Journal of Economics and Statistics (Jahrbuecher fuer Nationaloekonomie und Statistik), De Gruyter, vol. 231(1), pages 28-49, February.
    43. Mark W. Watson & James H. Stock, 2004. "Combination forecasts of output growth in a seven-country data set," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 23(6), pages 405-430.
    44. Domenico Giannone & Lucrezia Reichlin & David H. Small, 2005. "Nowcasting GDP and inflation: the real-time informational content of macroeconomic data releases," Finance and Economics Discussion Series 2005-42, Board of Governors of the Federal Reserve System (U.S.).
    45. Christian Dreger & Christian Schumacher, 2005. "Out-of-sample Performance of Leading Indicators for the German Business Cycle: Single vs. Combined Forecasts," Journal of Business Cycle Measurement and Analysis, OECD Publishing, Centre for International Research on Economic Tendency Surveys, vol. 2005(1), pages 71-87.
    46. Drechsel, Katja & Scheufele, Rolf, 2012. "The performance of short-term forecasts of the German economy before and during the 2008/2009 recession," International Journal of Forecasting, Elsevier, vol. 28(2), pages 428-445.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Carstensen, Kai & Heinrich, Markus & Reif, Magnus & Wolters, Maik H., 2020. "Predicting ordinary and severe recessions with a three-state Markov-switching dynamic factor model," International Journal of Forecasting, Elsevier, vol. 36(3), pages 829-850.
    2. Faria, Gonçalo & Verona, Fabio, 2023. "Forecast combination in the frequency domain," Bank of Finland Research Discussion Papers 1/2023, Bank of Finland.
    3. Konstantin Kuck & Karsten Schweikert, 2021. "Forecasting Baden‐Württemberg's GDP growth: MIDAS regressions versus dynamic mixed‐frequency factor models," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 40(5), pages 861-882, August.
    4. Aneta Ejsmont & Magdalena Majchrzak & Jacek Grzywacz, 2021. "Competitive Advantage in Co-operation of Enterprises Using Kaizen Costing Concept in Times of Extraordinary Dangers," European Research Studies Journal, European Research Studies Journal, vol. 0(Special 4), pages 377-396.
    5. Magnus Reif, 2020. "Macroeconomics, Nonlinearities, and the Business Cycle," ifo Beiträge zur Wirtschaftsforschung, ifo Institute - Leibniz Institute for Economic Research at the University of Munich, number 87.
    6. Eraslan, Sercan & Schröder, Maximilian, 2019. "Nowcasting GDP with a large factor model space," Discussion Papers 41/2019, Deutsche Bundesbank.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Pirschel, Inske & Wolters, Maik, 2014. "Forecasting German key macroeconomic variables using large dataset methods," VfS Annual Conference 2014 (Hamburg): Evidence-based Economic Policy 100587, Verein für Socialpolitik / German Economic Association.
    2. Magnus Reif, 2020. "Macroeconomics, Nonlinearities, and the Business Cycle," ifo Beiträge zur Wirtschaftsforschung, ifo Institute - Leibniz Institute for Economic Research at the University of Munich, number 87.
    3. Karim Barhoumi & Olivier Darné & Laurent Ferrara, 2014. "Dynamic factor models: A review of the literature," OECD Journal: Journal of Business Cycle Measurement and Analysis, OECD Publishing, Centre for International Research on Economic Tendency Surveys, vol. 2013(2), pages 73-107.
    4. Bańbura, Marta & Giannone, Domenico & Modugno, Michele & Reichlin, Lucrezia, 2013. "Now-Casting and the Real-Time Data Flow," Handbook of Economic Forecasting, in: G. Elliott & C. Granger & A. Timmermann (ed.), Handbook of Economic Forecasting, edition 1, volume 2, chapter 0, pages 195-237, Elsevier.
    5. Alessandro Girardi & Roberto Golinelli & Carmine Pappalardo, 2017. "The role of indicator selection in nowcasting euro-area GDP in pseudo-real time," Empirical Economics, Springer, vol. 53(1), pages 79-99, August.
    6. Marta Banbura & Domenico Giannone & Lucrezia Reichlin, 2010. "Large Bayesian vector auto regressions," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 25(1), pages 71-92.
    7. Liebermann, Joelle, 2012. "Real-time forecasting in a data-rich environment," MPRA Paper 39452, University Library of Munich, Germany.
    8. Andrea Carriero & Todd E. Clark & Massimiliano Marcellino, 2015. "Realtime nowcasting with a Bayesian mixed frequency model with stochastic volatility," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 178(4), pages 837-862, October.
    9. Berg, Tim O. & Henzel, Steffen R., 2015. "Point and density forecasts for the euro area using Bayesian VARs," International Journal of Forecasting, Elsevier, vol. 31(4), pages 1067-1095.
    10. Karlsson, Sune, 2013. "Forecasting with Bayesian Vector Autoregression," Handbook of Economic Forecasting, in: G. Elliott & C. Granger & A. Timmermann (ed.), Handbook of Economic Forecasting, edition 1, volume 2, chapter 0, pages 791-897, Elsevier.
    11. Robert Lehmann, 2016. "Economic Growth and Business Cycle Forecasting at the Regional Level," ifo Beiträge zur Wirtschaftsforschung, ifo Institute - Leibniz Institute for Economic Research at the University of Munich, number 65.
    12. Marta Banbura & Domenico Giannone & Lucrezia Reichlin, 2010. "Large Bayesian vector auto regressions," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 25(1), pages 71-92.
    13. Norman R. Swanson & Weiqi Xiong, 2018. "Big data analytics in economics: What have we learned so far, and where should we go from here?," Canadian Journal of Economics/Revue canadienne d'économique, John Wiley & Sons, vol. 51(3), pages 695-746, August.
    14. Matteo Luciani & Lorenzo Ricci, 2014. "Nowcasting Norway," International Journal of Central Banking, International Journal of Central Banking, vol. 10(4), pages 215-248, December.
    15. Gupta, Rangan & Kabundi, Alain & Miller, Stephen M., 2011. "Forecasting the US real house price index: Structural and non-structural models with and without fundamentals," Economic Modelling, Elsevier, vol. 28(4), pages 2013-2021, July.
    16. Vladimir Kuzin & Massimiliano Marcellino & Christian Schumacher, 2009. "Pooling versus Model Selection for Nowcasting with Many Predictors: An Application to German GDP," Economics Working Papers ECO2009/13, European University Institute.
    17. Katja Heinisch & Rolf Scheufele, 2018. "Bottom-up or direct? Forecasting German GDP in a data-rich environment," Empirical Economics, Springer, vol. 54(2), pages 705-745, March.
    18. Barbara Rossi, 2019. "Forecasting in the presence of instabilities: How do we know whether models predict well and how to improve them," Economics Working Papers 1711, Department of Economics and Business, Universitat Pompeu Fabra, revised Jul 2021.
    19. Bekiros Stelios & Paccagnini Alessia, 2015. "Estimating point and density forecasts for the US economy with a factor-augmented vector autoregressive DSGE model," Studies in Nonlinear Dynamics & Econometrics, De Gruyter, vol. 19(2), pages 107-136, April.
    20. Banbura, Marta & Rünstler, Gerhard, 2011. "A look into the factor model black box: Publication lags and the role of hard and soft data in forecasting GDP," International Journal of Forecasting, Elsevier, vol. 27(2), pages 333-346, April.

    More about this item

    Keywords

    Large Bayesian VAR; Model averaging; Factor models; Great Recession; Ifo business climate index;
    All these keywords.

    JEL classification:

    • C53 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Forecasting and Prediction Models; Simulation Methods
    • C55 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Large Data Sets: Modeling and Analysis
    • E31 - Macroeconomics and Monetary Economics - - Prices, Business Fluctuations, and Cycles - - - Price Level; Inflation; Deflation
    • E32 - Macroeconomics and Monetary Economics - - Prices, Business Fluctuations, and Cycles - - - Business Fluctuations; Cycles
    • E37 - Macroeconomics and Monetary Economics - - Prices, Business Fluctuations, and Cycles - - - Forecasting and Simulation: Models and Applications
    • E47 - Macroeconomics and Monetary Economics - - Money and Interest Rates - - - Forecasting and Simulation: Models and Applications

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:empeco:v:55:y:2018:i:2:d:10.1007_s00181-017-1286-6. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.