IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2008.01714.html
   My bibliography  Save this paper

Macroeconomic Data Transformations Matter

Author

Listed:
  • Philippe Goulet Coulombe
  • Maxime Leroux
  • Dalibor Stevanovic
  • St'ephane Surprenant

Abstract

In a low-dimensional linear regression setup, considering linear transformations/combinations of predictors does not alter predictions. However, when the forecasting technology either uses shrinkage or is nonlinear, it does. This is precisely the fabric of the machine learning (ML) macroeconomic forecasting environment. Pre-processing of the data translates to an alteration of the regularization -- explicit or implicit -- embedded in ML algorithms. We review old transformations and propose new ones, then empirically evaluate their merits in a substantial pseudo-out-sample exercise. It is found that traditional factors should almost always be included as predictors and moving average rotations of the data can provide important gains for various forecasting targets. Also, we note that while predicting directly the average growth rate is equivalent to averaging separate horizon forecasts when using OLS-based techniques, the latter can substantially improve on the former when regularization and/or nonparametric nonlinearities are involved.

Suggested Citation

  • Philippe Goulet Coulombe & Maxime Leroux & Dalibor Stevanovic & St'ephane Surprenant, 2020. "Macroeconomic Data Transformations Matter," Papers 2008.01714, arXiv.org, revised Mar 2021.
  • Handle: RePEc:arx:papers:2008.01714
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2008.01714
    File Function: Latest version
    Download Restriction: no
    ---><---

    Other versions of this item:

    References listed on IDEAS

    as
    1. Banerjee, Anindya & Marcellino, Massimiliano & Masten, Igor, 2014. "Forecasting with factor-augmented error correction models," International Journal of Forecasting, Elsevier, vol. 30(3), pages 589-612.
    2. Granger, C. W. J., 1987. "Implications of Aggregation with Common Factors," Econometric Theory, Cambridge University Press, vol. 3(2), pages 208-222, April.
    3. Marta Banbura & Domenico Giannone & Lucrezia Reichlin, 2010. "Large Bayesian vector auto regressions," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 25(1), pages 71-92.
    4. Forni, Mario & Hallin, Marc & Lippi, Marco & Reichlin, Lucrezia, 2005. "The Generalized Dynamic Factor Model: One-Sided Estimation and Forecasting," Journal of the American Statistical Association, American Statistical Association, vol. 100, pages 830-840, September.
    5. Zou, Hui, 2006. "The Adaptive Lasso and Its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 1418-1429, December.
    6. Engle, Robert & Granger, Clive, 2015. "Co-integration and error correction: Representation, estimation, and testing," Applied Econometrics, Russian Presidential Academy of National Economy and Public Administration (RANEPA), vol. 39(3), pages 106-135.
    7. Goulet Coulombe, Philippe & Marcellino, Massimiliano & Stevanović, Dalibor, 2021. "Can Machine Learning Catch The Covid-19 Recession?," National Institute Economic Review, National Institute of Economic and Social Research, vol. 256, pages 71-109, April.
    8. Phillips, P C B, 1991. "To Criticize the Critics: An Objective Bayesian Analysis of Stochastic Trends," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 6(4), pages 333-364, Oct.-Dec..
    9. Carriero, Andrea & Galvão, Ana Beatriz & Kapetanios, George, 2019. "A comprehensive evaluation of macroeconomic forecasting methods," International Journal of Forecasting, Elsevier, vol. 35(4), pages 1226-1239.
    10. Bergmeir, Christoph & Hyndman, Rob J. & Koo, Bonsoo, 2018. "A note on the validity of cross-validation for evaluating autoregressive time series prediction," Computational Statistics & Data Analysis, Elsevier, vol. 120(C), pages 70-83.
    11. Philippe Goulet Coulombe & Maxime Leroux & Dalibor Stevanovic & Stéphane Surprenant, 2022. "How is machine learning useful for macroeconomic forecasting?," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 37(5), pages 920-964, August.
    12. Sims, Christopher A., 1988. "Bayesian skepticism on unit root econometrics," Journal of Economic Dynamics and Control, Elsevier, vol. 12(2-3), pages 463-474.
    13. Christoffersen, Peter F & Diebold, Francis X, 1998. "Cointegration and Long-Horizon Forecasting," Journal of Business & Economic Statistics, American Statistical Association, vol. 16(4), pages 450-458, October.
    14. Gorodnichenko, Yuriy & Ng, Serena, 2017. "Level and volatility factors in macroeconomic data," Journal of Monetary Economics, Elsevier, vol. 91(C), pages 52-68.
    15. Phillips, P C B, 1991. "Optimal Inference in Cointegrated Systems," Econometrica, Econometric Society, vol. 59(2), pages 283-306, March.
    16. Michael W. McCracken & Serena Ng, 2016. "FRED-MD: A Monthly Database for Macroeconomic Research," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 34(4), pages 574-589, October.
    17. Raffaella Giacomini & Barbara Rossi, 2010. "Forecast comparisons in unstable environments," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 25(4), pages 595-620.
    18. Chan, Nigel & Wang, Qiying, 2015. "Nonlinear regressions with nonstationary time series," Journal of Econometrics, Elsevier, vol. 185(1), pages 182-195.
    19. Jushan Bai & Serena Ng, 2004. "A PANIC Attack on Unit Roots and Cointegration," Econometrica, Econometric Society, vol. 72(4), pages 1127-1177, July.
    20. Marta Banbura & Domenico Giannone & Lucrezia Reichlin, 2010. "Large Bayesian vector auto regressions," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 25(1), pages 71-92.
    21. Michael W. McCracken & Serena Ng, 2021. "FRED-QD: A Quarterly Database for Macroeconomic Research," Review, Federal Reserve Bank of St. Louis, vol. 103(1), pages 1-44, January.
    22. Diebold, Francis X & Mariano, Roberto S, 2002. "Comparing Predictive Accuracy," Journal of Business & Economic Statistics, American Statistical Association, vol. 20(1), pages 134-144, January.
    23. Choi,In, 2015. "Almost All about Unit Roots," Cambridge Books, Cambridge University Press, number 9781107482500.
    24. Marcelo C. Medeiros & Gabriel F. R. Vasconcelos & Álvaro Veiga & Eduardo Zilberman, 2021. "Forecasting Inflation in a Data-Rich Environment: The Benefits of Machine Learning Methods," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 39(1), pages 98-119, January.
    25. Thomas J. Sargent & Christopher A. Sims, 1977. "Business cycle modeling without pretending to have too much a priori economic theory," Working Papers 55, Federal Reserve Bank of Minneapolis.
    26. Shiller, Robert J, 1973. "A Distributed Lag Estimator Derived from Smoothness Priors," Econometrica, Econometric Society, vol. 41(4), pages 775-788, July.
    27. Borup, Daniel & Christensen, Bent Jesper & Mühlbach, Nicolaj Søndergaard & Nielsen, Mikkel Slot, 2023. "Targeting predictors in random forest regression," International Journal of Forecasting, Elsevier, vol. 39(2), pages 841-868.
    28. Ghysels, Eric & Santa-Clara, Pedro & Valkanov, Rossen, 2004. "The MIDAS Touch: Mixed Data Sampling Regression Models," University of California at Los Angeles, Anderson Graduate School of Management qt9mf223rs, Anderson Graduate School of Management, UCLA.
    29. Stock, James H & Watson, Mark W, 2002. "Macroeconomic Forecasting Using Diffusion Indexes," Journal of Business & Economic Statistics, American Statistical Association, vol. 20(2), pages 147-162, April.
    30. Colin Bermingham & Antonello D’Agostino, 2014. "Understanding and forecasting aggregate and disaggregate price dynamics," Empirical Economics, Springer, vol. 46(2), pages 765-788, March.
    31. Peter R. Hansen & Asger Lunde & James M. Nason, 2011. "The Model Confidence Set," Econometrica, Econometric Society, vol. 79(2), pages 453-497, March.
    32. Hassani, Hossein & Heravi, Saeed & Zhigljavsky, Anatoly, 2009. "Forecasting European industrial production with singular spectrum analysis," International Journal of Forecasting, Elsevier, vol. 25(1), pages 103-118.
    33. Sims, Christopher A & Uhlig, Harald, 1991. "Understanding Unit Rooters: A Helicopter Tour," Econometrica, Econometric Society, vol. 59(6), pages 1591-1599, November.
    34. Elliott, Graham & Gargano, Antonio & Timmermann, Allan, 2013. "Complete subset regressions," Journal of Econometrics, Elsevier, vol. 177(2), pages 357-373.
    35. Thomas R. Cook & Aaron Smalter Hall, 2017. "Macroeconomic Indicator Forecasting with Deep Neural Networks," Research Working Paper RWP 17-11, Federal Reserve Bank of Kansas City.
    36. Engle, Robert F. & Yoo, Byung Sam, 1987. "Forecasting and testing in co-integrated systems," Journal of Econometrics, Elsevier, vol. 35(1), pages 143-159, May.
    37. Jushan Bai & Serena Ng, 2009. "Boosting diffusion indices," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 24(4), pages 607-629.
    38. Rachidi Kotchoni & Maxime Leroux & Dalibor Stevanovic, 2019. "Macroeconomic forecast accuracy in a data‐rich environment," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 34(7), pages 1050-1072, November.
    39. Marcellino, Massimiliano & Stock, James H. & Watson, Mark W., 2003. "Macroeconomic forecasting in the Euro area: Country specific versus area-wide information," European Economic Review, Elsevier, vol. 47(1), pages 1-18, February.
    40. Ben S. Bernanke & Jean Boivin & Piotr Eliasz, 2005. "Measuring the Effects of Monetary Policy: A Factor-Augmented Vector Autoregressive (FAVAR) Approach," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 120(1), pages 387-422.
    41. Jean Boivin & Serena Ng, 2005. "Understanding and Comparing Factor-Based Forecasts," International Journal of Central Banking, International Journal of Central Banking, vol. 1(3), December.
    42. G. Elliott & C. Granger & A. Timmermann (ed.), 2006. "Handbook of Economic Forecasting," Handbook of Economic Forecasting, Elsevier, edition 1, volume 1, number 1.
    43. Elliott, Graham, 2006. "Forecasting with Trending Data," Handbook of Economic Forecasting, in: G. Elliott & C. Granger & A. Timmermann (ed.), Handbook of Economic Forecasting, edition 1, volume 1, chapter 11, pages 555-604, Elsevier.
    44. Stock J.H. & Watson M.W., 2002. "Forecasting Using Principal Components From a Large Number of Predictors," Journal of the American Statistical Association, American Statistical Association, vol. 97, pages 1167-1179, December.
    45. Hall, Anthony D & Anderson, Heather M & Granger, Clive W J, 1992. "A Cointegration Analysis of Treasury Bill Yields," The Review of Economics and Statistics, MIT Press, vol. 74(1), pages 116-126, February.
    46. Hossein Hassani & Abdol S. Soofi & Anatoly Zhigljavsky, 2013. "Predicting inflation dynamics with singular spectrum analysis," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 176(3), pages 743-760, June.
    47. Sims, Christopher A & Stock, James H & Watson, Mark W, 1990. "Inference in Linear Time Series Models with Some Unit Roots," Econometrica, Econometric Society, vol. 58(1), pages 113-144, January.
    48. Kim, Hyun Hak & Swanson, Norman R., 2018. "Mining big data using parsimonious factor, machine learning, variable selection and shrinkage methods," International Journal of Forecasting, Elsevier, vol. 34(2), pages 339-354.
    49. James H. Stock & Mark W. Watson, 1998. "A Comparison of Linear and Nonlinear Univariate Models for Forecasting Macroeconomic Time Series," NBER Working Papers 6607, National Bureau of Economic Research, Inc.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Miquel Oliu-Barton & Bary S. R. Pradelski & Nicolas Woloszko & Lionel Guetta-Jeanrenaud & Philippe Aghion & Patrick Artus & Arnaud Fontanet & Philippe Martin & Guntram B. Wolff, 2022. "The effect of COVID certificates on vaccine uptake, health outcomes, and the economy," Nature Communications, Nature, vol. 13(1), pages 1-13, December.
    2. Philippe Goulet Coulombe & Maximilian Goebel, 2023. "Maximally Machine-Learnable Portfolios," Papers 2306.05568, arXiv.org, revised Apr 2024.
    3. Philippe Goulet Coulombe & Maxime Leroux & Dalibor Stevanovic & Stéphane Surprenant, 2022. "How is machine learning useful for macroeconomic forecasting?," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 37(5), pages 920-964, August.
    4. Escribano, Alvaro & Peña, Daniel & Ruiz, Esther, 2021. "30 years of cointegration and dynamic factor models forecasting and its future with big data: Editorial," International Journal of Forecasting, Elsevier, vol. 37(4), pages 1333-1337.
    5. Donato Ceci & Andrea Silvestrini, 2023. "Nowcasting the state of the Italian economy: The role of financial markets," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 42(7), pages 1569-1593, November.
    6. Philippe Goulet Coulombe, 2021. "Slow-Growing Trees," Papers 2103.01926, arXiv.org, revised Jul 2021.
    7. Philippe Goulet Coulombe, 2021. "The Macroeconomy as a Random Forest," Working Papers 21-05, Chair in macroeconomics and forecasting, University of Quebec in Montreal's School of Management.
    8. Philippe Goulet Coulombe & Mikael Frenette & Karin Klieber, 2023. "From Reactive to Proactive Volatility Modeling with Hemisphere Neural Networks," Working Papers 23-04, Chair in macroeconomics and forecasting, University of Quebec in Montreal's School of Management, revised Nov 2023.
    9. Daniel Borup & Philippe Goulet Coulombe & Erik Christian Montes Schütte & David E. Rapach & Sander Schwenk-Nebbe, 2022. "The Anatomy of Out-of-Sample Forecasting Accuracy," FRB Atlanta Working Paper 2022-16, Federal Reserve Bank of Atlanta.
    10. Philippe Goulet Coulombe, 2022. "A Neural Phillips Curve and a Deep Output Gap," Working Papers 22-01, Chair in macroeconomics and forecasting, University of Quebec in Montreal's School of Management.
    11. Philippe Goulet Coulombe, 2021. "Slow-Growing Trees," Working Papers 21-02, Chair in macroeconomics and forecasting, University of Quebec in Montreal's School of Management.
    12. Philippe Goulet Coulombe & Maximilian Gobel, 2023. "Maximally Machine-Learnable Portfolios," Working Papers 23-01, Chair in macroeconomics and forecasting, University of Quebec in Montreal's School of Management, revised Apr 2023.
    13. Philippe Goulet Coulombe & Mikael Frenette & Karin Klieber, 2023. "From Reactive to Proactive Volatility Modeling with Hemisphere Neural Networks," Papers 2311.16333, arXiv.org, revised Apr 2024.
    14. Lily Davies & Mark Kattenberg & Benedikt Vogt, 2023. "Predicting Firm Exits with Machine Learning: Implications for Selection into COVID-19 Support and Productivity Growth," CPB Discussion Paper 444, CPB Netherlands Bureau for Economic Policy Analysis.
    15. Krzysztof Drachal, 2022. "Forecasting the Crude Oil Spot Price with Bayesian Symbolic Regression," Energies, MDPI, vol. 16(1), pages 1-29, December.
    16. Philippe Goulet Coulombe, 2022. "A Neural Phillips Curve and a Deep Output Gap," Papers 2202.04146, arXiv.org.
    17. Philippe Goulet Coulombe, 2020. "Time-Varying Parameters as Ridge Regressions," Papers 2009.00401, arXiv.org, revised Apr 2023.
    18. Philippe Goulet Coulombe, 2020. "To Bag is to Prune," Papers 2008.07063, arXiv.org, revised Jun 2021.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Philippe Goulet Coulombe & Maxime Leroux & Dalibor Stevanovic & Stéphane Surprenant, 2022. "How is machine learning useful for macroeconomic forecasting?," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 37(5), pages 920-964, August.
    2. Philippe Goulet Coulombe, 2021. "The Macroeconomy as a Random Forest," Working Papers 21-05, Chair in macroeconomics and forecasting, University of Quebec in Montreal's School of Management.
    3. Philippe Goulet Coulombe, 2020. "The Macroeconomy as a Random Forest," Papers 2006.12724, arXiv.org, revised Mar 2021.
    4. Smeekes, Stephan & Wijler, Etienne, 2018. "Macroeconomic forecasting using penalized regression methods," International Journal of Forecasting, Elsevier, vol. 34(3), pages 408-430.
    5. Barbara Rossi, 2019. "Forecasting in the presence of instabilities: How do we know whether models predict well and how to improve them," Economics Working Papers 1711, Department of Economics and Business, Universitat Pompeu Fabra, revised Jul 2021.
    6. Byron Botha & Rulof Burger & Kevin Kotzé & Neil Rankin & Daan Steenkamp, 2023. "Big data forecasting of South African inflation," Empirical Economics, Springer, vol. 65(1), pages 149-188, July.
    7. Rachidi Kotchoni & Maxime Leroux & Dalibor Stevanovic, 2019. "Macroeconomic forecast accuracy in a data‐rich environment," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 34(7), pages 1050-1072, November.
    8. Olivier Fortin‐Gagnon & Maxime Leroux & Dalibor Stevanovic & Stéphane Surprenant, 2022. "A large Canadian database for macroeconomic analysis," Canadian Journal of Economics/Revue canadienne d'économique, John Wiley & Sons, vol. 55(4), pages 1799-1833, November.
    9. Philippe Goulet Coulombe, 2021. "To Bag is to Prune," Working Papers 21-03, Chair in macroeconomics and forecasting, University of Quebec in Montreal's School of Management, revised Jun 2021.
    10. Rangan Gupta & Alain Kabundi & Stephen Miller & Josine Uwilingiye, 2014. "Using large data sets to forecast sectoral employment," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 23(2), pages 229-264, June.
    11. Luciani, Matteo, 2014. "Forecasting with approximate dynamic factor models: The role of non-pervasive shocks," International Journal of Forecasting, Elsevier, vol. 30(1), pages 20-29.
    12. Petropoulos, Fotios & Apiletti, Daniele & Assimakopoulos, Vassilios & Babai, Mohamed Zied & Barrow, Devon K. & Ben Taieb, Souhaib & Bergmeir, Christoph & Bessa, Ricardo J. & Bijak, Jakub & Boylan, Joh, 2022. "Forecasting: theory and practice," International Journal of Forecasting, Elsevier, vol. 38(3), pages 705-871.
      • Fotios Petropoulos & Daniele Apiletti & Vassilios Assimakopoulos & Mohamed Zied Babai & Devon K. Barrow & Souhaib Ben Taieb & Christoph Bergmeir & Ricardo J. Bessa & Jakub Bijak & John E. Boylan & Jet, 2020. "Forecasting: theory and practice," Papers 2012.03854, arXiv.org, revised Jan 2022.
    13. Hauzenberger, Niko & Huber, Florian & Klieber, Karin, 2023. "Real-time inflation forecasting using non-linear dimension reduction techniques," International Journal of Forecasting, Elsevier, vol. 39(2), pages 901-921.
    14. Cobb, Marcus P A, 2017. "Forecasting Economic Aggregates Using Dynamic Component Grouping," MPRA Paper 81585, University Library of Munich, Germany.
    15. Philippe Goulet Coulombe, 2020. "Time-Varying Parameters as Ridge Regressions," Papers 2009.00401, arXiv.org, revised Apr 2023.
    16. Kim, Hyun Hak & Swanson, Norman R., 2018. "Mining big data using parsimonious factor, machine learning, variable selection and shrinkage methods," International Journal of Forecasting, Elsevier, vol. 34(2), pages 339-354.
    17. Norman R. Swanson & Weiqi Xiong, 2018. "Big data analytics in economics: What have we learned so far, and where should we go from here?," Canadian Journal of Economics/Revue canadienne d'économique, John Wiley & Sons, vol. 51(3), pages 695-746, August.
    18. Borup, Daniel & Christensen, Bent Jesper & Mühlbach, Nicolaj Søndergaard & Nielsen, Mikkel Slot, 2023. "Targeting predictors in random forest regression," International Journal of Forecasting, Elsevier, vol. 39(2), pages 841-868.
    19. Banerjee, Anindya & Marcellino, Massimiliano & Masten, Igor, 2014. "Forecasting with factor-augmented error correction models," International Journal of Forecasting, Elsevier, vol. 30(3), pages 589-612.
    20. Koop, Gary & Korobilis, Dimitris & Pettenuzzo, Davide, 2019. "Bayesian compressed vector autoregressions," Journal of Econometrics, Elsevier, vol. 210(1), pages 135-154.

    More about this item

    JEL classification:

    • C53 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Forecasting and Prediction Models; Simulation Methods
    • C55 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Large Data Sets: Modeling and Analysis
    • E37 - Macroeconomics and Monetary Economics - - Prices, Business Fluctuations, and Cycles - - - Forecasting and Simulation: Models and Applications

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2008.01714. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.