IDEAS home Printed from https://ideas.repec.org/p/tin/wpaper/20110122.html

Sparse and Robust Factor Modelling

Author

Listed:
  • Christophe Croux

    (K.U. Leuven, Belgium)

  • Peter Exterkate

    (Erasmus University Rotterdam)

Abstract

Factor construction methods are widely used to summarize a large panel of variables by means of a relatively small number of representative factors. We propose a novel factor construction procedure that enjoys the properties of robustness to outliers and of sparsity; that is, having relatively few nonzero factor loadings. Compared to more traditional factor construction methods, we find that this procedure leads to better interpretable factors and to a favorable forecasting performance, both in a Monte Carlo experiment and in two empirical applications to large data sets, one from macroeconomics and one from microeconomics.

Suggested Citation

  • Christophe Croux & Peter Exterkate, 2011. "Sparse and Robust Factor Modelling," Tinbergen Institute Discussion Papers 11-122/4, Tinbergen Institute.
  • Handle: RePEc:tin:wpaper:20110122
    as

    Download full text from publisher

    File URL: https://papers.tinbergen.nl/11122.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Exterkate, Peter & Groenen, Patrick J.F. & Heij, Christiaan & van Dijk, Dick, 2016. "Nonlinear forecasting with many predictors using kernel ridge regression," International Journal of Forecasting, Elsevier, vol. 32(3), pages 736-753.
    2. Harrison, David Jr. & Rubinfeld, Daniel L., 1978. "Hedonic housing prices and the demand for clean air," Journal of Environmental Economics and Management, Elsevier, vol. 5(1), pages 81-102, March.
    3. Sydney C. Ludvigson & Serena Ng, 2009. "Macro Factors in Bond Risk Premia," The Review of Financial Studies, Society for Financial Studies, vol. 22(12), pages 5027-5067, December.
    4. Pison, Greet & Rousseeuw, Peter J. & Filzmoser, Peter & Croux, Christophe, 2003. "Robust factor analysis," Journal of Multivariate Analysis, Elsevier, vol. 84(1), pages 145-172, January.
    5. Wang, Hansheng & Li, Guodong & Jiang, Guohua, 2007. "Robust Regression Shrinkage and Consistent Variable Selection Through the LAD-Lasso," Journal of Business & Economic Statistics, American Statistical Association, vol. 25, pages 347-355, July.
    6. Friedman, Jerome H. & Hastie, Trevor & Tibshirani, Rob, 2010. "Regularization Paths for Generalized Linear Models via Coordinate Descent," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 33(i01).
    7. Stock, James H & Watson, Mark W, 2002. "Macroeconomic Forecasting Using Diffusion Indexes," Journal of Business & Economic Statistics, American Statistical Association, vol. 20(2), pages 147-162, April.
    8. Bai, Jushan & Ng, Serena, 2008. "Forecasting economic time series using targeted predictors," Journal of Econometrics, Elsevier, vol. 146(2), pages 304-317, October.
    9. Marta Bańbura & Domenico Giannone & Lucrezia Reichlin, 2010. "Large Bayesian vector auto regressions," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 25(1), pages 71-92, January.
    10. Giorgio Fagiolo & Mauro Napoletano & Andrea Roventini, 2008. "Are output growth-rate distributions fat-tailed? some evidence from OECD countries," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 23(5), pages 639-669.
    11. Andrea Carriero & George Kapetanios & Massimiliano Marcellino, 2011. "Forecasting large datasets with Bayesian reduced rank multivariate models," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 26(5), pages 735-761, August.
    12. Pace, R Kelley & Gilley, Otis W, 1997. "Using the Spatial Configuration of the Data to Improve Estimation," The Journal of Real Estate Finance and Economics, Springer, vol. 14(3), pages 333-340, May.
    13. Ludvigson, Sydney C. & Ng, Serena, 2007. "The empirical risk-return relation: A factor analysis approach," Journal of Financial Economics, Elsevier, vol. 83(1), pages 171-222, January.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Carlos Cesar Trucios-Maza & João H. G Mazzeu & Luis K. Hotta & Pedro L. Valls Pereira & Marc Hallin, 2019. "On the robustness of the general dynamic factor model with infinite-dimensional space: identification, estimation, and forecasting," Working Papers ECARES 2019-32, ULB -- Universite Libre de Bruxelles.
    2. Thomas Despois & Catherine Doz, 2021. "Identifying and interpreting the factors in factor models via sparsity: Different approaches," Working Papers halshs-02235543, HAL.
    3. Smeekes, Stephan & Wijler, Etienne, 2018. "Macroeconomic forecasting using penalized regression methods," International Journal of Forecasting, Elsevier, vol. 34(3), pages 408-430.
    4. Johannes Tang Kristensen, 2013. "Diffusion Indexes with Sparse Loadings," CREATES Research Papers 2013-22, Department of Economics and Business Economics, Aarhus University.
    5. Thomas Despois & Catherine Doz, 2022. "Identifying and interpreting the factors in factor models via sparsity : Different approaches," Working Papers halshs-03626503, HAL.
    6. Thomas Despois & Catherine Doz, 2022. "Identifying and interpreting the factors in factor models via sparsity : Different approaches," PSE Working Papers halshs-03626503, HAL.
    7. Kristensen Johannes Tang, 2014. "Factor-based forecasting in the presence of outliers: Are factors better selected and estimated by the median than by the mean?," Studies in Nonlinear Dynamics & Econometrics, De Gruyter, vol. 18(3), pages 309-338, May.
    8. Thomas Despois & Catherine Doz, 2021. "Identifying and interpreting the factors in factor models via sparsity: Different approaches," PSE Working Papers halshs-02235543, HAL.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Exterkate, Peter & Groenen, Patrick J.F. & Heij, Christiaan & van Dijk, Dick, 2016. "Nonlinear forecasting with many predictors using kernel ridge regression," International Journal of Forecasting, Elsevier, vol. 32(3), pages 736-753.
    2. Yoshiki Nakajima & Naoya Sueishi, 2022. "Forecasting the Japanese macroeconomy using high-dimensional data," The Japanese Economic Review, Springer, vol. 73(2), pages 299-324, April.
    3. Trucíos, Carlos & Mazzeu, João H.G. & Hotta, Luiz K. & Valls Pereira, Pedro L. & Hallin, Marc, 2021. "Robustness and the general dynamic factor model with infinite-dimensional space: Identification, estimation, and forecasting," International Journal of Forecasting, Elsevier, vol. 37(4), pages 1520-1534.
    4. Carlos Cesar Trucios-Maza & João H. G Mazzeu & Luis K. Hotta & Pedro L. Valls Pereira & Marc Hallin, 2019. "On the robustness of the general dynamic factor model with infinite-dimensional space: identification, estimation, and forecasting," Working Papers ECARES 2019-32, ULB -- Universite Libre de Bruxelles.
    5. Peter Exterkate, 2012. "Model Selection in Kernel Ridge Regression," CREATES Research Papers 2012-10, Department of Economics and Business Economics, Aarhus University.
    6. Fan, Jianqing & Xue, Lingzhou & Yao, Jiawei, 2017. "Sufficient forecasting using factor models," Journal of Econometrics, Elsevier, vol. 201(2), pages 292-306.
    7. Peter Exterkate, 2011. "Modelling Issues in Kernel Ridge Regression," Tinbergen Institute Discussion Papers 11-138/4, Tinbergen Institute.
    8. Smeekes, Stephan & Wijler, Etienne, 2018. "Macroeconomic forecasting using penalized regression methods," International Journal of Forecasting, Elsevier, vol. 34(3), pages 408-430.
    9. Çakmaklı, Cem & van Dijk, Dick, 2016. "Getting the most out of macroeconomic information for predicting excess stock returns," International Journal of Forecasting, Elsevier, vol. 32(3), pages 650-668.
    10. Paolo Andreini & Donato Ceci, 2019. "A Horse Race in High Dimensional Space," CEIS Research Paper 452, Tor Vergata University, CEIS, revised 14 Feb 2019.
    11. Fan, Jianqing & Jiang, Bai & Sun, Qiang, 2022. "Bayesian factor-adjusted sparse regression," Journal of Econometrics, Elsevier, vol. 230(1), pages 3-19.
    12. Umberto Amato & Anestis Antoniadis & Italia De Feis & Irene Gijbels, 2021. "Penalised robust estimators for sparse and high-dimensional linear models," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 30(1), pages 1-48, March.
    13. Marco Lombardi & Chiara Osbat & Bernd Schnatz, 2012. "Global commodity cycles and linkages: a FAVAR approach," Empirical Economics, Springer, vol. 43(2), pages 651-670, October.
    14. Andrea Carriero & Francesco Corsello & Massimiliano Marcellino, 2022. "The global component of inflation volatility," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 37(4), pages 700-721, June.
    15. Ralf Brüggemann & Christian Kascha, 2017. "Directed Graphs and Variable Selection in Large Vector Autoregressive Models," Working Paper Series of the Department of Economics, University of Konstanz 2017-06, Department of Economics, University of Konstanz.
    16. Gianluca Cubadda, 2025. "VAR Models with an Index Structure: A Survey with New Results," Econometrics, MDPI, vol. 13(4), pages 1-17, October.
    17. repec:gnv:wpaper:unige:76321 is not listed on IDEAS
    18. Cem Cakmakli & Dick van Dijk, 2010. "Getting the Most out of Macroeconomic Information for Predicting Stock Returns and Volatility," Tinbergen Institute Discussion Papers 10-115/4, Tinbergen Institute.
    19. Maio, Paulo & Philip, Dennis, 2015. "Macro variables and the components of stock returns," Journal of Empirical Finance, Elsevier, vol. 33(C), pages 287-308.
    20. Johannes Tang Kristensen, 2013. "Diffusion Indexes with Sparse Loadings," CREATES Research Papers 2013-22, Department of Economics and Business Economics, Aarhus University.
    21. repec:dau:papers:123456789/11663 is not listed on IDEAS
    22. Koop, Gary & Korobilis, Dimitris & Pettenuzzo, Davide, 2019. "Bayesian compressed vector autoregressions," Journal of Econometrics, Elsevier, vol. 210(1), pages 135-154.

    More about this item

    Keywords

    ;
    ;
    ;
    ;

    JEL classification:

    • C38 - Mathematical and Quantitative Methods - - Multiple or Simultaneous Equation Models; Multiple Variables - - - Classification Methdos; Cluster Analysis; Principal Components; Factor Analysis
    • C51 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Model Construction and Estimation
    • C53 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Forecasting and Prediction Models; Simulation Methods

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:tin:wpaper:20110122. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Tinbergen Office +31 (0)10-4088900 (email available below). General contact details of provider: https://edirc.repec.org/data/tinbenl.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.