
Factor-adjusted regularized model selection

Author

Listed:
  • Fan, Jianqing
  • Ke, Yuan
  • Wang, Kaizheng

Abstract

This paper studies model selection consistency for high-dimensional sparse regression when the data exhibit both cross-sectional and serial dependence. Most commonly used model selection methods fail to consistently recover the true model when the covariates are highly correlated. Motivated by econometric and financial studies, we consider the case where covariate dependence can be reduced through a factor model, and propose a consistent strategy named Factor-Adjusted Regularized Model Selection (FarmSelect). By learning the latent factors and idiosyncratic components and using both of them as predictors, FarmSelect lifts the problem from model selection with highly correlated covariates to model selection with weakly correlated ones. Model selection consistency, as well as optimal rates of convergence, are obtained under mild conditions. Numerical studies demonstrate good finite-sample performance in terms of both model selection and out-of-sample prediction. Moreover, our method is flexible in the sense that it pays no price in weakly correlated and uncorrelated cases. Our method is applicable to a wide range of high-dimensional sparse regression problems. An R package, FarmSelect, is also provided for implementation.
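
The lifting idea in the abstract can be illustrated with a small sketch. It is not the FarmSelect R package and does not follow its interface. It assumes a single latent factor, estimates it by power iteration (a simple stand-in for principal components), and runs a plain lasso by coordinate descent on the estimated idiosyncratic components. All function names, the simulated data, and the penalty level lam = 0.5 are hypothetical choices made for illustration only.

```python
import random

def top_direction(X, iters=50):
    """Power iteration for the leading eigenvector of X^T X (one factor assumed)."""
    n, p = len(X), len(X[0])
    v = [1.0 / p ** 0.5] * p
    for _ in range(iters):
        f = [sum(X[i][j] * v[j] for j in range(p)) for i in range(n)]   # X v
        w = [sum(X[i][j] * f[i] for i in range(n)) for j in range(p)]   # X^T X v
        norm = sum(c * c for c in w) ** 0.5
        v = [c / norm for c in w]
    return v

def soft(z, t):
    """Soft-thresholding operator used in the lasso coordinate update."""
    return z - t if z > t else z + t if z < -t else 0.0

def lasso(U, y, lam, sweeps=100):
    """Coordinate descent for (1/2n)||y - U b||^2 + lam * ||b||_1."""
    n, p = len(U), len(U[0])
    beta, resid = [0.0] * p, list(y)
    col_ss = [sum(U[i][j] ** 2 for i in range(n)) / n for j in range(p)]
    for _ in range(sweeps):
        for j in range(p):
            rho = sum(U[i][j] * resid[i] for i in range(n)) / n + col_ss[j] * beta[j]
            new = soft(rho, lam) / col_ss[j]
            delta = new - beta[j]
            if delta != 0.0:
                for i in range(n):
                    resid[i] -= U[i][j] * delta
                beta[j] = new
    return beta

def farm_select_sketch(X, y, lam):
    """Factor-adjust X, then select variables using the idiosyncratic parts."""
    n, p = len(X), len(X[0])
    means = [sum(row[j] for row in X) / n for j in range(p)]
    Xc = [[row[j] - means[j] for j in range(p)] for row in X]
    y_bar = sum(y) / n
    yc = [val - y_bar for val in y]
    # Step 1: estimated factor f and idiosyncratic components U = Xc - f v^T.
    v = top_direction(Xc)
    f = [sum(Xc[i][j] * v[j] for j in range(p)) for i in range(n)]
    U = [[Xc[i][j] - f[i] * v[j] for j in range(p)] for i in range(n)]
    # Step 2: the factor enters unpenalized; f is orthogonal to the columns of U,
    # so its coefficient is a simple least-squares projection.
    gamma = sum(a * b for a, b in zip(f, yc)) / sum(a * a for a in f)
    y_adj = [yc[i] - gamma * f[i] for i in range(n)]
    # Step 3: lasso on the weakly correlated idiosyncratic components.
    beta = lasso(U, y_adj, lam)
    return [j for j in range(p) if beta[j] != 0.0], beta

# Simulated data (hypothetical): one strong factor makes all covariates correlated.
random.seed(0)
n, p = 200, 20
loadings = [random.uniform(0.5, 1.5) for _ in range(p)]
beta_true = [3.0, 3.0, 3.0] + [0.0] * (p - 3)
X, y = [], []
for _ in range(n):
    f_t = random.gauss(0, 2)
    row = [loadings[j] * f_t + random.gauss(0, 1) for j in range(p)]
    X.append(row)
    y.append(sum(b * x for b, x in zip(beta_true, row)) + random.gauss(0, 0.3))

selected, beta_hat = farm_select_sketch(X, y, lam=0.5)
print("selected covariates:", selected)
```

Because the estimated factor is orthogonal to the estimated idiosyncratic components, the sketch can project the factor out first and penalize only the idiosyncratic coefficients. In practice the number of factors and the penalty level would have to be chosen from the data.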

Suggested Citation

  • Fan, Jianqing & Ke, Yuan & Wang, Kaizheng, 2020. "Factor-adjusted regularized model selection," Journal of Econometrics, Elsevier, vol. 216(1), pages 71-85.
  • Handle: RePEc:eee:econom:v:216:y:2020:i:1:p:71-85
    DOI: 10.1016/j.jeconom.2020.01.006

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0304407620300117
    Download Restriction: Full text for ScienceDirect subscribers only

    Citations

    Cited by:

    1. Guo, Yanhong & Li, Ping & Li, Aihua, 2021. "Tail risk contagion between international financial markets during COVID-19 pandemic," International Review of Financial Analysis, Elsevier, vol. 73(C).
    2. Jianqing Fan & Ricardo Masini & Marcelo C. Medeiros, 2021. "Bridging factor and sparse models," Papers 2102.11341, arXiv.org, revised Sep 2022.
    3. Mogliani, Matteo & Simoni, Anna, 2021. "Bayesian MIDAS penalized regressions: Estimation, selection, and prediction," Journal of Econometrics, Elsevier, vol. 222(1), pages 833-860.
    4. Miao He & Yanhong Guo, 2022. "Systemic Risk Contributions of Financial Institutions during the Stock Market Crash in China," Sustainability, MDPI, vol. 14(9), pages 1-14, April.
    5. Heiss, Florian & Hetzenecker, Stephan & Osterhaus, Maximilian, 2022. "Nonparametric estimation of the random coefficients model: An elastic net approach," Journal of Econometrics, Elsevier, vol. 229(2), pages 299-321.
    6. Lukoianove, Tatiana & Agarwal, James & Osiyevskyy, Oleksiy, 2022. "Modeling a country's political environment using dynamic factor analysis (DFA): A new methodology for IB research," Journal of World Business, Elsevier, vol. 57(5).
    7. Jianqing Fan & Ricardo Masini & Marcelo C. Medeiros, 2022. "Do We Exploit all Information for Counterfactual Analysis? Benefits of Factor Models and Idiosyncratic Correction," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 117(538), pages 574-590, April.
    8. Yongxia Zhang & Qi Wang & Maozai Tian, 2022. "Smoothed Quantile Regression with Factor-Augmented Regularized Variable Selection for High Correlated Data," Mathematics, MDPI, vol. 10(16), pages 1-30, August.
    9. Jonas Krampe & Luca Margaritella, 2021. "Factor Models with Sparse VAR Idiosyncratic Components," Papers 2112.07149, arXiv.org, revised May 2022.
    10. Collins, Alan & Fan, Jingwen & Mahabir, Aruneema, 2022. "Actual versus ‘natural’ rates of suicide: Evidence from the USA," Economic Modelling, Elsevier, vol. 106(C).
    11. Yucheng Yang & Yue Pang & Guanhua Huang & Weinan E, 2020. "The Knowledge Graph for Macroeconomic Analysis with Alternative Big Data," Papers 2010.05172, arXiv.org.
    12. Jianqing Fan & Kunpeng Li & Yuan Liao, 2020. "Recent Developments on Factor Models and its Applications in Econometric Learning," Papers 2009.10103, arXiv.org.
    13. Yuan Liao & Xinjie Ma & Andreas Neuhierl & Zhentao Shi, 2023. "Economic Forecasts Using Many Noises," Papers 2312.05593, arXiv.org, revised Dec 2023.
    14. Simone Tonini & Francesca Chiaromonte & Alessandro Giovannelli, 2022. "On the impact of serial dependence on penalized regression methods," LEM Papers Series 2022/21, Laboratory of Economics and Management (LEM), Sant'Anna School of Advanced Studies, Pisa, Italy.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Yongxia Zhang & Qi Wang & Maozai Tian, 2022. "Smoothed Quantile Regression with Factor-Augmented Regularized Variable Selection for High Correlated Data," Mathematics, MDPI, vol. 10(16), pages 1-30, August.
    2. Fan, Jianqing & Jiang, Bai & Sun, Qiang, 2022. "Bayesian factor-adjusted sparse regression," Journal of Econometrics, Elsevier, vol. 230(1), pages 3-19.
    3. Bai, Jushan & Liao, Yuan, 2016. "Efficient estimation of approximate factor models via penalized maximum likelihood," Journal of Econometrics, Elsevier, vol. 191(1), pages 1-18.
    4. Yoshimasa Uematsu & Takashi Yamagata, 2019. "Estimation of Weak Factor Models," DSSR Discussion Papers 96, Graduate School of Economics and Management, Tohoku University.
    5. Yuefeng Han & Cun-Hui Zhang & Rong Chen, 2021. "CP Factor Model for Dynamic Tensors," Papers 2110.15517, arXiv.org.
    6. Jianqing Fan & Yuan Liao & Martina Mincheva, 2013. "Large covariance estimation by thresholding principal orthogonal complements," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 75(4), pages 603-680, September.
    7. Fan, Jianqing & Ke, Yuan & Liao, Yuan, 2021. "Augmented factor models with applications to validating market risk factors and forecasting bond risk premia," Journal of Econometrics, Elsevier, vol. 222(1), pages 269-294.
    8. Yuefeng Han & Rong Chen & Cun-Hui Zhang, 2020. "Rank Determination in Tensor Factor Model," Papers 2011.07131, arXiv.org, revised May 2022.
    9. Peña, Daniel & Smucler, Ezequiel & Yohai, Victor J., 2021. "Sparse estimation of dynamic principal components for forecasting high-dimensional time series," International Journal of Forecasting, Elsevier, vol. 37(4), pages 1498-1508.
    10. Bai, Jushan & Liao, Yuan, 2012. "Efficient Estimation of Approximate Factor Models," MPRA Paper 41558, University Library of Munich, Germany.
    11. Jianqing Fan & Kunpeng Li & Yuan Liao, 2020. "Recent Developments on Factor Models and its Applications in Econometric Learning," Papers 2009.10103, arXiv.org.
    12. Jianqing Fan & Yuan Liao & Han Liu, 2016. "An overview of the estimation of large covariance and precision matrices," Econometrics Journal, Royal Economic Society, vol. 19(1), pages 1-32, February.
    13. Yuefeng Han & Rong Chen & Dan Yang & Cun-Hui Zhang, 2020. "Tensor Factor Model Estimation by Iterative Projection," Papers 2006.02611, arXiv.org, revised May 2022.
    14. Barigozzi, Matteo & Trapani, Lorenzo, 2020. "Sequential testing for structural stability in approximate factor models," Stochastic Processes and their Applications, Elsevier, vol. 130(8), pages 5149-5187.
    15. Stock, J.H. & Watson, M.W., 2016. "Dynamic Factor Models, Factor-Augmented Vector Autoregressions, and Structural Vector Autoregressions in Macroeconomics," Handbook of Macroeconomics, in: J. B. Taylor & Harald Uhlig (ed.), Handbook of Macroeconomics, edition 1, volume 2, chapter 0, pages 415-525, Elsevier.
    16. Matteo Barigozzi & Marc Hallin, 2017. "A network analysis of the volatility of high dimensional financial series," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 66(3), pages 581-605, April.
    17. Choi, Sung Hoon & Kim, Donggyu, 2023. "Large volatility matrix analysis using global and national factor models," Journal of Econometrics, Elsevier, vol. 235(2), pages 1917-1933.
    18. Bai, Jushan & Liao, Yuan, 2017. "Inferences in panel data with interactive effects using large covariance matrices," Journal of Econometrics, Elsevier, vol. 200(1), pages 59-78.
    19. Thomas Conlon & John Cotter & Iason Kynigakis, 2021. "Machine Learning and Factor-Based Portfolio Optimization," Papers 2107.13866, arXiv.org.
    20. Matteo Barigozzi & Marc Hallin, 2015. "Networks, Dynamic Factors, and the Volatility Analysis of High-Dimensional Financial Series," Working Papers ECARES ECARES 2015-34, ULB -- Universite Libre de Bruxelles.

    More about this item

    Keywords

    Model selection consistency; Correlated covariates; Factor model; Regularized M-estimator; Time series;

    JEL classification:

    • C52 - Mathematical and Quantitative Methods / Econometric Modeling / Model Evaluation, Validation, and Selection
    • C58 - Mathematical and Quantitative Methods / Econometric Modeling / Financial Econometrics
