IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2502.15275.html
   My bibliography  Save this paper

A Supervised Screening and Regularized Factor-Based Method for Time Series Forecasting

Author

Listed:
  • Sihan Tu
  • Zhaoxing Gao

Abstract

Factor-based forecasting using Principal Component Analysis (PCA) is an effective machine learning tool for dimension reduction with many applications in statistics, economics, and finance. This paper introduces a Supervised Screening and Regularized Factor-based (SSRF) framework that systematically addresses high-dimensional predictor sets through a structured four-step procedure integrating both static and dynamic forecasting mechanisms. The static approach selects predictors via marginal correlation screening and scales them using univariate predictive slopes, while the dynamic method screens and scales predictors based on time series regression incorporating lagged predictors. PCA then extracts latent factors from the scaled predictors, followed by LASSO regularization to refine predictive accuracy. In the simulation study, we validate the effectiveness of SSRF and identify its parameter adjustment strategies in high-dimensional data settings. An empirical analysis of macroeconomic indices in China demonstrates that the SSRF method generally outperforms several commonly used forecasting techniques in out-of-sample predictions.

Suggested Citation

  • Sihan Tu & Zhaoxing Gao, 2025. "A Supervised Screening and Regularized Factor-Based Method for Time Series Forecasting," Papers 2502.15275, arXiv.org.
  • Handle: RePEc:arx:papers:2502.15275
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2502.15275
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Ledoit, Olivier & Wolf, Michael, 2004. "A well-conditioned estimator for large-dimensional covariance matrices," Journal of Multivariate Analysis, Elsevier, vol. 88(2), pages 365-411, February.
    2. Jianqing Fan & Yuan Liao & Martina Mincheva, 2013. "Large covariance estimation by thresholding principal orthogonal complements," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 75(4), pages 603-680, September.
    3. Philippe Goulet Coulombe & Maxime Leroux & Dalibor Stevanovic & Stéphane Surprenant, 2022. "How is machine learning useful for macroeconomic forecasting?," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 37(5), pages 920-964, August.
    4. Jushan Bai & Serena Ng, 2002. "Determining the Number of Factors in Approximate Factor Models," Econometrica, Econometric Society, vol. 70(1), pages 191-221, January.
    5. Michael W. McCracken & Serena Ng, 2016. "FRED-MD: A Monthly Database for Macroeconomic Research," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 34(4), pages 574-589, October.
    6. Jianqing Fan & Jinchi Lv, 2008. "Sure independence screening for ultrahigh dimensional feature space," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 70(5), pages 849-911, November.
    7. Hui Zou & Trevor Hastie, 2005. "Addendum: Regularization and variable selection via the elastic net," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 67(5), pages 768-768, November.
    8. Dashan Huang & Fuwei Jiang & Kunpeng Li & Guoshi Tong & Guofu Zhou, 2022. "Scaled PCA: A New Approach to Dimension Reduction," Management Science, INFORMS, vol. 68(3), pages 1678-1695, March.
    9. Onatski, Alexei, 2012. "Asymptotics of the principal components estimator of large factor models with weakly influential factors," Journal of Econometrics, Elsevier, vol. 168(2), pages 244-258.
    10. Lam, Clifford & Yao, Qiwei, 2012. "Factor modeling for high-dimensional time series: inference for the number of factors," LSE Research Online Documents on Economics 45684, London School of Economics and Political Science, LSE Library.
    11. Hui Zou & Trevor Hastie, 2005. "Regularization and variable selection via the elastic net," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 67(2), pages 301-320, April.
    12. Fan J. & Li R., 2001. "Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 1348-1360, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Fan, Jianqing & Ke, Yuan & Wang, Kaizheng, 2020. "Factor-adjusted regularized model selection," Journal of Econometrics, Elsevier, vol. 216(1), pages 71-85.
    2. Yongxia Zhang & Qi Wang & Maozai Tian, 2022. "Smoothed Quantile Regression with Factor-Augmented Regularized Variable Selection for High Correlated Data," Mathematics, MDPI, vol. 10(16), pages 1-30, August.
    3. Borup, Daniel & Christensen, Bent Jesper & Mühlbach, Nicolaj Søndergaard & Nielsen, Mikkel Slot, 2023. "Targeting predictors in random forest regression," International Journal of Forecasting, Elsevier, vol. 39(2), pages 841-868.
    4. Jianqing Fan & Kunpeng Li & Yuan Liao, 2020. "Recent Developments on Factor Models and its Applications in Econometric Learning," Papers 2009.10103, arXiv.org.
    5. Jianqing Fan & Yuan Liao & Han Liu, 2016. "An overview of the estimation of large covariance and precision matrices," Econometrics Journal, Royal Economic Society, vol. 19(1), pages 1-32, February.
    6. Fan, Jianqing & Jiang, Bai & Sun, Qiang, 2022. "Bayesian factor-adjusted sparse regression," Journal of Econometrics, Elsevier, vol. 230(1), pages 3-19.
    7. Victor Chernozhukov & Christian Hansen & Yuan Liao, 2015. "A lava attack on the recovery of sums of dense and sparse signals," CeMMAP working papers CWP56/15, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    8. Bai, Jushan & Liao, Yuan, 2016. "Efficient estimation of approximate factor models via penalized maximum likelihood," Journal of Econometrics, Elsevier, vol. 191(1), pages 1-18.
    9. Matteo Barigozzi & Marc Hallin, 2017. "A network analysis of the volatility of high dimensional financial series," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 66(3), pages 581-605, April.
    10. Ricardo P. Masini & Marcelo C. Medeiros & Eduardo F. Mendes, 2023. "Machine learning advances for time series forecasting," Journal of Economic Surveys, Wiley Blackwell, vol. 37(1), pages 76-111, February.
    11. Mogliani, Matteo & Simoni, Anna, 2021. "Bayesian MIDAS penalized regressions: Estimation, selection, and prediction," Journal of Econometrics, Elsevier, vol. 222(1), pages 833-860.
    12. Bae, Juhee, 2024. "Factor-augmented forecasting in big data," International Journal of Forecasting, Elsevier, vol. 40(4), pages 1660-1688.
    13. Thomas Conlon & John Cotter & Iason Kynigakis, 2021. "Machine Learning and Factor-Based Portfolio Optimization," Papers 2107.13866, arXiv.org.
    14. Matteo Barigozzi & Marc Hallin, 2015. "Networks, Dynamic Factors, and the Volatility Analysis of High-Dimensional Financial Series," Papers 1510.05118, arXiv.org, revised Jul 2016.
    15. Duan, Junting & Pelger, Markus & Xiong, Ruoxuan, 2024. "Target PCA: Transfer learning large dimensional panel data," Journal of Econometrics, Elsevier, vol. 244(2).
    16. Christis Katsouris, 2023. "High Dimensional Time Series Regression Models: Applications to Statistical Learning Methods," Papers 2308.16192, arXiv.org.
    17. Li, Hong & Porth, Lysa & Tan, Ken Seng & Zhu, Wenjun, 2021. "Improved index insurance design and yield estimation using a dynamic factor forecasting approach," Insurance: Mathematics and Economics, Elsevier, vol. 96(C), pages 208-221.
    18. Wan, Runzhe & Li, Yingying & Lu, Wenbin & Song, Rui, 2024. "Mining the factor zoo: Estimation of latent factor models with sufficient proxies," Journal of Econometrics, Elsevier, vol. 239(2).
    19. Shuichi Kawano, 2014. "Selection of tuning parameters in bridge regression models via Bayesian information criterion," Statistical Papers, Springer, vol. 55(4), pages 1207-1223, November.
    20. Joseph, Andreas & Potjagailo, Galina & Chakraborty, Chiranjit & Kapetanios, George, 2024. "Forecasting UK inflation bottom up," International Journal of Forecasting, Elsevier, vol. 40(4), pages 1521-1538.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2502.15275. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.