IDEAS home Printed from https://ideas.repec.org/a/inm/ormnsc/v68y2022i3p1678-1695.html
   My bibliography  Save this article

Scaled PCA: A New Approach to Dimension Reduction

Author

Listed:
  • Dashan Huang

    (Lee Kong Chian School of Business, Singapore Management University, 178899, Singapore)

  • Fuwei Jiang

    (School of Finance, Central University of Finance and Economics, 102206 China)

  • Kunpeng Li

    (International School of Economics and Management, Capital University of Economics and Business, 100070 China)

  • Guoshi Tong

    (Fanhai International School of Finance, Fudan University, 200001 China)

  • Guofu Zhou

    (Olin School of Business, Washington University in St. Louis, St. Louis, Missouri 63130)

Abstract

This paper proposes a novel supervised learning technique for forecasting: scaled principal component analysis (sPCA). The sPCA improves the traditional principal component analysis (PCA) by scaling each predictor with its predictive slope on the target to be forecasted. Unlike the PCA that maximizes the common variation of the predictors, the sPCA assigns more weight to those predictors with stronger forecasting power. In a general factor framework, we show that, under some appropriate conditions on data, the sPCA forecast beats the PCA forecast, and when these conditions break down, extensive simulations indicate that the sPCA still has a large chance to outperform the PCA. A real data example on macroeconomic forecasting shows that the sPCA has better performance in general.

Suggested Citation

  • Dashan Huang & Fuwei Jiang & Kunpeng Li & Guoshi Tong & Guofu Zhou, 2022. "Scaled PCA: A New Approach to Dimension Reduction," Management Science, INFORMS, vol. 68(3), pages 1678-1695, March.
  • Handle: RePEc:inm:ormnsc:v:68:y:2022:i:3:p:1678-1695
    DOI: 10.1287/mnsc.2021.4020
    as

    Download full text from publisher

    File URL: http://dx.doi.org/10.1287/mnsc.2021.4020
    Download Restriction: no

    File URL: https://libkey.io/10.1287/mnsc.2021.4020?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Kelly, Bryan T. & Pruitt, Seth & Su, Yinan, 2019. "Characteristics are covariances: A unified model of risk and return," Journal of Financial Economics, Elsevier, vol. 134(3), pages 501-524.
    2. Shihao Gu & Bryan Kelly & Dacheng Xiu, 2020. "Empirical Asset Pricing via Machine Learning," Review of Finance, European Finance Association, vol. 33(5), pages 2223-2273.
    3. Jushan Bai & Serena Ng, 2002. "Determining the Number of Factors in Approximate Factor Models," Econometrica, Econometric Society, vol. 70(1), pages 191-221, January.
    4. Gregory Connor & Matthias Hagmann & Oliver Linton, 2012. "Efficient Semiparametric Estimation of the Fama–French Model and Extensions," Econometrica, Econometric Society, vol. 80(2), pages 713-754, March.
    5. Michael W. McCracken & Serena Ng, 2016. "FRED-MD: A Monthly Database for Macroeconomic Research," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 34(4), pages 574-589, October.
    6. Maurizio Daniele & Winfried Pohlmeier & Aygul Zagidullina, 2018. "Sparse Approximate Factor Estimation for High-Dimensional Covariance Matrices," Working Paper Series of the Department of Economics, University of Konstanz 2018-07, Department of Economics, University of Konstanz.
    7. Dashan Huang & Fuwei Jiang & Jun Tu & Guofu Zhou, 2015. "Investor Sentiment Aligned: A Powerful Predictor of Stock Returns," The Review of Financial Studies, Society for Financial Studies, vol. 28(3), pages 791-837.
    8. Jushan Bai & Serena Ng, 2004. "A PANIC Attack on Unit Roots and Cointegration," Econometrica, Econometric Society, vol. 72(4), pages 1127-1177, July.
    9. Joachim Freyberger & Andreas Neuhierl & Michael Weber & Andrew KarolyiEditor, 2020. "Dissecting Characteristics Nonparametrically," Review of Financial Studies, Society for Financial Studies, vol. 33(5), pages 2326-2377.
    10. Seung C. Ahn & Alex R. Horenstein, 2013. "Eigenvalue Ratio Test for the Number of Factors," Econometrica, Econometric Society, vol. 81(3), pages 1203-1227, May.
    11. Bai, Jushan & Ng, Serena, 2008. "Forecasting economic time series using targeted predictors," Journal of Econometrics, Elsevier, vol. 146(2), pages 304-317, October.
    12. Ludvigson, Sydney C. & Ng, Serena, 2007. "The empirical risk-return relation: A factor analysis approach," Journal of Financial Economics, Elsevier, vol. 83(1), pages 171-222, January.
    13. Shihao Gu & Bryan Kelly & Dacheng Xiu, 2020. "Empirical Asset Pricing via Machine Learning," The Review of Financial Studies, Society for Financial Studies, vol. 33(5), pages 2223-2273.
    14. Gu, Shihao & Kelly, Bryan & Xiu, Dacheng, 2021. "Autoencoder asset pricing models," Journal of Econometrics, Elsevier, vol. 222(1), pages 429-450.
    15. Clark, Todd E. & West, Kenneth D., 2007. "Approximately normal tests for equal predictive accuracy in nested models," Journal of Econometrics, Elsevier, vol. 138(1), pages 291-311, May.
    16. Jushan Bai, 2003. "Inferential Theory for Factor Models of Large Dimensions," Econometrica, Econometric Society, vol. 71(1), pages 135-171, January.
    17. Jushan Bai & Serena Ng, 2006. "Confidence Intervals for Diffusion Index Forecasts and Inference for Factor-Augmented Regressions," Econometrica, Econometric Society, vol. 74(4), pages 1133-1150, July.
    18. Connor, Gregory & Korajczyk, Robert A., 1986. "Performance measurement with the arbitrage pricing theory : A new framework for analysis," Journal of Financial Economics, Elsevier, vol. 15(3), pages 373-394, March.
    19. Kelly, Bryan & Pruitt, Seth, 2015. "The three-pass regression filter: A new approach to forecasting using many predictors," Journal of Econometrics, Elsevier, vol. 186(2), pages 294-316.
    20. Bryan Kelly & Seth Pruitt, 2013. "Market Expectations in the Cross-Section of Present Values," Journal of Finance, American Finance Association, vol. 68(5), pages 1721-1756, October.
    21. Stefano Giglio & Dacheng Xiu, 2021. "Asset Pricing with Omitted Factors," Journal of Political Economy, University of Chicago Press, vol. 129(7), pages 1947-1990.
    22. Nathaniel Light & Denys Maslov & Oleg Rytchkov, 2017. "Aggregation of Information About the Cross Section of Stock Returns: A Latent Variable Approach," The Review of Financial Studies, Society for Financial Studies, vol. 30(4), pages 1339-1381.
    23. Hai Lin & Chunchi Wu & Guofu Zhou, 2018. "Forecasting Corporate Bond Returns with a Large Set of Predictors: An Iterated Combination Approach," Management Science, INFORMS, vol. 64(9), pages 4218-4238, September.
    24. Bai, Jushan, 2004. "Estimating cross-section common stochastic trends in nonstationary panel data," Journal of Econometrics, Elsevier, vol. 122(1), pages 137-183, September.
    25. Markus Pelger, 2020. "Understanding Systematic Risk: A High‐Frequency Approach," Journal of Finance, American Finance Association, vol. 75(4), pages 2179-2220, August.
    26. Onatski, Alexei, 2012. "Asymptotics of the principal components estimator of large factor models with weakly influential factors," Journal of Econometrics, Elsevier, vol. 168(2), pages 244-258.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Liu, Shan & Li, Ziwei, 2023. "Macroeconomic attention and oil futures volatility prediction," Finance Research Letters, Elsevier, vol. 57(C).
    2. Weijia Peng & Chun Yao, 2023. "Sector-level equity returns predictability with machine learning and market contagion measure," Empirical Economics, Springer, vol. 65(4), pages 1761-1798, October.
    3. Lu, Fei & Ma, Feng & Guo, Qiang, 2023. "Less is more? New evidence from stock market volatility predictability," International Review of Financial Analysis, Elsevier, vol. 89(C).
    4. Tan, Xilong & Tao, Yubo, 2023. "Trend-based forecast of cryptocurrency returns," Economic Modelling, Elsevier, vol. 124(C).
    5. Kuppenheimer, Gregory & Shelly, Stuart & Strauss, Jack, 2023. "Can machine learning identify sector-level financial ratios that predict sector returns?," Finance Research Letters, Elsevier, vol. 57(C).
    6. Lu, Xinjie & Ma, Feng & Wang, Tianyang & Wen, Fenghua, 2023. "International stock market volatility: A data-rich environment based on oil shocks," Journal of Economic Behavior & Organization, Elsevier, vol. 214(C), pages 184-215.
    7. Lu, Xinjie & Lang, Qiaoqi, 2023. "Categorial economic policy uncertainty indices or Twitter-based uncertainty indices? Evidence from Chinese stock market," Finance Research Letters, Elsevier, vol. 55(PB).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Zhaoxing Gao & Ruey S. Tsay, 2023. "Supervised Dynamic PCA: Linear Dynamic Forecasting with Many Predictors," Papers 2307.07689, arXiv.org.
    2. Alain-Philippe Fortin & Patrick Gagliardini & O. Scaillet, 2022. "Eigenvalue tests for the number of latent factors in short panels," Swiss Finance Institute Research Paper Series 22-81, Swiss Finance Institute.
    3. Stefano Giglio & Dacheng Xiu, 2017. "Inference on Risk Premia in the Presence of Omitted Factors," NBER Working Papers 23527, National Bureau of Economic Research, Inc.
    4. Catherine Doz & Peter Fuleky, 2019. "Dynamic Factor Models," Working Papers 2019-4, University of Hawaii Economic Research Organization, University of Hawaii at Manoa.
    5. Catherine Doz & Peter Fuleky, 2019. "Dynamic Factor Models," PSE Working Papers halshs-02262202, HAL.
    6. Catherine Doz & Peter Fuleky, 2019. "Dynamic Factor Models," Working Papers halshs-02262202, HAL.
    7. Gagliardini, Patrick & Ossola, Elisa & Scaillet, Olivier, 2019. "A diagnostic criterion for approximate factor structure," Journal of Econometrics, Elsevier, vol. 212(2), pages 503-521.
    8. Yuan Liao & Xinjie Ma & Andreas Neuhierl & Zhentao Shi, 2023. "Economic Forecasts Using Many Noises," Papers 2312.05593, arXiv.org, revised Dec 2023.
    9. Fan, Jianqing & Xue, Lingzhou & Yao, Jiawei, 2017. "Sufficient forecasting using factor models," Journal of Econometrics, Elsevier, vol. 201(2), pages 292-306.
    10. Huang, Dashan & Li, Jiangyuan & Wang, Liyao, 2021. "Are disagreements agreeable? Evidence from information aggregation," Journal of Financial Economics, Elsevier, vol. 141(1), pages 83-101.
    11. Mykola Babiak & Jozef Barunik, 2020. "Deep Learning, Predictability, and Optimal Portfolio Returns," CERGE-EI Working Papers wp677, The Center for Economic Research and Graduate Education - Economics Institute, Prague.
    12. Oleg Rytchkov & Xun Zhong, 2020. "Information Aggregation and P-Hacking," Management Science, INFORMS, vol. 66(4), pages 1605-1626, April.
    13. Matteo Barigozzi & Marc Hallin, 2023. "Dynamic Factor Models: a Genealogy," Papers 2310.17278, arXiv.org, revised Jan 2024.
    14. Clarke, Charles, 2022. "The level, slope, and curve factor model for stocks," Journal of Financial Economics, Elsevier, vol. 143(1), pages 159-187.
    15. Shi, Qi, 2023. "The RP-PCA factors and stock return predictability: An aligned approach," The North American Journal of Economics and Finance, Elsevier, vol. 64(C).
    16. Vigo Pereira, Caio, 2021. "Portfolio efficiency with high-dimensional data as conditioning information," International Review of Financial Analysis, Elsevier, vol. 77(C).
    17. Francisco Corona & Pilar Poncela & Esther Ruiz, 2017. "Determining the number of factors after stationary univariate transformations," Empirical Economics, Springer, vol. 53(1), pages 351-372, August.
    18. Xiaolu Wei & Hongbing Ouyang, 2023. "Forecasting Carbon Price Using Double Shrinkage Methods," IJERPH, MDPI, vol. 20(2), pages 1-20, January.
    19. Ma, Tian & Leong, Wen Jun & Jiang, Fuwei, 2023. "A latent factor model for the Chinese stock market," International Review of Financial Analysis, Elsevier, vol. 87(C).
    20. Gagliardini, Patrick & Ossola, Elisa & Scaillet, Olivier, 2019. "Estimation of large dimensional conditional factor models in finance," Working Papers unige:125031, University of Geneva, Geneva School of Economics and Management.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:inm:ormnsc:v:68:y:2022:i:3:p:1678-1695. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Asher (email available below). General contact details of provider: https://edirc.repec.org/data/inforea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.