IDEAS home Printed from https://ideas.repec.org/a/wly/jforec/v45y2026i2p547-562.html

Stock Return Forecasting: A Supervised PCA With Selecting and Scaling

Author

Listed:
  • Ting Zhang
  • Haibin Xie

Abstract

This paper proposes a sparse scaled principal component analysis (PCA) to forecast stock returns. The sparse scaled PCA is a modification to the common PCA by first selecting the important predictors and then scaling the selected predictors according to their predictive power on the target to be forecasted. An advantage of the proposed sparse scaled PCA is that it takes the benefits of both scaled PCA and supervised PCA. An empirical study is conducted on the US stock market to evaluate its empirical performance, and the results confirm the superiority of sparse scaled PCA over a variety of dimension‐reduction techniques, including PCA, PLS, scaled PCA, and supervised PCA in both in‐sample and out‐of‐sample forecasting. Economic value analysis shows that the outperformance can yield economic utility gains at a reasonable transaction cost.

Suggested Citation

  • Ting Zhang & Haibin Xie, 2026. "Stock Return Forecasting: A Supervised PCA With Selecting and Scaling," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 45(2), pages 547-562, March.
  • Handle: RePEc:wly:jforec:v:45:y:2026:i:2:p:547-562
    DOI: 10.1002/for.70050
    as

    Download full text from publisher

    File URL: https://doi.org/10.1002/for.70050
    Download Restriction: no

    File URL: https://libkey.io/10.1002/for.70050?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. repec:bla:jfinan:v:53:y:1998:i:5:p:1563-1587 is not listed on IDEAS
    2. Ferreira, Miguel A. & Santa-Clara, Pedro, 2011. "Forecasting stock market returns: The sum of the parts is more than the whole," Journal of Financial Economics, Elsevier, vol. 100(3), pages 514-537, June.
    3. Guo, Yangli & He, Feng & Liang, Chao & Ma, Feng, 2022. "Oil price volatility predictability: New evidence from a scaled PCA approach," Energy Economics, Elsevier, vol. 105(C).
    4. Hodrick, Robert J, 1992. "Dividend Yields and Expected Stock Returns: Alternative Procedures for Inference and Measurement," The Review of Financial Studies, Society for Financial Studies, vol. 5(3), pages 357-386.
    5. Shihao Gu & Bryan Kelly & Dacheng Xiu, 2020. "Empirical Asset Pricing via Machine Learning," Review of Finance, European Finance Association, vol. 33(5), pages 2223-2273.
    6. Dashan Huang & Fuwei Jiang & Jun Tu & Guofu Zhou, 2015. "Investor Sentiment Aligned: A Powerful Predictor of Stock Returns," The Review of Financial Studies, Society for Financial Studies, vol. 28(3), pages 791-837.
    7. Robert J. Shiller, 1984. "Stock Prices and Social Dynamics," Brookings Papers on Economic Activity, Economic Studies Program, The Brookings Institution, vol. 15(2), pages 457-510.
    8. Christopher J. Neely & David E. Rapach & Jun Tu & Guofu Zhou, 2014. "Forecasting the Equity Risk Premium: The Role of Technical Indicators," Management Science, INFORMS, vol. 60(7), pages 1772-1791, July.
    9. Ivo Welch & Amit Goyal, 2008. "A Comprehensive Look at The Empirical Performance of Equity Premium Prediction," The Review of Financial Studies, Society for Financial Studies, vol. 21(4), pages 1455-1508, July.
    10. Stefano Giglio & Dacheng Xiu, 2021. "Asset Pricing with Omitted Factors," Journal of Political Economy, University of Chicago Press, vol. 129(7), pages 1947-1990.
    11. Amit Goyal & Ivo Welch & Athanasse Zafirov, 2024. "A Comprehensive 2022 Look at the Empirical Performance of Equity Premium Prediction," The Review of Financial Studies, Society for Financial Studies, vol. 37(11), pages 3490-3557.
    12. Dillon Huddleston & Fred Liu & Lars Stentoft, 2023. "Intraday Market Predictability: A Machine Learning Approach," Journal of Financial Econometrics, Oxford University Press, vol. 21(2), pages 485-527.
    13. Chen, Jian & Tang, Guohao & Yao, Jiaquan & Zhou, Guofu, 2022. "Investor Attention and Stock Returns," Journal of Financial and Quantitative Analysis, Cambridge University Press, vol. 57(2), pages 455-484, March.
    14. Nathaniel Light & Denys Maslov & Oleg Rytchkov, 2017. "Aggregation of Information About the Cross Section of Stock Returns: A Latent Variable Approach," The Review of Financial Studies, Society for Financial Studies, vol. 30(4), pages 1339-1381.
    15. Hai Lin & Chunchi Wu & Guofu Zhou, 2018. "Forecasting Corporate Bond Returns with a Large Set of Predictors: An Iterated Combination Approach," Management Science, INFORMS, vol. 64(9), pages 4218-4238, September.
    16. He, Mengxi & Zhang, Yaojie & Wen, Danyan & Wang, Yudong, 2021. "Forecasting crude oil prices: A scaled PCA approach," Energy Economics, Elsevier, vol. 97(C).
    17. Ai He & Dashan Huang & Jiaen Li & Guofu Zhou, 2023. "Shrinking Factor Dimension: A Reduced-Rank Approach," Management Science, INFORMS, vol. 69(9), pages 5501-5522, September.
    18. Fama, Eugene F, 1981. "Stock Returns, Real Activity, Inflation, and Money," American Economic Review, American Economic Association, vol. 71(4), pages 545-565, September.
    19. McCracken, Michael W., 2007. "Asymptotics for out of sample tests of Granger causality," Journal of Econometrics, Elsevier, vol. 140(2), pages 719-752, October.
    20. Campbell, John Y., 1987. "Stock returns and the term structure," Journal of Financial Economics, Elsevier, vol. 18(2), pages 373-399, June.
    21. Campbell, John Y. & Yogo, Motohiro, 2006. "Efficient tests of stock return predictability," Journal of Financial Economics, Elsevier, vol. 81(1), pages 27-60, July.
    22. repec:plo:pbio00:0020108 is not listed on IDEAS
    23. Lewellen, Jonathan, 2004. "Predicting returns with financial ratios," Journal of Financial Economics, Elsevier, vol. 74(2), pages 209-235, November.
    24. Shiller, Robert J, 1981. "Do Stock Prices Move Too Much to be Justified by Subsequent Changes in Dividends?," American Economic Review, American Economic Association, vol. 71(3), pages 421-436, June.
    25. Dashan Huang & Fuwei Jiang & Kunpeng Li & Guoshi Tong & Guofu Zhou, 2022. "Scaled PCA: A New Approach to Dimension Reduction," Management Science, INFORMS, vol. 68(3), pages 1678-1695, March.
    26. Andrew Detzel & Hong Liu & Jack Strauss & Guofu Zhou & Yingzi Zhu, 2021. "Learning and predictability via technical analysis: Evidence from bitcoin and stocks with hard‐to‐value fundamentals," Financial Management, Financial Management Association International, vol. 50(1), pages 107-137, March.
    27. Aït-Sahalia, Yacine & Xiu, Dacheng, 2017. "Using principal component analysis to estimate a high dimensional factor model with high-frequency data," Journal of Econometrics, Elsevier, vol. 201(2), pages 384-399.
    28. Clark, Todd E. & West, Kenneth D., 2007. "Approximately normal tests for equal predictive accuracy in nested models," Journal of Econometrics, Elsevier, vol. 138(1), pages 291-311, May.
    29. Bair, Eric & Hastie, Trevor & Paul, Debashis & Tibshirani, Robert, 2006. "Prediction by Supervised Principal Components," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 119-137, March.
    30. Mehra, Rajnish & Prescott, Edward C., 1985. "The equity premium: A puzzle," Journal of Monetary Economics, Elsevier, vol. 15(2), pages 145-161, March.
    31. LeRoy, Stephen F & Porter, Richard D, 1981. "The Present-Value Relation: Tests Based on Implied Variance Bounds," Econometrica, Econometric Society, vol. 49(3), pages 555-574, May.
    32. Ball, Ray, 1978. "Anomalies in relationships between securities' yields and yield-surrogates," Journal of Financial Economics, Elsevier, vol. 6(2-3), pages 103-126.
    33. Shihao Gu & Bryan Kelly & Dacheng Xiu, 2020. "Empirical Asset Pricing via Machine Learning," The Review of Financial Studies, Society for Financial Studies, vol. 33(5), pages 2223-2273.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Wen, Danyan & He, Mengxi & Wang, Yudong & Zhang, Yaojie, 2024. "Forecasting crude oil market volatility: A comprehensive look at uncertainty variables," International Journal of Forecasting, Elsevier, vol. 40(3), pages 1022-1041.
    2. Yuan, Ying & Qu, Yong & Wang, Tianyang, 2025. "Predicting risk premiums: A constraint-based model," Journal of Empirical Finance, Elsevier, vol. 83(C).
    3. Yuan Zhao & Xue Gong & Weiguo Zhang & Weijun Xu, 2025. "Stock return forecasting based on the proxy variables of category factors," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 11(1), pages 1-48, December.
    4. Smith, Simon C., 2021. "International stock return predictability," International Review of Financial Analysis, Elsevier, vol. 78(C).
    5. Maung, Kenwin & Swanson, Norman R., 2025. "A survey of models and methods used for forecasting when investing in financial markets," International Journal of Forecasting, Elsevier, vol. 41(4), pages 1355-1382.
    6. Murphy, Austin & AlSalman, Zeina & Souropanis, Ioannis, 2025. "An investigation into the causes of stock market return deviations from real earnings yields," International Review of Economics & Finance, Elsevier, vol. 102(C).
    7. Babiak, Mykola & Baruník, Jozef, 2026. "Deep learning, predictability, and optimal portfolio returns," Journal of Empirical Finance, Elsevier, vol. 87(C).
    8. Wen, Danyan & Liu, Li & Wang, Yudong & Zhang, Yaojie, 2022. "Forecasting crude oil market returns: Enhanced moving average technical indicators," Resources Policy, Elsevier, vol. 76(C).
    9. Shi, Qi, 2023. "The RP-PCA factors and stock return predictability: An aligned approach," The North American Journal of Economics and Finance, Elsevier, vol. 64(C).
    10. Kuppenheimer, Gregory & Shelly, Stuart & Strauss, Jack, 2023. "Can machine learning identify sector-level financial ratios that predict sector returns?," Finance Research Letters, Elsevier, vol. 57(C).
    11. Ma, Feng & Wang, Ruoxin & Lu, Xinjie & Wahab, M.I.M., 2021. "A comprehensive look at stock return predictability by oil prices using economic constraint approaches," International Review of Financial Analysis, Elsevier, vol. 78(C).
    12. Li, Xiao-Xin & Xie, Chi & Wang, Gang-Jin & Zhu, You & Li, Zhao-Chen & Zhang, Zhi-Yu, 2025. "Enhancing stock market return predictability by using a novel autoencoder-based aggregate EPU index," Pacific-Basin Finance Journal, Elsevier, vol. 93(C).
    13. Dai, Zhifeng & Jiang, Fuwei & Kang, Jie & Xue, Bowen, 2025. "Stock return predictability in the frequency domain," International Journal of Forecasting, Elsevier, vol. 41(3), pages 1126-1147.
    14. Dai, Zhifeng & Zhu, Huan, 2020. "Stock return predictability from a mixed model perspective," Pacific-Basin Finance Journal, Elsevier, vol. 60(C).
    15. Fernandez, Pablo & Aguirreamalloa, Javier & Liechtenstein, Heinrich, 2009. "The equity premium puzzle: High required equity premium, undervaluation and self fulfilling prophecy," IESE Research Papers D/821, IESE Business School.
    16. Jiang, Fuwei & Lee, Joshua & Martin, Xiumin & Zhou, Guofu, 2019. "Manager sentiment and stock returns," Journal of Financial Economics, Elsevier, vol. 132(1), pages 126-149.
    17. Kothari, Pratik & O’Doherty, Michael S., 2023. "Job postings and aggregate stock returns," Journal of Financial Markets, Elsevier, vol. 64(C).
    18. Tan, Xilong & Tao, Yubo, 2023. "Trend-based forecast of cryptocurrency returns," Economic Modelling, Elsevier, vol. 124(C).
    19. Yong Qu & Ying Yuan, 2025. "Predicting Equity Premium: A New Momentum Indicator Selection Strategy With Machine Learning," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 44(2), pages 424-435, March.
    20. Gong, Xue & Ye, Xin & Zhang, Weiguo & Zhang, Yue, 2023. "Predicting energy futures high-frequency volatility using technical indicators: The role of interaction," Energy Economics, Elsevier, vol. 119(C).

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:wly:jforec:v:45:y:2026:i:2:p:547-562. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www3.interscience.wiley.com/cgi-bin/jhome/2966 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.