IDEAS home Printed from https://ideas.repec.org/a/eee/ecolet/v255y2025ics0165176525003179.html

A correlation-robust shrinkage estimator: Oracle inequality and an application on out-of-sample factor selection

Author

Listed:
  • Sun, Chuanping

Abstract

Shrinkage methods are widely used in big data to achieve sparse variable selection and reduce overfitting. However, these methods, such as LASSO (Tibshirani, 1996), often struggle when faced with highly correlated predictors. In this paper, we examine a recently developed machine learning estimator that is robust to highly correlated variables, providing superior out-of-sample performance compared to traditional shrinkage techniques. We establish the asymptotic properties of this estimator under general conditions, including i.i.d. sub-Gaussianity. Empirically, we demonstrate the practical benefits of this approach in selecting factors to construct hedged portfolios, achieving significantly higher Sharpe ratios compared to benchmarks such as LASSO, Ridge, and Elastic Net in an out-of-sample context.

Suggested Citation

  • Sun, Chuanping, 2025. "A correlation-robust shrinkage estimator: Oracle inequality and an application on out-of-sample factor selection," Economics Letters, Elsevier, vol. 255(C).
  • Handle: RePEc:eee:ecolet:v:255:y:2025:i:c:s0165176525003179
    DOI: 10.1016/j.econlet.2025.112480
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0165176525003179
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.econlet.2025.112480?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. Guanhao Feng & Stefano Giglio & Dacheng Xiu, 2020. "Taming the Factor Zoo: A Test of New Factors," Journal of Finance, American Finance Association, vol. 75(3), pages 1327-1370, June.
    2. Andrii Babii & Eric Ghysels & Jonas Striaukas, 2022. "Machine Learning Time Series Regressions With an Application to Nowcasting," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 40(3), pages 1094-1106, June.
    3. Ludvigson, Sydney C., 2013. "Advances in Consumption-Based Asset Pricing: Empirical Tests," Handbook of the Economics of Finance, in: G.M. Constantinides & M. Harris & R. M. Stulz (ed.), Handbook of the Economics of Finance, volume 2, chapter 0, pages 799-906, Elsevier.
    4. Joachim Freyberger & Andreas Neuhierl & Michael Weber & Andrew KarolyiEditor, 2020. "Dissecting Characteristics Nonparametrically," Review of Financial Studies, Society for Financial Studies, vol. 33(5), pages 2326-2377.
    5. Fan, Jianqing & Ke, Yuan & Wang, Kaizheng, 2020. "Factor-adjusted regularized model selection," Journal of Econometrics, Elsevier, vol. 216(1), pages 71-85.
    6. Bryzgalova, Svetlana & Huang, Jiantao & Julliard, Christian, 2023. "Bayesian solutions for the factor zoo: we just ran two quadrillion models," LSE Research Online Documents on Economics 126151, London School of Economics and Political Science, LSE Library.
    7. Alex Chinco & Adam D. Clark‐Joseph & Mao Ye, 2019. "Sparse Signals in the Cross‐Section of Returns," Journal of Finance, American Finance Association, vol. 74(1), pages 449-492, February.
    8. Sun, Chuanping, 2024. "Factor correlation and the cross section of asset returns: A correlation-robust machine learning approach," Journal of Empirical Finance, Elsevier, vol. 77(C).
    9. Hui Zou & Trevor Hastie, 2005. "Addendum: Regularization and variable selection via the elastic net," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 67(5), pages 768-768, November.
    10. Kock, Anders Bredahl, 2016. "Oracle inequalities, variable selection and uniform inference in high-dimensional correlated random effects panel data models," Journal of Econometrics, Elsevier, vol. 195(1), pages 71-85.
    11. Svetlana Bryzgalova & Jiantao Huang & Christian Julliard, 2023. "Bayesian Solutions for the Factor Zoo: We Just Ran Two Quadrillion Models," Journal of Finance, American Finance Association, vol. 78(1), pages 487-557, February.
    12. Kock, Anders Bredahl & Tang, Haihan, 2019. "Uniform Inference In High-Dimensional Dynamic Panel Data Models With Approximately Sparse Fixed Effects," Econometric Theory, Cambridge University Press, vol. 35(2), pages 295-359, April.
    13. Hui Zou & Trevor Hastie, 2005. "Regularization and variable selection via the elastic net," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 67(2), pages 301-320, April.
    14. Ming Yuan & Yi Lin, 2006. "Model selection and estimation in regression with grouped variables," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 68(1), pages 49-67, February.
    15. Andrew Y. Chen & Tom Zimmermann, 2022. "Open Source Cross-Sectional Asset Pricing," Critical Finance Review, now publishers, vol. 11(2), pages 207-264, May.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Sun, Chuanping, 2024. "Factor correlation and the cross section of asset returns: A correlation-robust machine learning approach," Journal of Empirical Finance, Elsevier, vol. 77(C).
    2. Ricardo P. Masini & Marcelo C. Medeiros & Eduardo F. Mendes, 2023. "Machine learning advances for time series forecasting," Journal of Economic Surveys, Wiley Blackwell, vol. 37(1), pages 76-111, February.
    3. Mogliani, Matteo & Simoni, Anna, 2021. "Bayesian MIDAS penalized regressions: Estimation, selection, and prediction," Journal of Econometrics, Elsevier, vol. 222(1), pages 833-860.
    4. Thomas Conlon & John Cotter & Iason Kynigakis, 2021. "Machine Learning and Factor-Based Portfolio Optimization," Papers 2107.13866, arXiv.org.
    5. Hoang, Daniel & Wiegratz, Kevin, 2022. "Machine learning methods in finance: Recent applications and prospects," Working Paper Series in Economics 158, Karlsruhe Institute of Technology (KIT), Department of Economics and Management.
    6. Chen, Ding & Guo, Biao & Zhou, Guofu, 2023. "Firm fundamentals and the cross-section of implied volatility shapes," Journal of Financial Markets, Elsevier, vol. 63(C).
    7. Bakalli, Gaetan & Guerrier, Stéphane & Scaillet, Olivier, 2023. "A penalized two-pass regression to predict stock returns with time-varying risk premia," Journal of Econometrics, Elsevier, vol. 237(2).
    8. Lee, Ji Hyung & Shi, Zhentao & Gao, Zhan, 2022. "On LASSO for predictive regression," Journal of Econometrics, Elsevier, vol. 229(2), pages 322-349.
    9. David A. Mascio & Marat Molyboga & Frank J. Fabozzi, 2023. "The battle of the factors: Macroeconomic variables or investor sentiment?," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 42(8), pages 2280-2291, December.
    10. Hafner, Christian M. & Wang, Linqi, 2024. "Dynamic portfolio selection with sector-specific regularization," Econometrics and Statistics, Elsevier, vol. 32(C), pages 17-33.
    11. Doron Avramov & Guy Kaplanski & Avanidhar Subrahmanyam, 2022. "Postfundamentals Price Drift in Capital Markets: A Regression Regularization Perspective," Management Science, INFORMS, vol. 68(10), pages 7658-7681, October.
    12. Bang, Jeongseok & Kang, Yeonchan & Ryu, Doojin, 2024. "Potential pricing factors in the Korean market," Finance Research Letters, Elsevier, vol. 67(PB).
    13. Christis Katsouris, 2023. "High Dimensional Time Series Regression Models: Applications to Statistical Learning Methods," Papers 2308.16192, arXiv.org.
    14. Bo Yu & Dayong Zhang & Qiang Ji, 2025. "Forecasting portfolio variance: a new decomposition approach," Annals of Operations Research, Springer, vol. 348(1), pages 543-578, May.
    15. Matteo Bagnara, 2024. "Asset Pricing and Machine Learning: A critical review," Journal of Economic Surveys, Wiley Blackwell, vol. 38(1), pages 27-56, February.
    16. Petropoulos, Fotios & Apiletti, Daniele & Assimakopoulos, Vassilios & Babai, Mohamed Zied & Barrow, Devon K. & Ben Taieb, Souhaib & Bergmeir, Christoph & Bessa, Ricardo J. & Bijak, Jakub & Boylan, Joh, 2022. "Forecasting: theory and practice," International Journal of Forecasting, Elsevier, vol. 38(3), pages 705-871.
      • Fotios Petropoulos & Daniele Apiletti & Vassilios Assimakopoulos & Mohamed Zied Babai & Devon K. Barrow & Souhaib Ben Taieb & Christoph Bergmeir & Ricardo J. Bessa & Jakub Bijak & John E. Boylan & Jet, 2020. "Forecasting: theory and practice," Papers 2012.03854, arXiv.org, revised Jan 2022.
    17. Pan, Zhiyuan & Zhong, Hao & Wang, Yudong & Huang, Juan, 2024. "Forecasting oil futures returns with news," Energy Economics, Elsevier, vol. 134(C).
    18. Wang, Jinzhe & Zhu, Yifeng, 2024. "A comparison of factor models in China," Journal of Empirical Finance, Elsevier, vol. 79(C).
    19. Alexander Dickerson & Christian Julliard & Philippe Mueller, 2026. "The Co-Pricing Factor Zoo," Papers 2604.04430, arXiv.org.
    20. Jia, Yuecheng & Wu, Yangru & Yan, Shu & Liu, Yuzheng, 2023. "A seesaw effect in the cryptocurrency market: Understanding the return cross predictability of cryptocurrencies," Journal of Empirical Finance, Elsevier, vol. 74(C).

    More about this item

    Keywords

    ;
    ;
    ;

    JEL classification:

    • C52 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Model Evaluation, Validation, and Selection
    • C55 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Large Data Sets: Modeling and Analysis
    • G12 - Financial Economics - - General Financial Markets - - - Asset Pricing; Trading Volume; Bond Interest Rates

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:ecolet:v:255:y:2025:i:c:s0165176525003179. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/ecolet .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.