IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2508.14986.html
   My bibliography  Save this paper

Variable selection for minimum-variance portfolios

Author

Listed:
  • Guilherme V. Moura
  • Andr'e P. Santos
  • Hudson S. Torrent

Abstract

Machine learning (ML) methods have been successfully employed in identifying variables that can predict the equity premium of individual stocks. In this paper, we investigate if ML can also be helpful in selecting variables relevant for optimal portfolio choice. To address this question, we parameterize minimum-variance portfolio weights as a function of a large pool of firm-level characteristics as well as their second-order and cross-product transformations, yielding a total of 4,610 predictors. We find that the gains from employing ML to select relevant predictors are substantial: minimum-variance portfolios achieve lower risk relative to sparse specifications commonly considered in the literature, especially when non-linear terms are added to the predictor space. Moreover, some of the selected predictors that help decreasing portfolio risk also increase returns, leading to minimum-variance portfolios with good performance in terms of Shape ratios in some situations. Our evidence suggests that ad-hoc sparsity can be detrimental to the performance of minimum-variance characteristics-based portfolios.

Suggested Citation

  • Guilherme V. Moura & Andr'e P. Santos & Hudson S. Torrent, 2025. "Variable selection for minimum-variance portfolios," Papers 2508.14986, arXiv.org.
  • Handle: RePEc:arx:papers:2508.14986
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2508.14986
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Zou, Hui, 2006. "The Adaptive Lasso and Its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 1418-1429, December.
    2. Harrison Hong & Terence Lim & Jeremy C. Stein, 2000. "Bad News Travels Slowly: Size, Analyst Coverage, and the Profitability of Momentum Strategies," Journal of Finance, American Finance Association, vol. 55(1), pages 265-295, February.
    3. Malcolm Baker & Jeffrey Wurgler, 2006. "Investor Sentiment and the Cross‐Section of Stock Returns," Journal of Finance, American Finance Association, vol. 61(4), pages 1645-1680, August.
    4. Alex Chinco & Adam D. Clark‐Joseph & Mao Ye, 2019. "Sparse Signals in the Cross‐Section of Returns," Journal of Finance, American Finance Association, vol. 74(1), pages 449-492, February.
    5. Nicholas G. Polson & James G. Scott & Jesse Windle, 2014. "The Bayesian bridge," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 76(4), pages 713-733, September.
    6. Shihao Gu & Bryan Kelly & Dacheng Xiu, 2020. "Empirical Asset Pricing via Machine Learning," Review of Finance, European Finance Association, vol. 33(5), pages 2223-2273.
    7. R. David Mclean & Jeffrey Pontiff, 2016. "Does Academic Research Destroy Stock Return Predictability?," Journal of Finance, American Finance Association, vol. 71(1), pages 5-32, February.
    8. Kozak, Serhiy & Nagel, Stefan & Santosh, Shrihari, 2020. "Shrinking the cross-section," Journal of Financial Economics, Elsevier, vol. 135(2), pages 271-292.
    9. Green, Richard C & Hollifield, Burton, 1992. "When Will Mean-Variance Efficient Portfolios Be Well Diversified?," Journal of Finance, American Finance Association, vol. 47(5), pages 1785-1809, December.
    10. Kewei Hou & Chen Xue & Lu Zhang, 2015. "Editor's Choice Digesting Anomalies: An Investment Approach," The Review of Financial Studies, Society for Financial Studies, vol. 28(3), pages 650-705.
    11. Victor DeMiguel & Lorenzo Garlappi & Raman Uppal, 2009. "Optimal Versus Naive Diversification: How Inefficient is the 1-N Portfolio Strategy?," The Review of Financial Studies, Society for Financial Studies, vol. 22(5), pages 1915-1953, May.
    12. Carlos M. Carvalho & Nicholas G. Polson & James G. Scott, 2010. "The horseshoe estimator for sparse signals," Biometrika, Biometrika Trust, vol. 97(2), pages 465-480.
    13. Shihao Gu & Bryan Kelly & Dacheng Xiu, 2020. "Empirical Asset Pricing via Machine Learning," The Review of Financial Studies, Society for Financial Studies, vol. 33(5), pages 2223-2273.
    14. Fama, Eugene F. & French, Kenneth R., 2015. "A five-factor asset pricing model," Journal of Financial Economics, Elsevier, vol. 116(1), pages 1-22.
    15. DeMiguel, Victor & Gil-Bazo, Javier & Nogales, Francisco J. & Santos, André A.P., 2023. "Machine learning and fund characteristics help to select mutual funds with positive alpha," Journal of Financial Economics, Elsevier, vol. 150(3).
    16. Xi Dong & Yan Li & David E. Rapach & Guofu Zhou, 2022. "Anomalies and the Expected Market Return," Journal of Finance, American Finance Association, vol. 77(1), pages 639-681, February.
    17. Ledoit, Oliver & Wolf, Michael, 2008. "Robust performance hypothesis testing with the Sharpe ratio," Journal of Empirical Finance, Elsevier, vol. 15(5), pages 850-859, December.
    18. Louis K.C. Chan & Jason Karceski & Josef Lakonishok, 1999. "On Portfolio Optimization: Forecasting Covariances and Choosing the Risk Model," NBER Working Papers 7039, National Bureau of Economic Research, Inc.
    19. Kewei Hou & Chen Xue & Lu Zhang, 2020. "Replicating Anomalies," The Review of Financial Studies, Society for Financial Studies, vol. 33(5), pages 2019-2133.
    20. Fama, Eugene F. & French, Kenneth R., 1993. "Common risk factors in the returns on stocks and bonds," Journal of Financial Economics, Elsevier, vol. 33(1), pages 3-56, February.
    21. Ravi Jagannathan & Tongshu Ma, 2003. "Risk Reduction in Large Portfolios: Why Imposing the Wrong Constraints Helps," Journal of Finance, American Finance Association, vol. 58(4), pages 1651-1683, August.
    22. Joachim Freyberger & Andreas Neuhierl & Michael Weber, 2020. "Dissecting Characteristics Nonparametrically," The Review of Financial Studies, Society for Financial Studies, vol. 33(5), pages 2326-2377.
    23. Pastor, Lubos & Stambaugh, Robert F., 2000. "Comparing asset pricing models: an investment perspective," Journal of Financial Economics, Elsevier, vol. 56(3), pages 335-381, June.
    24. Kaniel, Ron & Lin, Zihan & Pelger, Markus & Van Nieuwerburgh, Stijn, 2023. "Machine-learning the skill of mutual fund managers," Journal of Financial Economics, Elsevier, vol. 150(1), pages 94-138.
    25. John H. Cochrane, 2011. "Presidential Address: Discount Rates," Journal of Finance, American Finance Association, vol. 66(4), pages 1047-1108, August.
    26. Ricardo P. Masini & Marcelo C. Medeiros & Eduardo F. Mendes, 2023. "Machine learning advances for time series forecasting," Journal of Economic Surveys, Wiley Blackwell, vol. 37(1), pages 76-111, February.
    27. Buchen, Teresa & Wohlrabe, Klaus, 2011. "Forecasting with many predictors: Is boosting a viable alternative?," Economics Letters, Elsevier, vol. 113(1), pages 16-18, October.
    28. Alexander Kempf & Christoph Memmel, 2006. "Estimating the global Minimum Variance Portfolio," Schmalenbach Business Review (sbr), LMU Munich School of Management, vol. 58(4), pages 332-348, October.
    29. Michael W. Brandt & Pedro Santa-Clara & Rossen Valkanov, 2009. "Parametric Portfolio Policies: Exploiting Characteristics in the Cross-Section of Equity Returns," The Review of Financial Studies, Society for Financial Studies, vol. 22(9), pages 3411-3447, September.
    30. Schmid, Matthias & Hothorn, Torsten, 2008. "Boosting additive models using component-wise P-Splines," Computational Statistics & Data Analysis, Elsevier, vol. 53(2), pages 298-311, December.
    31. Yacine AÏT‐SAHALI & Michael W. Brandt, 2001. "Variable Selection for Portfolio Choice," Journal of Finance, American Finance Association, vol. 56(4), pages 1297-1351, August.
    32. David E. Rapach & Jack K. Strauss & Guofu Zhou, 2013. "International Stock Return Predictability: What Is the Role of the United States?," Journal of Finance, American Finance Association, vol. 68(4), pages 1633-1662, August.
    33. Mark Britten‐Jones, 1999. "The Sampling Error in Estimates of Mean‐Variance Efficient Portfolio Weights," Journal of Finance, American Finance Association, vol. 54(2), pages 655-671, April.
    34. Joachim Freyberger & Andreas Neuhierl & Michael Weber & Andrew KarolyiEditor, 2020. "Dissecting Characteristics Nonparametrically," Review of Financial Studies, Society for Financial Studies, vol. 33(5), pages 2326-2377.
    35. Jianqing Fan & Jinchi Lv, 2008. "Sure independence screening for ultrahigh dimensional feature space," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 70(5), pages 849-911, November.
    36. Döpke, Jörg & Fritsche, Ulrich & Pierdzioch, Christian, 2017. "Predicting recessions with boosted regression trees," International Journal of Forecasting, Elsevier, vol. 33(4), pages 745-759.
    37. Pastor, Lubos & Stambaugh, Robert F., 2003. "Liquidity Risk and Expected Stock Returns," Journal of Political Economy, University of Chicago Press, vol. 111(3), pages 642-685, June.
    38. Victor DeMiguel & Alberto Martín-Utrera & Francisco J Nogales & Raman Uppal, 2020. "A Transaction-Cost Perspective on the Multitude of Firm Characteristics," The Review of Financial Studies, Society for Financial Studies, vol. 33(5), pages 2180-2222.
    39. Hui Zou & Trevor Hastie, 2005. "Addendum: Regularization and variable selection via the elastic net," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 67(5), pages 768-768, November.
    40. Jushan Bai & Serena Ng, 2009. "Boosting diffusion indices," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 24(4), pages 607-629.
    41. Fabozzi, Frank J. & Huang, Dashan & Jiang, Fuwei & Wang, Jiexun, 2024. "What difference do new factor models make in portfolio allocation?," Journal of International Money and Finance, Elsevier, vol. 140(C).
    42. repec:bla:jfinan:v:58:y:2003:i:4:p:1651-1684 is not listed on IDEAS
    43. Gu, Shihao & Kelly, Bryan & Xiu, Dacheng, 2021. "Autoencoder asset pricing models," Journal of Econometrics, Elsevier, vol. 222(1), pages 429-450.
    44. Chan, Louis K C & Karceski, Jason & Lakonishok, Josef, 1999. "On Portfolio Optimization: Forecasting Covariances and Choosing the Risk Model," The Review of Financial Studies, Society for Financial Studies, vol. 12(5), pages 937-974.
    45. Victor DeMiguel & Francisco J. Nogales, 2009. "Portfolio Selection with Robust Estimation," Operations Research, INFORMS, vol. 57(3), pages 560-577, June.
    46. Campbell R. Harvey & Akhtar Siddique, 2000. "Conditional Skewness in Asset Pricing Tests," Journal of Finance, American Finance Association, vol. 55(3), pages 1263-1295, June.
    47. Jeremiah Green & John R. M. Hand & X. Frank Zhang, 2017. "The Characteristics that Provide Independent Information about Average U.S. Monthly Stock Returns," The Review of Financial Studies, Society for Financial Studies, vol. 30(12), pages 4389-4436.
    48. Hui Zou & Trevor Hastie, 2005. "Regularization and variable selection via the elastic net," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 67(2), pages 301-320, April.
    49. Fan J. & Li R., 2001. "Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 1348-1360, December.
    50. Victor DeMiguel & Alberto Martín-Utrera & Francisco J Nogales & Raman Uppal & Andrew KarolyiEditor, 2020. "A Transaction-Cost Perspective on the Multitude of Firm Characteristics," Review of Financial Studies, Society for Financial Studies, vol. 33(5), pages 2180-2222.
    51. Clifford M. Hurvich & Jeffrey S. Simonoff & Chih‐Ling Tsai, 1998. "Smoothing parameter selection in nonparametric regression using an improved Akaike information criterion," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 60(2), pages 271-293.
    52. Scherer, Bernd, 2011. "A note on the returns from minimum variance investing," Journal of Empirical Finance, Elsevier, vol. 18(4), pages 652-660, September.
    53. Kauppi, Heikki & Virtanen, Timo, 2021. "Boosting nonlinear predictability of macroeconomic time series," International Journal of Forecasting, Elsevier, vol. 37(1), pages 151-170.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Caldeira, João F. & Santos, André A.P. & Torrent, Hudson S., 2023. "Semiparametric portfolios: Improving portfolio performance by exploiting non-linearities in firm characteristics," Economic Modelling, Elsevier, vol. 122(C).
    2. Doron Avramov & Si Cheng & Lior Metzker, 2023. "Machine Learning vs. Economic Restrictions: Evidence from Stock Return Predictability," Management Science, INFORMS, vol. 69(5), pages 2587-2619, May.
    3. Thomas Conlon & John Cotter & Iason Kynigakis, 2021. "Machine Learning and Factor-Based Portfolio Optimization," Papers 2107.13866, arXiv.org.
    4. Fieberg, Christian & Liedtke, Gerrit & Zaremba, Adam & Cakici, Nusret, 2025. "A factor model for the cross-section of country equity risk premia," Journal of Banking & Finance, Elsevier, vol. 171(C).
    5. Cakici, Nusret & Fieberg, Christian & Metko, Daniel & Zaremba, Adam, 2023. "Machine learning goes global: Cross-sectional return predictability in international stock markets," Journal of Economic Dynamics and Control, Elsevier, vol. 155(C).
    6. Victor DeMiguel & Javier Gil-Bazo & Francisco J. Nogales & André A. P. Santos, 2021. "Can machine learning help to select portfolios of mutual funds?," Economics Working Papers 1772, Department of Economics and Business, Universitat Pompeu Fabra.
    7. Victor DeMiguel & Javier Gil-Bazo & Francisco J. Nogales & André A. P. Santos, 2021. "Can Machine Learning Help to Select Portfolios of Mutual Funds?," Working Papers 1245, Barcelona School of Economics.
    8. Christopher G. Lamoureux & Huacheng Zhang, 2021. "An Empirical Assessment of Characteristics and Optimal Portfolios," Papers 2104.12975, arXiv.org, revised Feb 2024.
    9. Stadtmüller, Immo & Auer, Benjamin R. & Schuhmacher, Frank, 2022. "On the benefits of active stock selection strategies for diversified investors," The Quarterly Review of Economics and Finance, Elsevier, vol. 85(C), pages 342-354.
    10. Chen, Ding & Guo, Biao & Zhou, Guofu, 2023. "Firm fundamentals and the cross-section of implied volatility shapes," Journal of Financial Markets, Elsevier, vol. 63(C).
    11. Xi Dong & Yan Li & David E. Rapach & Guofu Zhou, 2022. "Anomalies and the Expected Market Return," Journal of Finance, American Finance Association, vol. 77(1), pages 639-681, February.
    12. DeMiguel, Victor & Gil-Bazo, Javier & Nogales, Francisco J. & Santos, André A.P., 2023. "Machine learning and fund characteristics help to select mutual funds with positive alpha," Journal of Financial Economics, Elsevier, vol. 150(3).
    13. Ni, Xuanming & Zheng, Tiantian & Zhao, Huimin & Zhu, Shushang, 2023. "High-dimensional portfolio optimization based on tree-structured factor model," Pacific-Basin Finance Journal, Elsevier, vol. 81(C).
    14. Doron Avramov & Guy Kaplanski & Avanidhar Subrahmanyam, 2022. "Postfundamentals Price Drift in Capital Markets: A Regression Regularization Perspective," Management Science, INFORMS, vol. 68(10), pages 7658-7681, October.
    15. Doron Avramov & Si Cheng & Lior Metzker & Stefan Voigt, 2023. "Integrating Factor Models," Journal of Finance, American Finance Association, vol. 78(3), pages 1593-1646, June.
    16. Sak, Halis & Huang, Tao & Chng, Michael T., 2024. "Exploring the factor zoo with a machine-learning portfolio," International Review of Financial Analysis, Elsevier, vol. 96(PA).
    17. De Nard, Gianluca & Zhao, Zhao, 2023. "Using, taming or avoiding the factor zoo? A double-shrinkage estimator for covariance matrices," Journal of Empirical Finance, Elsevier, vol. 72(C), pages 23-35.
    18. Matteo Bagnara, 2024. "Asset Pricing and Machine Learning: A critical review," Journal of Economic Surveys, Wiley Blackwell, vol. 38(1), pages 27-56, February.
    19. Petropoulos, Fotios & Apiletti, Daniele & Assimakopoulos, Vassilios & Babai, Mohamed Zied & Barrow, Devon K. & Ben Taieb, Souhaib & Bergmeir, Christoph & Bessa, Ricardo J. & Bijak, Jakub & Boylan, Joh, 2022. "Forecasting: theory and practice," International Journal of Forecasting, Elsevier, vol. 38(3), pages 705-871.
      • Fotios Petropoulos & Daniele Apiletti & Vassilios Assimakopoulos & Mohamed Zied Babai & Devon K. Barrow & Souhaib Ben Taieb & Christoph Bergmeir & Ricardo J. Bessa & Jakub Bijak & John E. Boylan & Jet, 2020. "Forecasting: theory and practice," Papers 2012.03854, arXiv.org, revised Jan 2022.
    20. Wang, Jianqiu & Wu, Ke & Tong, Guoshi & Chen, Dongxu, 2023. "Nonlinearity in the cross-section of stock returns: Evidence from China," International Review of Economics & Finance, Elsevier, vol. 85(C), pages 174-205.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2508.14986. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.