IDEAS home Printed from
   My bibliography  Save this article

Optimal shrinkage estimator for high-dimensional mean vector


  • Bodnar, Taras
  • Okhrin, Ostap
  • Parolya, Nestor


In this paper we derive the optimal linear shrinkage estimator for the high-dimensional mean vector using random matrix theory. The results are obtained under the assumption that both the dimension p and the sample size n tend to infinity in such a way that p∕n→c∈(0,∞). Under weak conditions imposed on the underlying data generating mechanism, we find the asymptotic equivalents to the optimal shrinkage intensities and estimate them consistently. The proposed nonparametric estimator for the high-dimensional mean vector has a simple structure and is proven to minimize asymptotically, with probability 1, the quadratic loss when c∈(0,1). When c∈(1,∞) we modify the estimator by using a feasible estimator for the precision covariance matrix. To this end, an exhaustive simulation study and an application to real data are provided where the proposed estimator is compared with known benchmarks from the literature. It turns out that the existing estimators of the mean vector, including the new proposal, converge to the sample mean vector when the true mean vector has an unbounded Euclidean norm.

Suggested Citation

  • Bodnar, Taras & Okhrin, Ostap & Parolya, Nestor, 2019. "Optimal shrinkage estimator for high-dimensional mean vector," Journal of Multivariate Analysis, Elsevier, vol. 170(C), pages 63-79.
  • Handle: RePEc:eee:jmvana:v:170:y:2019:i:c:p:63-79
    DOI: 10.1016/j.jmva.2018.07.004

    Download full text from publisher

    File URL:
    Download Restriction: Full text for ScienceDirect subscribers only

    As the access to this document is restricted, you may want to look for a different version below or search for a different version of it.

    Other versions of this item:

    References listed on IDEAS

    1. Tu, Jun & Zhou, Guofu, 2011. "Markowitz meets Talmud: A combination of sophisticated and naive diversification strategies," Journal of Financial Economics, Elsevier, vol. 99(1), pages 204-215, January.
    2. Jianqing Fan & Yuan Liao & Martina Mincheva, 2013. "Large covariance estimation by thresholding principal orthogonal complements," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 75(4), pages 603-680, September.
    3. Jushan Bai & Serena Ng, 2002. "Determining the Number of Factors in Approximate Factor Models," Econometrica, Econometric Society, vol. 70(1), pages 191-221, January.
    4. Victor DeMiguel & Lorenzo Garlappi & Raman Uppal, 2009. "Optimal Versus Naive Diversification: How Inefficient is the 1-N Portfolio Strategy?," Review of Financial Studies, Society for Financial Studies, vol. 22(5), pages 1915-1953, May.
    5. Friesen, Olga & Löwe, Matthias & Stolz, Michael, 2013. "Gaussian fluctuations for sample covariance matrices with dependent data," Journal of Multivariate Analysis, Elsevier, vol. 114(C), pages 270-287.
    6. Bodnar, Taras & Reiß, Markus, 2016. "Exact and asymptotic tests on a factor model in low and large dimensions with applications," Journal of Multivariate Analysis, Elsevier, vol. 150(C), pages 125-151.
    7. Germán Aneiros & Philippe Vieu, 2015. "Partial linear modelling with multi-functional covariates," Computational Statistics, Springer, vol. 30(3), pages 647-671, September.
    8. Bodnar, Taras & Parolya, Nestor & Schmid, Wolfgang, 2018. "Estimation of the global minimum variance portfolio in high dimensions," European Journal of Operational Research, Elsevier, vol. 266(1), pages 371-390.
    9. Schöne, Alexander & Schmid, Wolfgang, 2000. "On the Joint Distribution of a Quadratic and a Linear Form in Normal Variables," Journal of Multivariate Analysis, Elsevier, vol. 72(2), pages 163-182, February.
    10. G. Aneiros & P. Vieu, 2016. "Sparse nonparametric model for regression with functional covariate," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 28(4), pages 839-859, October.
    11. Jianqing Fan & Jingjin Zhang & Ke Yu, 2012. "Vast Portfolio Selection With Gross-Exposure Constraints," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 107(498), pages 592-606, June.
    12. Wang, Cheng & Tong, Tiejun & Cao, Longbing & Miao, Baiqi, 2014. "Non-parametric shrinkage mean estimation for quadratic loss functions with unknown covariance matrices," Journal of Multivariate Analysis, Elsevier, vol. 125(C), pages 222-232.
    13. Taras Bodnar & Yarema Okhrin & Nestor Parolya, 2016. "Optimal shrinkage-based portfolio selection in high dimensions," Papers 1611.01958,, revised Jul 2018.
    14. F. Ferraty & P. Hall & P. Vieu, 2010. "Most-predictive design points for functional data predictors," Biometrika, Biometrika Trust, vol. 97(4), pages 807-824.
    15. Chamberlain, Gary & Rothschild, Michael, 1983. "Arbitrage, Factor Structure, and Mean-Variance Analysis on Large Asset Markets," Econometrica, Econometric Society, vol. 51(5), pages 1281-1304, September.
    16. Bodnar, Taras & Dette, Holger & Parolya, Nestor, 2016. "Spectral analysis of the Moore–Penrose inverse of a large dimensional sample covariance matrix," Journal of Multivariate Analysis, Elsevier, vol. 148(C), pages 160-172.
    17. Aneiros, Germán & Vieu, Philippe, 2014. "Variable selection in infinite-dimensional problems," Statistics & Probability Letters, Elsevier, vol. 94(C), pages 12-20.
    18. Vieu, Philippe, 2018. "On dimension reduction models for functional data," Statistics & Probability Letters, Elsevier, vol. 136(C), pages 134-138.
    19. Bai, Jushan & Ng, Serena, 2008. "Large Dimensional Factor Analysis," Foundations and Trends(R) in Econometrics, now publishers, vol. 3(2), pages 89-163, June.
    20. Fan, Jianqing & Fan, Yingying & Lv, Jinchi, 2008. "High dimensional covariance matrix estimation using a factor model," Journal of Econometrics, Elsevier, vol. 147(1), pages 186-197, November.
    21. Fourdrinier, Dominique & Strawderman, William E. & Wells, Martin T., 2003. "Robust shrinkage estimation for elliptically symmetric distributions with unknown covariance matrix," Journal of Multivariate Analysis, Elsevier, vol. 85(1), pages 24-39, April.
    Full references (including those not matched with items on IDEAS)


    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.

    Cited by:

    1. Taras Bodnar & Holger Dette & Nestor Parolya & Erik Thors'en, 2019. "Sampling Distributions of Optimal Portfolio Weights and Characteristics in Low and Large Dimensions," Papers 1908.04243,, revised Aug 2019.
    2. Taras Bodnar & Stepan Mazur & Nestor Parolya, 2019. "Central limit theorems for functionals of large sample covariance matrix and mean vector in matrix‐variate location mixture of normal distributions," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 46(2), pages 636-660, June.
    3. Aneiros, Germán & Cao, Ricardo & Fraiman, Ricardo & Genest, Christian & Vieu, Philippe, 2019. "Recent advances in functional data analysis and high-dimensional statistics," Journal of Multivariate Analysis, Elsevier, vol. 170(C), pages 3-9.


    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:jmvana:v:170:y:2019:i:c:p:63-79. See general information about how to correct material in RePEc.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: (Dana Niculescu). General contact details of provider: .

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service hosted by the Research Division of the Federal Reserve Bank of St. Louis . RePEc uses bibliographic data supplied by the respective publishers.