IDEAS home Printed from https://ideas.repec.org/p/qmw/qmwecw/764.html
   My bibliography  Save this paper

A Multiple Testing Approach to the Regularisation of Large Sample Correlation Matrices

Author

Listed:
  • Natalia Bailey

    (Queen Mary University of London)

  • M. Hashem Pesaran

    (University of Southern California, and University of Cambridge)

  • L. Vanessa Smith

    (University of York)

Abstract

This paper proposes a regularisation method for the estimation of large covariance matrices that uses insights from the multiple testing (MT) literature. The approach tests the statistical significance of individual pair-wise correlations and sets to zero those elements that are not statistically significant, taking account of the multiple testing nature of the problem. By using the inverse of the normal distribution at a predetermined significance level, it circumvents the challenge of estimating the theoretical constant arising in the rate of convergence of existing thresholding estimators, and hence it is easy to implement and does not require cross-validation. The MT estimator of the sample correlation matrix is shown to be consistent in the spectral and Frobenius norms, and in terms of support recovery, so long as the true covariance matrix is sparse. The performance of the proposed MT estimator is compared to a number of other estimators in the literature using Monte Carlo experiments. It is shown that the MT estimator performs well and tends to outperform the other estimators, particularly when the cross section dimension, N, is larger than the time series dimension, T.

Suggested Citation

  • Natalia Bailey & M. Hashem Pesaran & L. Vanessa Smith, 2015. "A Multiple Testing Approach to the Regularisation of Large Sample Correlation Matrices," Working Papers 764, Queen Mary University of London, School of Economics and Finance.
  • Handle: RePEc:qmw:qmwecw:764
    as

    Download full text from publisher

    File URL: https://www.qmul.ac.uk/sef/media/econ/research/workingpapers/2015/items/wp764.pdf
    Download Restriction: no
    ---><---

    Other versions of this item:

    References listed on IDEAS

    as
    1. Ledoit, Olivier & Wolf, Michael, 2004. "A well-conditioned estimator for large-dimensional covariance matrices," Journal of Multivariate Analysis, Elsevier, vol. 88(2), pages 365-411, February.
    2. Jianqing Fan & Yuan Liao & Martina Mincheva, 2013. "Large covariance estimation by thresholding principal orthogonal complements," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 75(4), pages 603-680, September.
    3. Filippo di Mauro & L. Vanessa Smith & Stephane Dees & M. Hashem Pesaran, 2007. "Exploring the international linkages of the euro area: a global VAR analysis," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 22(1), pages 1-38.
    4. Enrique Sentana, 2009. "The econometrics of mean-variance efficiency tests: a survey," Econometrics Journal, Royal Economic Society, vol. 12(3), pages 65-101, November.
    5. Jacob Bien & Robert J. Tibshirani, 2011. "Sparse estimation of a covariance matrix," Biometrika, Biometrika Trust, vol. 98(4), pages 807-820.
    6. Natalia Bailey & Sean Holly & M. Hashem Pesaran, 2016. "A Two‐Stage Approach to Spatio‐Temporal Analysis with Strong and Weak Cross‐Sectional Dependence," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 31(1), pages 249-280, January.
    7. M. Hashem Pesaran, 2015. "Testing Weak Cross-Sectional Dependence in Large Panels," Econometric Reviews, Taylor & Francis Journals, vol. 34(6-10), pages 1089-1117, December.
    8. Alexander Chudik & M. Hashem Pesaran & Elisa Tosetti, 2011. "Weak and strong cross‐section dependence and estimation of large panels," Econometrics Journal, Royal Economic Society, vol. 14(1), pages 45-90, February.
    9. Pesaran, M.H., 2010. "Conditional Volatility and Correlations of Weekly Returns and the VaR Analysis of 2008 Stock Market," Cambridge Working Papers in Economics 1025, Faculty of Economics, University of Cambridge.
    10. Ole E. Barndorff‐Nielsen & Neil Shephard, 2002. "Econometric analysis of realized volatility and its use in estimating stochastic volatility models," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 64(2), pages 253-280, May.
    11. Fan J. & Li R., 2001. "Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 1348-1360, December.
    12. M. Hashem Pesaran, 2006. "Estimation and Inference in Large Heterogeneous Panels with a Multifactor Error Structure," Econometrica, Econometric Society, vol. 74(4), pages 967-1012, July.
    13. Romano, Joseph P. & Shaikh, Azeem M. & Wolf, Michael, 2008. "Formalized Data Snooping Based On Generalized Error Rates," Econometric Theory, Cambridge University Press, vol. 24(2), pages 404-447, April.
    14. Torben G. Andersen & Tim Bollerslev & Francis X. Diebold & Paul Labys, 2003. "Modeling and Forecasting Realized Volatility," Econometrica, Econometric Society, vol. 71(2), pages 579-625, March.
    15. Pesaran M.H. & Schuermann T. & Weiner S.M., 2004. "Modeling Regional Interdependencies Using a Global Error-Correcting Macroeconometric Model," Journal of Business & Economic Statistics, American Statistical Association, vol. 22, pages 129-162, April.
    16. Schäfer Juliane & Strimmer Korbinian, 2005. "A Shrinkage Approach to Large-Scale Covariance Matrix Estimation and Implications for Functional Genomics," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 4(1), pages 1-32, November.
    17. Cai, Tony & Liu, Weidong, 2011. "Adaptive Thresholding for Sparse Covariance Matrix Estimation," Journal of the American Statistical Association, American Statistical Association, vol. 106(494), pages 672-684.
    18. Aurore Delaigle & Peter Hall & Jiashun Jin, 2011. "Robustness and accuracy of methods for high dimensional data analysis based on Student's t‐statistic," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 73(3), pages 283-301, June.
    19. Lam, Clifford & Fan, Jianqing, 2009. "Sparsistency and rates of convergence in large covariance matrix estimation," LSE Research Online Documents on Economics 31540, London School of Economics and Political Science, LSE Library.
    20. Joseph P. Romano & Michael Wolf, 2005. "Exact and Approximate Stepdown Methods for Multiple Hypothesis Testing," Journal of the American Statistical Association, American Statistical Association, vol. 100, pages 94-108, March.
    21. Ledoit, Olivier & Wolf, Michael, 2003. "Improved estimation of the covariance matrix of stock returns with an application to portfolio selection," Journal of Empirical Finance, Elsevier, vol. 10(5), pages 603-621, December.
    22. Raymond J. Carroll, 2003. "Variances Are Not Always Nuisance Parameters," Biometrics, The International Biometric Society, vol. 59(2), pages 211-220, June.
    23. Rothman, Adam J. & Levina, Elizaveta & Zhu, Ji, 2009. "Generalized Thresholding of Large Covariance Matrices," Journal of the American Statistical Association, American Statistical Association, vol. 104(485), pages 177-186.
    24. Jan R. Magnus, 1978. "The moments of products of quadratic forms in normal variables," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 32(4), pages 201-210, December.
    25. Antoniadis A. & Fan J., 2001. "Regularization of Wavelet Approximations," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 939-967, September.
    26. Zou, Hui, 2006. "The Adaptive Lasso and Its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 1418-1429, December.
    27. Pesaran, Bahram & Pesaran, M. Hashem, 2010. "Conditional volatility and correlations of weekly returns and the VaR analysis of 2008 stock market crash," Economic Modelling, Elsevier, vol. 27(6), pages 1398-1416, November.
    28. Adam J. Rothman, 2012. "Positive definite estimators of large covariance matrices," Biometrika, Biometrika Trust, vol. 99(3), pages 733-740.
    29. William F. Sharpe, 1963. "A Simplified Model for Portfolio Analysis," Management Science, INFORMS, vol. 9(2), pages 277-293, January.
    30. Fama, Eugene F. & French, Kenneth R., 1997. "Industry costs of equity," Journal of Financial Economics, Elsevier, vol. 43(2), pages 153-193, February.
    31. Peng, Jie & Wang, Pei & Zhou, Nengfeng & Zhu, Ji, 2009. "Partial Correlation Estimation by Joint Sparse Regression Models," Journal of the American Statistical Association, American Statistical Association, vol. 104(486), pages 735-746.
    32. Ming Yuan & Yi Lin, 2007. "Model selection and estimation in the Gaussian graphical model," Biometrika, Biometrika Trust, vol. 94(1), pages 19-35.
    33. Jianhua Z. Huang & Naiping Liu & Mohsen Pourahmadi & Linxu Liu, 2006. "Covariance matrix selection and estimation via penalised normal likelihood," Biometrika, Biometrika Trust, vol. 93(1), pages 85-98, March.
    34. Lingzhou Xue & Shiqian Ma & Hui Zou, 2012. "Positive-Definite ℓ 1 -Penalized Estimation of Large Covariance Matrices," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 107(500), pages 1480-1491, December.
    35. Jushan Bai, 2003. "Inferential Theory for Factor Models of Large Dimensions," Econometrica, Econometric Society, vol. 71(1), pages 135-171, January.
    36. Peter D. Hoff, 2009. "A hierarchical eigenmodel for pooled covariance estimation," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 71(5), pages 971-992, November.
    37. P. Fryzlewicz, 2013. "High-dimensional volatility matrix estimation via wavelets and thresholding," Biometrika, Biometrika Trust, vol. 100(4), pages 921-938.
    38. Michael J. Daniels & Robert E. Kass, 2001. "Shrinkage Estimators for Covariance Matrices," Biometrics, The International Biometric Society, vol. 57(4), pages 1173-1184, December.
    39. Abadir, Karim M. & Distaso, Walter & Žikeš, Filip, 2014. "Design-free estimation of variance matrices," Journal of Econometrics, Elsevier, vol. 181(2), pages 165-180.
    40. Pesaran, M. Hashem & Yamagata, Takashi, 2012. "Testing CAPM with a Large Number of Assets," IZA Discussion Papers 6469, Institute of Labor Economics (IZA).
    41. Fan, Jianqing & Fan, Yingying & Lv, Jinchi, 2008. "High dimensional covariance matrix estimation using a factor model," Journal of Econometrics, Elsevier, vol. 147(1), pages 186-197, November.
    42. Cai, Tony & Liu, Weidong & Luo, Xi, 2011. "A Constrained â„“1 Minimization Approach to Sparse Precision Matrix Estimation," Journal of the American Statistical Association, American Statistical Association, vol. 106(494), pages 594-607.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Natalia Bailey & George Kapetanios & M. Hashem Pesaran, 2019. "Exponent of Cross-sectional Dependence for Residuals," Sankhya B: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 81(1), pages 46-102, September.
    2. Natalia Bailey & Sean Holly & M. Hashem Pesaran, 2016. "A Two‐Stage Approach to Spatio‐Temporal Analysis with Strong and Weak Cross‐Sectional Dependence," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 31(1), pages 249-280, January.
    3. Tae-Hwy Lee & Millie Yi Mao & Aman Ullah, 2021. "Estimation of high-dimensional dynamic conditional precision matrices with an application to forecast combination," Econometric Reviews, Taylor & Francis Journals, vol. 40(10), pages 905-918, November.
    4. Natalia Bailey & George Kapetanios & M. Hashem Pesaran, 2021. "Measurement of factor strength: Theory and practice," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 36(5), pages 587-613, August.
    5. M. Hashem Pesaran & Ron P. Smith, 2023. "The Role of Pricing Errors in Linear Asset Pricing Models with Strong, Semi-Strong, and Latent Factors," CESifo Working Paper Series 10282, CESifo.
    6. Alexander Chudik & M. Hashem Pesaran, 2016. "Theory And Practice Of Gvar Modelling," Journal of Economic Surveys, Wiley Blackwell, vol. 30(1), pages 165-197, February.
    7. Alexander Chudik & George Kapetanios & M. Hashem Pesaran, 2016. "Big data analytics: a new perspective," Globalization Institute Working Papers 268, Federal Reserve Bank of Dallas.
    8. A. Chudik & G. Kapetanios & M. Hashem Pesaran, 2018. "A One Covariate at a Time, Multiple Testing Approach to Variable Selection in High‐Dimensional Linear Regression Models," Econometrica, Econometric Society, vol. 86(4), pages 1479-1512, July.
    9. Filippo di Mauro & Alexander Al-Haschimi & Stephane Dees & Martina Jancokova, 2014. "Linking Distress of Financial Institutions to Macrofinancial Shocks," EcoMod2014 6807, EcoMod.
    10. Ben Craig & Martín Saldías, 2016. "Spatial Dependence and Data-Driven Networks of International Banks," IMF Working Papers 2016/184, International Monetary Fund.
    11. Alexander Chudik & M. Hashem Pesaran & Kamiar Mohaddes, 2020. "Identifying Global and National Output and Fiscal Policy Shocks Using a GVAR," Advances in Econometrics, in: Essays in Honor of Cheng Hsiao, volume 41, pages 143-189, Emerald Group Publishing Limited.
    12. Chudik, Alexander & Grossman, Valerie & Pesaran, M. Hashem, 2016. "A multi-country approach to forecasting output growth using PMIs," Journal of Econometrics, Elsevier, vol. 192(2), pages 349-365.
    13. Ambrogio Cesa-Bianchi & M Hashem Pesaran & Alessandro Rebucci & Stijn Van Nieuwerburgh, 2020. "Uncertainty and Economic Activity: A Multicountry Perspective [Emerging market business cycles: The cycle is the trend]," The Review of Financial Studies, Society for Financial Studies, vol. 33(8), pages 3393-3445.
    14. Rebucci, Alessandro & Acalin, Julien, 2020. "Global Business and Financial Cycles: A Tale of Two Capital Account Regimes," CEPR Discussion Papers 15190, C.E.P.R. Discussion Papers.
    15. Feng, Long & Lan, Wei & Liu, Binghui & Ma, Yanyuan, 2022. "High-dimensional test for alpha in linear factor pricing models with sparse alternatives," Journal of Econometrics, Elsevier, vol. 229(1), pages 152-175.
    16. George Kapetanios & M. Hashem Pesaran & Simon Reese, 2018. "A Residual-based Threshold Method for Detection of Units that are Too Big to Fail in Large Factor Models," CESifo Working Paper Series 7401, CESifo.
    17. Gilles Boevi Koumou, 2020. "Diversification and portfolio theory: a review," Financial Markets and Portfolio Management, Springer;Swiss Society for Financial Market Research, vol. 34(3), pages 267-312, September.
    18. Alexander Chudik & M. Hashem Pesaran & Mahrad Sharifvaghefi, 2020. "Variable Selection in High Dimensional Linear Regressions with Parameter Instability," Globalization Institute Working Papers 394, Federal Reserve Bank of Dallas, revised 18 Jan 2023.
    19. Kapetanios, G. & Pesaran, M.H. & Reese, S., 2021. "Detection of units with pervasive effects in large panel data models," Journal of Econometrics, Elsevier, vol. 221(2), pages 510-541.
    20. Alexander Chudik & M. Hashem Pesaran & Ron P. Smith, 2023. "Revisiting the Great Ratios Hypothesis," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 85(5), pages 1023-1047, October.
    21. Cynthia Fan Yang, 2021. "Common factors and spatial dependence: an application to US house prices," Econometric Reviews, Taylor & Francis Journals, vol. 40(1), pages 14-50, January.
    22. Elhorst, J. Paul & Gross, Marco & Tereanu, Eugen, 2018. "Spillovers in space and time: where spatial econometrics and Global VAR models meet," Working Paper Series 2134, European Central Bank.
    23. Yigit Aydede & Jan Ditzen, 2022. "Identifying the regional drivers of influenza-like illness in Nova Scotia with dominance analysis," Papers 2212.06684, arXiv.org.
    24. Ge, Shuyi & Li, Shaoran & Linton, Oliver, 2023. "News-implied linkages and local dependency in the equity market," Journal of Econometrics, Elsevier, vol. 235(2), pages 779-815.
    25. M. Hashem Pesaran & Ron P. Smith, 2021. "Factor Strengths, Pricing Errors, and Estimation of Risk Premia," CESifo Working Paper Series 8947, CESifo.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Lam, Clifford, 2020. "High-dimensional covariance matrix estimation," LSE Research Online Documents on Economics 101667, London School of Economics and Political Science, LSE Library.
    2. Jianqing Fan & Yuan Liao & Han Liu, 2016. "An overview of the estimation of large covariance and precision matrices," Econometrics Journal, Royal Economic Society, vol. 19(1), pages 1-32, February.
    3. Jianqing Fan & Yuan Liao & Martina Mincheva, 2013. "Large covariance estimation by thresholding principal orthogonal complements," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 75(4), pages 603-680, September.
    4. Gautam Sabnis & Debdeep Pati & Anirban Bhattacharya, 2019. "Compressed Covariance Estimation with Automated Dimension Learning," Sankhya A: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 81(2), pages 466-481, December.
    5. Yang, Yihe & Zhou, Jie & Pan, Jianxin, 2021. "Estimation and optimal structure selection of high-dimensional Toeplitz covariance matrix," Journal of Multivariate Analysis, Elsevier, vol. 184(C).
    6. Shaoxin Wang & Hu Yang & Chaoli Yao, 2019. "On the penalized maximum likelihood estimation of high-dimensional approximate factor model," Computational Statistics, Springer, vol. 34(2), pages 819-846, June.
    7. Choi, Young-Geun & Lim, Johan & Roy, Anindya & Park, Junyong, 2019. "Fixed support positive-definite modification of covariance matrix estimators via linear shrinkage," Journal of Multivariate Analysis, Elsevier, vol. 171(C), pages 234-249.
    8. Na Huang & Piotr Fryzlewicz, 2019. "NOVELIST estimator of large correlation and covariance matrices and their inverses," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 28(3), pages 694-727, September.
    9. Bai, Jushan & Liao, Yuan, 2016. "Efficient estimation of approximate factor models via penalized maximum likelihood," Journal of Econometrics, Elsevier, vol. 191(1), pages 1-18.
    10. Huang, Na & Fryzlewicz, Piotr, 2018. "NOVELIST estimator of large correlation and covariance matrices and their inverses," LSE Research Online Documents on Economics 89055, London School of Economics and Political Science, LSE Library.
    11. Ziqi Chen & Chenlei Leng, 2016. "Dynamic Covariance Models," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(515), pages 1196-1207, July.
    12. Bai, Jushan & Liao, Yuan, 2012. "Efficient Estimation of Approximate Factor Models," MPRA Paper 41558, University Library of Munich, Germany.
    13. Vahe Avagyan & Andrés M. Alonso & Francisco J. Nogales, 2018. "D-trace estimation of a precision matrix using adaptive Lasso penalties," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 12(2), pages 425-447, June.
    14. Ding, Yi & Li, Yingying & Zheng, Xinghua, 2021. "High dimensional minimum variance portfolio estimation under statistical factor models," Journal of Econometrics, Elsevier, vol. 222(1), pages 502-515.
    15. Avagyan, Vahe & Alonso Fernández, Andrés Modesto & Nogales, Francisco J., 2015. "D-trace Precision Matrix Estimation Using Adaptive Lasso Penalties," DES - Working Papers. Statistics and Econometrics. WS 21775, Universidad Carlos III de Madrid. Departamento de Estadística.
    16. Fan, Jianqing & Liao, Yuan & Shi, Xiaofeng, 2015. "Risks of large portfolios," Journal of Econometrics, Elsevier, vol. 186(2), pages 367-387.
    17. Xi Luo, 2011. "Recovering Model Structures from Large Low Rank and Sparse Covariance Matrix Estimation," Papers 1111.1133, arXiv.org, revised Mar 2013.
    18. Natalia Bailey & George Kapetanios & M. Hashem Pesaran, 2019. "Exponent of Cross-sectional Dependence for Residuals," Sankhya B: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 81(1), pages 46-102, September.
    19. Jianqing Fan & Alex Furger & Dacheng Xiu, 2016. "Incorporating Global Industrial Classification Standard Into Portfolio Allocation: A Simple Factor-Based Large Covariance Matrix Estimator With High-Frequency Data," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 34(4), pages 489-503, October.
    20. Maurizio Daniele & Winfried Pohlmeier & Aygul Zagidullina, 2018. "Sparse Approximate Factor Estimation for High-Dimensional Covariance Matrices," Working Paper Series of the Department of Economics, University of Konstanz 2018-07, Department of Economics, University of Konstanz.

    More about this item

    Keywords

    Sparse correlation matrices; High-dimensional data; Multiple testing; Thresholding; Shrinkage;
    All these keywords.

    JEL classification:

    • C13 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Estimation: General
    • C58 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Financial Econometrics

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:qmw:qmwecw:764. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Nicholas Owen (email available below). General contact details of provider: https://edirc.repec.org/data/deqmwuk.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.