IDEAS home Printed from https://ideas.repec.org/p/pra/mprapa/46026.html
   My bibliography  Save this paper

Two Sample Tests for High Dimensional Covariance Matrices

Author

Listed:
  • Chen, Songxi

Abstract

We propose two tests for the equality of covariance matrices between two high-dimensional populations. One test is on the whole variance-covariance matrices, and the other is on offdiagonal sub-matrices which define the covariance between two non-overlapping segments of the high-dimensional random vectors. The tests are applicable (i) when the data dimension is much larger than the sample sizes, namely the “large p, small n” situations and (ii) without assuming parametric distributions for the two populations. These two aspects surpass the capability of the conventional likelihood ratio test. The proposed tests can be used to test on covariances associated with gene ontology terms.

Suggested Citation

  • Chen, Songxi, 2012. "Two Sample Tests for High Dimensional Covariance Matrices," MPRA Paper 46026, University Library of Munich, Germany.
  • Handle: RePEc:pra:mprapa:46026
    as

    Download full text from publisher

    File URL: https://mpra.ub.uni-muenchen.de/46026/1/MPRA_paper_46026.pdf
    File Function: original version
    Download Restriction: no

    File URL: https://mpra.ub.uni-muenchen.de/46278/1/MPRA_paper_46026.pdf
    File Function: revised version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Ledoit, Olivier & Wolf, Michael, 2004. "A well-conditioned estimator for large-dimensional covariance matrices," Journal of Multivariate Analysis, Elsevier, vol. 88(2), pages 365-411, February.
    2. Chen, Song Xi & Qin, Yingli, 2010. "A Two Sample Test for High Dimensional Data with Applications to Gene-set Testing," MPRA Paper 59642, University Library of Munich, Germany.
    3. Fan, Jianqing & Peng, Heng & Huang, Tao, 2005. "Semilinear High-Dimensional Model for Normalization of Microarray Data: A Theoretical Analysis and Partial Consistency," Journal of the American Statistical Association, American Statistical Association, vol. 100, pages 781-796, September.
    4. Fan, Jianqing & Hall, Peter & Yao, Qiwei, 2007. "To How Many Simultaneous Hypothesis Tests Can Normal, Student's t or Bootstrap Calibration Be Applied?," Journal of the American Statistical Association, American Statistical Association, vol. 102, pages 1282-1288, December.
    5. Huang, Jian & Wang, Deli & Zhang, Cun-Hui, 2005. "A Two-Way Semilinear Model for Normalization and Analysis of cDNA Microarray Data," Journal of the American Statistical Association, American Statistical Association, vol. 100, pages 814-829, September.
    6. Johnstone, Iain M. & Lu, Arthur Yu, 2009. "On Consistency and Sparsity for Principal Components Analysis in High Dimensions," Journal of the American Statistical Association, American Statistical Association, vol. 104(486), pages 682-693.
    7. Schott, James R., 2007. "A test for the equality of covariance matrices when the dimension is large relative to the sample sizes," Computational Statistics & Data Analysis, Elsevier, vol. 51(12), pages 6535-6542, August.
    8. Clifford Lam & Qiwei Yao & Neil Bathia, 2011. "Estimation of latent factors for high-dimensional time series," Biometrika, Biometrika Trust, vol. 98(4), pages 901-918.
    9. Jianhua Z. Huang & Naiping Liu & Mohsen Pourahmadi & Linxu Liu, 2006. "Covariance matrix selection and estimation via penalised normal likelihood," Biometrika, Biometrika Trust, vol. 93(1), pages 85-98, March.
    10. Adam J. Rothman & Elizaveta Levina & Ji Zhu, 2010. "A new approach to Cholesky-based covariance regularization in high dimensions," Biometrika, Biometrika Trust, vol. 97(3), pages 539-550.
    11. Wei Biao Wu, 2003. "Nonparametric estimation of large covariance matrices of longitudinal data," Biometrika, Biometrika Trust, vol. 90(4), pages 831-844, December.
    12. Fan, Jianqing & Fan, Yingying & Lv, Jinchi, 2008. "High dimensional covariance matrix estimation using a factor model," Journal of Econometrics, Elsevier, vol. 147(1), pages 186-197, November.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Lam, Clifford, 2020. "High-dimensional covariance matrix estimation," LSE Research Online Documents on Economics 101667, London School of Economics and Political Science, LSE Library.
    2. Xi Luo, 2011. "Recovering Model Structures from Large Low Rank and Sparse Covariance Matrix Estimation," Papers 1111.1133, arXiv.org, revised Mar 2013.
    3. Gautam Sabnis & Debdeep Pati & Anirban Bhattacharya, 2019. "Compressed Covariance Estimation with Automated Dimension Learning," Sankhya A: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 81(2), pages 466-481, December.
    4. Fang, Qian & Yu, Chen & Weiping, Zhang, 2020. "Regularized estimation of precision matrix for high-dimensional multivariate longitudinal data," Journal of Multivariate Analysis, Elsevier, vol. 176(C).
    5. Jianqing Fan & Yuan Liao & Martina Mincheva, 2013. "Large covariance estimation by thresholding principal orthogonal complements," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 75(4), pages 603-680, September.
    6. Yumou Qiu & Song Xi Chen, 2015. "Bandwidth Selection for High-Dimensional Covariance Matrix Estimation," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(511), pages 1160-1174, September.
    7. He, Jing & Chen, Song Xi, 2016. "Testing super-diagonal structure in high dimensional covariance matrices," Journal of Econometrics, Elsevier, vol. 194(2), pages 283-297.
    8. Abadir, Karim M. & Distaso, Walter & Žikeš, Filip, 2014. "Design-free estimation of variance matrices," Journal of Econometrics, Elsevier, vol. 181(2), pages 165-180.
    9. Benjamin Poignard & Manabu Asai, 2023. "Estimation of high-dimensional vector autoregression via sparse precision matrix," The Econometrics Journal, Royal Economic Society, vol. 26(2), pages 307-326.
    10. Pesaran, M. Hashem & Yamagata, Takashi, 2012. "Testing CAPM with a Large Number of Assets," IZA Discussion Papers 6469, Institute of Labor Economics (IZA).
    11. Qiu, Yumou & Chen, Songxi, 2012. "Test for Bandedness of High Dimensional Covariance Matrices with Bandwidth Estimation," MPRA Paper 46242, University Library of Munich, Germany.
    12. Chen, Jia & Li, Degui & Linton, Oliver, 2019. "A new semiparametric estimation approach for large dynamic covariance matrices with multiple conditioning variables," Journal of Econometrics, Elsevier, vol. 212(1), pages 155-176.
    13. Xiaoping Zhou & Dmitry Malioutov & Frank J. Fabozzi & Svetlozar T. Rachev, 2014. "Smooth monotone covariance for elliptical distributions and applications in finance," Quantitative Finance, Taylor & Francis Journals, vol. 14(9), pages 1555-1571, September.
    14. Kang, Xiaoning & Wang, Mingqiu, 2021. "Ensemble sparse estimation of covariance structure for exploring genetic disease data," Computational Statistics & Data Analysis, Elsevier, vol. 159(C).
    15. Chi, Eric C. & Lange, Kenneth, 2014. "Stable estimation of a covariance matrix guided by nuclear norm penalties," Computational Statistics & Data Analysis, Elsevier, vol. 80(C), pages 117-128.
    16. Bailey, Natalia & Pesaran, M. Hashem & Smith, L. Vanessa, 2019. "A multiple testing approach to the regularisation of large sample correlation matrices," Journal of Econometrics, Elsevier, vol. 208(2), pages 507-534.
    17. Lopes, Hedibert F. & McCulloch, Robert E. & Tsay, Ruey S., 2022. "Parsimony inducing priors for large scale state–space models," Journal of Econometrics, Elsevier, vol. 230(1), pages 39-61.
    18. Zvi Bodie & Jérôme Detemple & Marcel Rindisbacher, 2009. "Life-Cycle Finance and the Design of Pension Plans," Annual Review of Financial Economics, Annual Reviews, vol. 1(1), pages 249-286, November.
    19. Chen, Song Xi & Qin, Yingli, 2010. "A Two Sample Test for High Dimensional Data with Applications to Gene-set Testing," MPRA Paper 59642, University Library of Munich, Germany.
    20. Liusha Yang & Matthew R. Mckay & Romain Couillet, 2018. "High-Dimensional MVDR Beamforming: Optimized Solutions Based on Spiked Random Matrix Models," Post-Print hal-01957672, HAL.

    More about this item

    Keywords

    High dimensional covariance; Large p small n; Likelihood ratio test; Testing for Gene-sets.;
    All these keywords.

    JEL classification:

    • C0 - Mathematical and Quantitative Methods - - General
    • C1 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General
    • C2 - Mathematical and Quantitative Methods - - Single Equation Models; Single Variables
    • C3 - Mathematical and Quantitative Methods - - Multiple or Simultaneous Equation Models; Multiple Variables
    • C4 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods: Special Topics
    • C5 - Mathematical and Quantitative Methods - - Econometric Modeling
    • C6 - Mathematical and Quantitative Methods - - Mathematical Methods; Programming Models; Mathematical and Simulation Modeling
    • C7 - Mathematical and Quantitative Methods - - Game Theory and Bargaining Theory
    • C8 - Mathematical and Quantitative Methods - - Data Collection and Data Estimation Methodology; Computer Programs
    • C9 - Mathematical and Quantitative Methods - - Design of Experiments
    • G0 - Financial Economics - - General
    • G1 - Financial Economics - - General Financial Markets
    • G2 - Financial Economics - - Financial Institutions and Services
    • G3 - Financial Economics - - Corporate Finance and Governance

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:pra:mprapa:46026. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Joachim Winter (email available below). General contact details of provider: https://edirc.repec.org/data/vfmunde.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.