IDEAS home Printed from https://ideas.repec.org/a/eee/jmvana/v167y2018icp225-249.html
   My bibliography  Save this article

On two-sample mean tests under spiked covariances

Author

Listed:
  • Wang, Rui
  • Xu, Xingzhong

Abstract

This paper considers testing the means of two p-variate normal samples in high dimensional settings. We show that under the null hypothesis, a necessary and sufficient condition for the asymptotic normality of the test statistic of Chen and Qin (2010) is that the eigenvalues of the covariance matrix are concentrated around their average. However, this condition is not satisfied when the variables are strongly correlated. To characterize the correlations between variables, we adopt a spiked covariance model. Under the spiked covariance model, we derive the asymptotic distribution of the test statistic of Chen and Qin (2010) and correct its critical value. The recently proposed random projection test procedures suggest that the power of tests may be boosted using the projected data. By maximizing an average signal to noise ratio, we find that the optimal projection subspace is the orthogonal complement of the principal subspace. We propose a new test procedure through the projection onto the estimated orthogonal complement of the principal subspace. The asymptotic normality of the new test statistic is proved and the asymptotic power function of the test is given. Theoretical and simulation results show that the new test outperforms the competing tests substantially under the spiked covariance model.

Suggested Citation

  • Wang, Rui & Xu, Xingzhong, 2018. "On two-sample mean tests under spiked covariances," Journal of Multivariate Analysis, Elsevier, vol. 167(C), pages 225-249.
  • Handle: RePEc:eee:jmvana:v:167:y:2018:i:c:p:225-249
    DOI: 10.1016/j.jmva.2018.05.004
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0047259X17304165
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.jmva.2018.05.004?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Jushan Bai & Serena Ng, 2002. "Determining the Number of Factors in Approximate Factor Models," Econometrica, Econometric Society, vol. 70(1), pages 191-221, January.
    2. Thulin, Måns, 2014. "A high-dimensional two-sample test for the mean using random subspaces," Computational Statistics & Data Analysis, Elsevier, vol. 74(C), pages 26-38.
    3. Ma, Yingying & Lan, Wei & Wang, Hansheng, 2015. "A high dimensional two-sample test under a low dimensional factor structure," Journal of Multivariate Analysis, Elsevier, vol. 140(C), pages 162-170.
    4. Seung C. Ahn & Alex R. Horenstein, 2013. "Eigenvalue Ratio Test for the Number of Factors," Econometrica, Econometric Society, vol. 81(3), pages 1203-1227, May.
    5. Chen, Song Xi & Qin, Yingli, 2010. "A Two Sample Test for High Dimensional Data with Applications to Gene-set Testing," MPRA Paper 59642, University Library of Munich, Germany.
    6. Zhao, Junguang & Xu, Xingzhong, 2016. "A generalized likelihood ratio test for normal mean when p is greater than n," Computational Statistics & Data Analysis, Elsevier, vol. 99(C), pages 91-104.
    7. Damien Passemier & Zhaoyuan Li & Jianfeng Yao, 2017. "On estimation of the noise variance in high dimensional probabilistic principal component analysis," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 79(1), pages 51-67, January.
    8. Srivastava, Muni S. & Du, Meng, 2008. "A test for the mean vector with fewer observations than the dimension," Journal of Multivariate Analysis, Elsevier, vol. 99(3), pages 386-402, March.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Mingxiang Cao & Yuanjing He, 2022. "A high-dimensional test on linear hypothesis of means under a low-dimensional factor model," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 85(5), pages 557-572, July.
    2. Yuanyuan Jiang & Xingzhong Xu, 2022. "A Two-Sample Test of High Dimensional Means Based on Posterior Bayes Factor," Mathematics, MDPI, vol. 10(10), pages 1-23, May.
    3. Harrar, Solomon W. & Kong, Xiaoli, 2022. "Recent developments in high-dimensional inference for multivariate data: Parametric, semiparametric and nonparametric approaches," Journal of Multivariate Analysis, Elsevier, vol. 188(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Mingxiang Cao & Yuanjing He, 2022. "A high-dimensional test on linear hypothesis of means under a low-dimensional factor model," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 85(5), pages 557-572, July.
    2. Pini, Alessia & Stamm, Aymeric & Vantini, Simone, 2018. "Hotelling’s T2 in separable Hilbert spaces," Journal of Multivariate Analysis, Elsevier, vol. 167(C), pages 284-305.
    3. Harrar, Solomon W. & Kong, Xiaoli, 2022. "Recent developments in high-dimensional inference for multivariate data: Parametric, semiparametric and nonparametric approaches," Journal of Multivariate Analysis, Elsevier, vol. 188(C).
    4. He, Yong & Zhang, Mingjuan & Zhang, Xinsheng & Zhou, Wang, 2020. "High-dimensional two-sample mean vectors test and support recovery with factor adjustment," Computational Statistics & Data Analysis, Elsevier, vol. 151(C).
    5. Tzviel Frostig & Yoav Benjamini, 2022. "Testing the equality of multivariate means when $$p>n$$ p > n by combining the Hotelling and Simes tests," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 31(2), pages 390-415, June.
    6. Feng, Long & Sun, Fasheng, 2015. "A note on high-dimensional two-sample test," Statistics & Probability Letters, Elsevier, vol. 105(C), pages 29-36.
    7. Zhang, Jie & Pan, Meng, 2016. "A high-dimension two-sample test for the mean using cluster subspaces," Computational Statistics & Data Analysis, Elsevier, vol. 97(C), pages 87-97.
    8. Zhang, Jin-Ting & Guo, Jia & Zhou, Bu, 2017. "Linear hypothesis testing in high-dimensional one-way MANOVA," Journal of Multivariate Analysis, Elsevier, vol. 155(C), pages 200-216.
    9. Yuanyuan Jiang & Xingzhong Xu, 2022. "A Two-Sample Test of High Dimensional Means Based on Posterior Bayes Factor," Mathematics, MDPI, vol. 10(10), pages 1-23, May.
    10. Zhang, Jin-Ting & Zhou, Bu & Guo, Jia, 2022. "Linear hypothesis testing in high-dimensional heteroscedastic one-way MANOVA: A normal reference L2-norm based test," Journal of Multivariate Analysis, Elsevier, vol. 187(C).
    11. Huang, Yuan & Li, Changcheng & Li, Runze & Yang, Songshan, 2022. "An overview of tests on high-dimensional means," Journal of Multivariate Analysis, Elsevier, vol. 188(C).
    12. Qiu, Tao & Xu, Wangli & Zhu, Liping, 2021. "Two-sample test in high dimensions through random selection," Computational Statistics & Data Analysis, Elsevier, vol. 160(C).
    13. Hyodo, Masashi & Nishiyama, Takahiro & Pavlenko, Tatjana, 2023. "A Behrens–Fisher problem for general factor models in high dimensions," Journal of Multivariate Analysis, Elsevier, vol. 195(C).
    14. Miao, Ke & Phillips, Peter C.B. & Su, Liangjun, 2023. "High-dimensional VARs with common factors," Journal of Econometrics, Elsevier, vol. 233(1), pages 155-183.
    15. Zhao, Junguang & Xu, Xingzhong, 2016. "A generalized likelihood ratio test for normal mean when p is greater than n," Computational Statistics & Data Analysis, Elsevier, vol. 99(C), pages 91-104.
    16. Wang, Wei & Lin, Nan & Tang, Xiang, 2019. "Robust two-sample test of high-dimensional mean vectors under dependence," Journal of Multivariate Analysis, Elsevier, vol. 169(C), pages 312-329.
    17. Zhang, Huaiyu & Wang, Haiyan, 2021. "A more powerful test of equality of high-dimensional two-sample means," Computational Statistics & Data Analysis, Elsevier, vol. 164(C).
    18. Ouyang, Yanyan & Liu, Jiamin & Tong, Tiejun & Xu, Wangli, 2022. "A rank-based high-dimensional test for equality of mean vectors," Computational Statistics & Data Analysis, Elsevier, vol. 173(C).
    19. GUO-FITOUSSI, Liang, 2013. "A Comparison of the Finite Sample Properties of Selection Rules of Factor Numbers in Large Datasets," MPRA Paper 50005, University Library of Munich, Germany.
    20. Li, Yang & Wang, Zhaojun & Zou, Changliang, 2016. "A simpler spatial-sign-based two-sample test for high-dimensional data," Journal of Multivariate Analysis, Elsevier, vol. 149(C), pages 192-198.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:jmvana:v:167:y:2018:i:c:p:225-249. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/wps/find/journaldescription.cws_home/622892/description#description .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.