IDEAS home Printed from https://ideas.repec.org/a/spr/stpapr/v66y2025i6d10.1007_s00362-025-01744-4.html
   My bibliography  Save this article

A two-sample test for high-dimensional mean vectors via double verification

Author

Listed:
  • Ruizhe Jiang

    (Jinan University)

  • Xiaowen Huang

    (Jinan University)

  • Yunlu Jiang

    (Jinan University)

Abstract

Testing the equality of two high-dimensional mean vectors is a fundamental and important statistical problem. The majority of existing methodological frameworks are limited to either dense or sparse cases. In this paper, we propose a novel framework that incorporates a double validation test statistic designed to be valid for both dense and sparse alternatives by combining the test statistic via the random integration of the difference technique and the extreme-type test statistic. The new framework enables a more tailored and efficient process, contingent on the specific circumstances. Additionally, we propose a data-driven procedure for selecting weight to increase the power of the proposed test. Furthermore, we show the asymptotic properties of the proposed test. Numerical simulations and a real data analysis illustrate the promising performances of our proposed approach.

Suggested Citation

  • Ruizhe Jiang & Xiaowen Huang & Yunlu Jiang, 2025. "A two-sample test for high-dimensional mean vectors via double verification," Statistical Papers, Springer, vol. 66(6), pages 1-24, October.
  • Handle: RePEc:spr:stpapr:v:66:y:2025:i:6:d:10.1007_s00362-025-01744-4
    DOI: 10.1007/s00362-025-01744-4
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s00362-025-01744-4
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s00362-025-01744-4?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. Cheng, Tingting & Yan, Cheng & Yan, Yayi, 2021. "Improved inference for fund alphas using high-dimensional cross-sectional tests," Journal of Empirical Finance, Elsevier, vol. 61(C), pages 57-81.
    2. Lan Wang & Bo Peng & Runze Li, 2015. "A High-Dimensional Nonparametric Multivariate Test for Mean Vector," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(512), pages 1658-1669, December.
    3. Wanjun Liu & Xiufan Yu & Wei Zhong & Runze Li, 2024. "Projection Test for Mean Vector in High Dimensions," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 119(545), pages 744-756, January.
    4. Srivastava, Muni S., 2009. "A test for the mean vector with fewer observations than the dimension under non-normality," Journal of Multivariate Analysis, Elsevier, vol. 100(3), pages 518-532, March.
    5. Srivastava, Muni S. & Yanagihara, Hirokazu, 2010. "Testing the equality of several covariance matrices with fewer observations than the dimension," Journal of Multivariate Analysis, Elsevier, vol. 101(6), pages 1319-1329, July.
    6. Chen, Song Xi & Qin, Yingli, 2010. "A Two Sample Test for High Dimensional Data with Applications to Gene-set Testing," MPRA Paper 59642, University Library of Munich, Germany.
    7. T. Tony Cai & Weidong Liu & Yin Xia, 2014. "Two-sample test of high dimensional means under dependence," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 76(2), pages 349-372, March.
    8. Zhang, Liang & Zhu, Tianming & Zhang, Jin-Ting, 2020. "A Simple Scale-Invariant Two-Sample Test for High-dimensional Data," Econometrics and Statistics, Elsevier, vol. 14(C), pages 131-144.
    9. Jiang Hu & Zhidong Bai & Chen Wang & Wei Wang, 2017. "On testing the equality of high dimensional mean vectors with unequal covariance matrices," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 69(2), pages 365-387, April.
    10. Jinyuan Chang & Chao Zheng & Wen‐Xin Zhou & Wen Zhou, 2017. "Simulation‐based hypothesis testing of high dimensional means under covariance heterogeneity," Biometrics, The International Biometric Society, vol. 73(4), pages 1300-1310, December.
    11. Kwangil Ro & Changliang Zou & Zhaojun Wang & Guosheng Yin, 2015. "Outlier detection for high-dimensional data," Biometrika, Biometrika Trust, vol. 102(3), pages 589-599.
    12. Chen, Song Xi & Zhang, Li-Xin & Zhong, Ping-Shou, 2010. "Tests for High-Dimensional Covariance Matrices," Journal of the American Statistical Association, American Statistical Association, vol. 105(490), pages 810-819.
    13. Rui Wang & Wangli Xu, 2022. "An approximate randomization test for the high-dimensional two-sample Behrens–Fisher problem under arbitrary covariances [Broad patterns of gene expression revealed by clustering analysis of tumor ," Biometrika, Biometrika Trust, vol. 109(4), pages 1117-1132.
    14. Srivastava, Muni S. & Du, Meng, 2008. "A test for the mean vector with fewer observations than the dimension," Journal of Multivariate Analysis, Elsevier, vol. 99(3), pages 386-402, March.
    15. Chen, Song Xi & Guo, Bin & Qiu, Yumou, 2023. "Testing and signal identification for two-sample high-dimensional covariances via multi-level thresholding," Journal of Econometrics, Elsevier, vol. 235(2), pages 1337-1354.
    16. Qiu, Tao & Xu, Wangli & Zhu, Liping, 2021. "Two-sample test in high dimensions through random selection," Computational Statistics & Data Analysis, Elsevier, vol. 160(C).
    17. Chen, Songxi, 2012. "Two Sample Tests for High Dimensional Covariance Matrices," MPRA Paper 46026, University Library of Munich, Germany.
    18. Gongjun Xu & Lifeng Lin & Peng Wei & Wei Pan, 2016. "An adaptive two-sample test for high-dimensional means," Biometrika, Biometrika Trust, vol. 103(3), pages 609-624.
    19. Karl Bruce Gregory & Raymond J. Carroll & Veerabhadran Baladandayuthapani & Soumendra N. Lahiri, 2015. "A Two-Sample Test for Equality of Means in High Dimension," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(510), pages 837-849, June.
    20. Huang, Yuan & Li, Changcheng & Li, Runze & Yang, Songshan, 2022. "An overview of tests on high-dimensional means," Journal of Multivariate Analysis, Elsevier, vol. 188(C).
    21. Xiufan Yu & Danning Li & Lingzhou Xue & Runze Li, 2023. "Power-Enhanced Simultaneous Test of High-Dimensional Mean Vectors and Covariance Matrices with Application to Gene-Set Testing," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 118(544), pages 2548-2561, October.
    22. Jin-Ting Zhang & Jia Guo & Bu Zhou & Ming-Yen Cheng, 2020. "A Simple Two-Sample Test in High Dimensions Based on L2-Norm," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 115(530), pages 1011-1027, April.
    23. Yunlu Jiang & Xueqin Wang & Canhong Wen & Yukang Jiang & Heping Zhang, 2024. "Nonparametric Two-Sample Tests of High Dimensional Mean Vectors via Random Integration," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 119(545), pages 701-714, January.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Harrar, Solomon W. & Kong, Xiaoli, 2022. "Recent developments in high-dimensional inference for multivariate data: Parametric, semiparametric and nonparametric approaches," Journal of Multivariate Analysis, Elsevier, vol. 188(C).
    2. Jianghao Li & Shizhe Hong & Zhenzhen Niu & Zhidong Bai, 2025. "Test for high-dimensional linear hypothesis of mean vectors via random integration," Statistical Papers, Springer, vol. 66(1), pages 1-34, January.
    3. Huang, Yuan & Li, Changcheng & Li, Runze & Yang, Songshan, 2022. "An overview of tests on high-dimensional means," Journal of Multivariate Analysis, Elsevier, vol. 188(C).
    4. Li, Jun, 2023. "Finite sample t-tests for high-dimensional means," Journal of Multivariate Analysis, Elsevier, vol. 196(C).
    5. Zhang, Jin-Ting & Zhou, Bu & Guo, Jia, 2022. "Linear hypothesis testing in high-dimensional heteroscedastic one-way MANOVA: A normal reference L2-norm based test," Journal of Multivariate Analysis, Elsevier, vol. 187(C).
    6. Yuanyuan Jiang & Xingzhong Xu, 2022. "A Two-Sample Test of High Dimensional Means Based on Posterior Bayes Factor," Mathematics, MDPI, vol. 10(10), pages 1-23, May.
    7. Zhang, Yu & Feng, Long, 2024. "Adaptive rank-based tests for high dimensional mean problems," Statistics & Probability Letters, Elsevier, vol. 214(C).
    8. Mingjuan Zhang & Libin Jin, 2024. "High-Dimensional U-Statistics Type Hypothesis Testing via Jackknife Pseudo-Values with Multiplier Bootstrap," Mathematics, MDPI, vol. 12(23), pages 1-20, December.
    9. Wang, Jingyi & Zhu, Tianming & Zhang, Jin-Ting, 2024. "Two-sample test for high-dimensional covariance matrices: A normal-reference approach," Journal of Multivariate Analysis, Elsevier, vol. 204(C).
    10. Jin-Ting Zhang & Bu Zhou & Jia Guo, 2022. "Testing high-dimensional mean vector with applications," Statistical Papers, Springer, vol. 63(4), pages 1105-1137, August.
    11. He, Yong & Zhang, Mingjuan & Zhang, Xinsheng & Zhou, Wang, 2020. "High-dimensional two-sample mean vectors test and support recovery with factor adjustment," Computational Statistics & Data Analysis, Elsevier, vol. 151(C).
    12. Saha, Enakshi & Sarkar, Soham & Ghosh, Anil K., 2017. "Some high-dimensional one-sample tests based on functions of interpoint distances," Journal of Multivariate Analysis, Elsevier, vol. 161(C), pages 83-95.
    13. Zhang, Jin-Ting & Zhu, Tianming, 2022. "A new normal reference test for linear hypothesis testing in high-dimensional heteroscedastic one-way MANOVA," Computational Statistics & Data Analysis, Elsevier, vol. 168(C).
    14. Lixiu Wu & Jiang Hu, 2024. "Multi-sample hypothesis testing of high-dimensional mean vectors under covariance heterogeneity," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 76(4), pages 579-615, August.
    15. Guanpeng Wang & Yuyuan Wang & Hengjian Cui, 2025. "Mean test for high-dimensional data based on covariance matrix with linear structures," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 88(5), pages 657-678, July.
    16. Jiang Hu & Zhidong Bai & Chen Wang & Wei Wang, 2017. "On testing the equality of high dimensional mean vectors with unequal covariance matrices," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 69(2), pages 365-387, April.
    17. Yin, Yanqing, 2021. "Test for high-dimensional mean vector under missing observations," Journal of Multivariate Analysis, Elsevier, vol. 186(C).
    18. Davy Paindaveine & Thomas Verdebout, 2013. "Universal Asymptotics for High-Dimensional Sign Tests," Working Papers ECARES ECARES 2013-40, ULB -- Universite Libre de Bruxelles.
    19. Ghosh, Santu & Ayyala, Deepak Nag & Hellebuyck, Rafael, 2021. "Two-sample high dimensional mean test based on prepivots," Computational Statistics & Data Analysis, Elsevier, vol. 163(C).
    20. Zhang, Huaiyu & Wang, Haiyan, 2021. "A more powerful test of equality of high-dimensional two-sample means," Computational Statistics & Data Analysis, Elsevier, vol. 164(C).

    More about this item

    Keywords

    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:stpapr:v:66:y:2025:i:6:d:10.1007_s00362-025-01744-4. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.