IDEAS home Printed from https://ideas.repec.org/a/spr/compst/v39y2024i3d10.1007_s00180-023-01366-0.html
   My bibliography  Save this article

Bayesian hypothesis testing for equality of high-dimensional means using cluster subspaces

Author

Listed:
  • Fang Chen

    (Center for Biologics Evaluation and Research (CBER), Food and Drug Administration)

  • Qiuchen Hai

    (Texas A&M University-San Antonio)

  • Min Wang

    (The University of Texas at San Antonio)

Abstract

The classical Hotelling’s $$T^2$$ T 2 test and Bayesian hypothesis tests breakdown for the problem of comparing two high-dimensional population means due to the singularity of the pooled sample covariance matrices when the model dimension p exceeds the sample size n. In this paper, we develop a simple closed-form Bayesian testing procedure based on a split-and-merge technique. Specifically, we adopt the subspace clustering technique to split the high-dimensional data into lower-dimensional random spaces so that the Bayes factor can be implemented. Then we utilize the geometric mean to merge the results of the Bayesian test to obtain a novel test statistic. We carry out simulation studies to compare the performance of the proposed test with several existing ones in the literature. Finally, two real-data applications are provided for illustrative purposes.

Suggested Citation

  • Fang Chen & Qiuchen Hai & Min Wang, 2024. "Bayesian hypothesis testing for equality of high-dimensional means using cluster subspaces," Computational Statistics, Springer, vol. 39(3), pages 1301-1320, May.
  • Handle: RePEc:spr:compst:v:39:y:2024:i:3:d:10.1007_s00180-023-01366-0
    DOI: 10.1007/s00180-023-01366-0
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s00180-023-01366-0
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s00180-023-01366-0?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Zhang, Huaiyu & Wang, Haiyan, 2021. "A more powerful test of equality of high-dimensional two-sample means," Computational Statistics & Data Analysis, Elsevier, vol. 164(C).
    2. Joris Mulder & James O. Berger & Víctor Peña & M. J. Bayarri, 2021. "On the prevalence of information inconsistency in normal linear models," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 30(1), pages 103-132, March.
    3. Zhang, Jie & Pan, Meng, 2016. "A high-dimension two-sample test for the mean using cluster subspaces," Computational Statistics & Data Analysis, Elsevier, vol. 97(C), pages 87-97.
    4. Liang, Feng & Paulo, Rui & Molina, German & Clyde, Merlise A. & Berger, Jim O., 2008. "Mixtures of g Priors for Bayesian Variable Selection," Journal of the American Statistical Association, American Statistical Association, vol. 103, pages 410-423, March.
    5. Roger S. Zoh & Abhra Sarkar & Raymond J. Carroll & Bani K. Mallick, 2018. "A Powerful Bayesian Test for Equality of Means in High Dimensions," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 113(524), pages 1733-1741, October.
    6. Srivastava, Muni S. & Du, Meng, 2008. "A test for the mean vector with fewer observations than the dimension," Journal of Multivariate Analysis, Elsevier, vol. 99(3), pages 386-402, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Harrar, Solomon W. & Kong, Xiaoli, 2022. "Recent developments in high-dimensional inference for multivariate data: Parametric, semiparametric and nonparametric approaches," Journal of Multivariate Analysis, Elsevier, vol. 188(C).
    2. Wang, Guanpeng & Wu, Jiujing & Cui, Hengjian, 2024. "Cross projection test for mean vectors via multiple random splits in high dimensions," Journal of Multivariate Analysis, Elsevier, vol. 204(C).
    3. Zhang, Jin-Ting & Guo, Jia & Zhou, Bu, 2017. "Linear hypothesis testing in high-dimensional one-way MANOVA," Journal of Multivariate Analysis, Elsevier, vol. 155(C), pages 200-216.
    4. Yuanyuan Jiang & Xingzhong Xu, 2022. "A Two-Sample Test of High Dimensional Means Based on Posterior Bayes Factor," Mathematics, MDPI, vol. 10(10), pages 1-23, May.
    5. Qiu, Tao & Xu, Wangli & Zhu, Liping, 2021. "Two-sample test in high dimensions through random selection," Computational Statistics & Data Analysis, Elsevier, vol. 160(C).
    6. Wang, Wei & Lin, Nan & Tang, Xiang, 2019. "Robust two-sample test of high-dimensional mean vectors under dependence," Journal of Multivariate Analysis, Elsevier, vol. 169(C), pages 312-329.
    7. Zhang, Huaiyu & Wang, Haiyan, 2021. "A more powerful test of equality of high-dimensional two-sample means," Computational Statistics & Data Analysis, Elsevier, vol. 164(C).
    8. Ouyang, Yanyan & Liu, Jiamin & Tong, Tiejun & Xu, Wangli, 2022. "A rank-based high-dimensional test for equality of mean vectors," Computational Statistics & Data Analysis, Elsevier, vol. 173(C).
    9. Domenico Giannone & Michele Lenza & Lucrezia Reichlin, 2011. "Market Freedom and the Global Recession," IMF Economic Review, Palgrave Macmillan;International Monetary Fund, vol. 59(1), pages 111-135, April.
    10. Li, Yang & Wang, Zhaojun & Zou, Changliang, 2016. "A simpler spatial-sign-based two-sample test for high-dimensional data," Journal of Multivariate Analysis, Elsevier, vol. 149(C), pages 192-198.
    11. Reza Modarres, 2024. "Hotelling $$T^2$$ T 2 test in high dimensions with application to Wilks outlier method," Statistical Papers, Springer, vol. 65(8), pages 5203-5218, October.
    12. Byron Botha & Rulof Burger & Kevin Kotzé & Neil Rankin & Daan Steenkamp, 2023. "Big data forecasting of South African inflation," Empirical Economics, Springer, vol. 65(1), pages 149-188, July.
    13. repec:osf:socarx:n49hv_v1 is not listed on IDEAS
    14. Ons Jedidi & Jean Sébastien Pentecote, 2015. "Robust Signals for Banking Crises," Economics Bulletin, AccessEcon, vol. 35(3), pages 1617-1629.
    15. Anna Sokolova, 2023. "Marginal Propensity to Consume and Unemployment: a Meta-analysis," Review of Economic Dynamics, Elsevier for the Society for Economic Dynamics, vol. 51, pages 813-846, December.
    16. Hasan, Iftekhar & Horvath, Roman & Mares, Jan, 2020. "Finance and wealth inequality," Journal of International Money and Finance, Elsevier, vol. 108(C).
    17. Mariam Camarero & Sergi Moliner & Cecilio Tamarit, 2021. "Is there a euro effect in the drivers of US FDI? New evidence using Bayesian model averaging techniques," Review of World Economics (Weltwirtschaftliches Archiv), Springer;Institut für Weltwirtschaft (Kiel Institute for the World Economy), vol. 157(4), pages 881-926, November.
    18. Lianjie Shu & Jinyu Fan, 2018. "A distribution‐free control chart for monitoring high‐dimensional processes based on interpoint distances," Naval Research Logistics (NRL), John Wiley & Sons, vol. 65(4), pages 317-330, June.
    19. Ley, Eduardo & Steel, Mark F.J., 2012. "Mixtures of g-priors for Bayesian model averaging with economic applications," Journal of Econometrics, Elsevier, vol. 171(2), pages 251-266.
    20. Galharret, Jean-Michel & Philippe, Anne, 2023. "Bayesian analysis for mediation and moderation using g−priors," Econometrics and Statistics, Elsevier, vol. 27(C), pages 161-172.
    21. Jesus Crespo Cuaresma & Bettina Grün & Paul Hofmarcher & Stefan Humer & Mathias Moser, 2015. "A Comprehensive Approach to Posterior Jointness Analysis in Bayesian Model Averaging Applications," Department of Economics Working Papers wuwp193, Vienna University of Economics and Business, Department of Economics.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:compst:v:39:y:2024:i:3:d:10.1007_s00180-023-01366-0. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.