IDEAS home Printed from https://ideas.repec.org/a/eee/jmvana/v167y2018icp284-305.html
   My bibliography  Save this article

Hotelling’s T2 in separable Hilbert spaces

Author

Listed:
  • Pini, Alessia
  • Stamm, Aymeric
  • Vantini, Simone

Abstract

We address the problem of finite-sample null hypothesis significance testing on the mean element of a random variable that takes value in a generic separable Hilbert space. For this purpose, we propose a (re)definition of Hotelling’s T2 that naturally expands to any separable Hilbert space that we further embed within a permutation inferential approach. In detail, we present a unified framework for making inference on the mean element of Hilbert populations based on Hotelling’s T2 statistic, using a permutation-based testing procedure of which we prove finite-sample exactness and consistency; we showcase the explicit form of Hotelling’s T2 statistic in the case of some famous spaces used in functional data analysis (i.e., Sobolev and Bayes spaces); we demonstrate, by means of simulations, that Hotelling’s T2 exhibits the best performances in terms of statistical power for detecting mean differences between Gaussian populations, compared to other state-of-the-art statistics, in most simulated scenarios; we propose a case study that demonstrate the importance of the space into which one decides to embed the data; we provide an implementation of the proposed tools in the R package fdahotelling available at https://github.com/astamm/fdahotelling.

Suggested Citation

  • Pini, Alessia & Stamm, Aymeric & Vantini, Simone, 2018. "Hotelling’s T2 in separable Hilbert spaces," Journal of Multivariate Analysis, Elsevier, vol. 167(C), pages 284-305.
  • Handle: RePEc:eee:jmvana:v:167:y:2018:i:c:p:284-305
    DOI: 10.1016/j.jmva.2018.05.007
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0047259X17302646
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.jmva.2018.05.007?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Sangalli, Laura M. & Secchi, Piercesare & Vantini, Simone & Veneziani, Alessandro, 2009. "A Case Study in Exploratory Functional Data Analysis: Geometrical Features of the Internal Carotid Artery," Journal of the American Statistical Association, American Statistical Association, vol. 104(485), pages 37-48.
    2. Biswas, Munmun & Ghosh, Anil K., 2014. "A nonparametric two-sample test applicable to high dimensional data," Journal of Multivariate Analysis, Elsevier, vol. 123(C), pages 160-171.
    3. Lajos Horváth & Piotr Kokoszka & Ron Reeder, 2013. "Estimation of the mean of functional time series and a two-sample problem," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 75(1), pages 103-122, January.
    4. Frédéric Ferraty & Ingrid Van Keilegom & Philippe Vieu, 2010. "On the Validity of the Bootstrap in Non‐Parametric Functional Regression," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 37(2), pages 286-306, June.
    5. Thulin, Måns, 2014. "A high-dimensional two-sample test for the mean using random subspaces," Computational Statistics & Data Analysis, Elsevier, vol. 74(C), pages 26-38.
    6. Shen, Yanfeng & Lin, Zhengyan, 2015. "An adaptive test for the mean vector in large-p-small-n problems," Computational Statistics & Data Analysis, Elsevier, vol. 89(C), pages 25-38.
    7. Tyler, David E., 2010. "A note on multivariate location and scatter statistics for sparse data sets," Statistics & Probability Letters, Elsevier, vol. 80(17-18), pages 1409-1413, September.
    8. Ana-Maria Staicu & Yingxing Li & Ciprian M. Crainiceanu & David Ruppert, 2014. "Likelihood Ratio Tests for Dependent Data with Applications to Longitudinal and Functional Data Analysis," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 41(4), pages 932-949, December.
    9. Livio Corain & Viatcheslav Melas & Andrey Pepelyshev & Luigi Salmaso, 2014. "New insights on permutation approach for hypothesis testing on functional data," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 8(3), pages 339-356, September.
    10. Antoniadis, Anestis & Sapatinas, Theofanis, 2007. "Estimation and inference in functional mixed-effects models," Computational Statistics & Data Analysis, Elsevier, vol. 51(10), pages 4793-4813, June.
    11. Mondal, Pronoy K. & Biswas, Munmun & Ghosh, Anil K., 2015. "On high dimensional two-sample tests based on nearest neighbors," Journal of Multivariate Analysis, Elsevier, vol. 141(C), pages 168-178.
    12. A. Pini & S. Vantini, 2017. "Interval-wise testing for functional data," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 29(2), pages 407-424, April.
    13. Katayama, Shota & Kano, Yutaka & Srivastava, Muni S., 2013. "Asymptotic distributions of some test criteria for the mean vector with fewer observations than the dimension," Journal of Multivariate Analysis, Elsevier, vol. 116(C), pages 410-421.
    14. Ma, Yingying & Lan, Wei & Wang, Hansheng, 2015. "A high dimensional two-sample test under a low dimensional factor structure," Journal of Multivariate Analysis, Elsevier, vol. 140(C), pages 162-170.
    15. Alessia Pini & Simone Vantini & Bianca Maria Colosimo & Marco Grasso, 2018. "Domain‐selective functional analysis of variance for supervised statistical profile monitoring of signal data," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 67(1), pages 55-81, January.
    16. Cuevas, Antonio & Febrero, Manuel & Fraiman, Ricardo, 2006. "On the use of the bootstrap for estimating functions with functional data," Computational Statistics & Data Analysis, Elsevier, vol. 51(2), pages 1063-1074, November.
    17. Zhang, Jie & Pan, Meng, 2016. "A high-dimension two-sample test for the mean using cluster subspaces," Computational Statistics & Data Analysis, Elsevier, vol. 97(C), pages 87-97.
    18. Srivastava, Muni S., 2009. "A test for the mean vector with fewer observations than the dimension under non-normality," Journal of Multivariate Analysis, Elsevier, vol. 100(3), pages 518-532, March.
    19. Hron, K. & Menafoglio, A. & Templ, M. & Hrůzová, K. & Filzmoser, P., 2016. "Simplicial principal component analysis for density functions in Bayes spaces," Computational Statistics & Data Analysis, Elsevier, vol. 94(C), pages 330-350.
    20. Alessia Pini & Simone Vantini, 2016. "The interval testing procedure: A general framework for inference in functional data analysis," Biometrics, The International Biometric Society, vol. 72(3), pages 835-845, September.
    21. Ricardo Fraiman & Leonardo Moreno & Sebastian Vallejo, 2017. "Some hypothesis tests based on random projection," Computational Statistics, Springer, vol. 32(3), pages 1165-1189, September.
    22. Freedman, David & Lane, David, 1983. "A Nonstochastic Interpretation of Reported Significance Levels," Journal of Business & Economic Statistics, American Statistical Association, vol. 1(4), pages 292-298, October.
    23. Chen, Song Xi & Qin, Yingli, 2010. "A Two Sample Test for High Dimensional Data with Applications to Gene-set Testing," MPRA Paper 59642, University Library of Munich, Germany.
    24. Ayyala, Deepak Nag & Park, Junyong & Roy, Anindya, 2017. "Mean vector testing for high-dimensional dependent observations," Journal of Multivariate Analysis, Elsevier, vol. 153(C), pages 136-155.
    25. Feng, Long & Sun, Fasheng, 2015. "A note on high-dimensional two-sample test," Statistics & Probability Letters, Elsevier, vol. 105(C), pages 29-36.
    26. Matilde Dalla Rosa & Laura Sangalli & Simone Vantini, 2014. "Principal differential analysis of the Aneurisk65 data set," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 8(3), pages 287-302, September.
    27. Hervé Cardot & Luboš Prchal & Pascal Sarda, 2007. "No effect and lack-of-fit permutation tests for functional regression," Computational Statistics, Springer, vol. 22(3), pages 371-390, September.
    28. Menafoglio, Alessandra & Petris, Giovanni, 2016. "Kriging for Hilbert-space valued random fields: The operatorial point of view," Journal of Multivariate Analysis, Elsevier, vol. 146(C), pages 84-94.
    29. Munk, A. & Paige, R. & Pang, J. & Patrangenaru, V. & Ruymgaart, F., 2008. "The one- and multi-sample problem for functional data with application to projective shape analysis," Journal of Multivariate Analysis, Elsevier, vol. 99(5), pages 815-833, May.
    30. Zhao, Junguang & Xu, Xingzhong, 2016. "A generalized likelihood ratio test for normal mean when p is greater than n," Computational Statistics & Data Analysis, Elsevier, vol. 99(C), pages 91-104.
    31. Dennis D. Cox & Jong Soo Lee, 2008. "Pointwise testing with functional data using the Westfall--Young randomization method," Biometrika, Biometrika Trust, vol. 95(3), pages 621-634.
    32. Li, Yang & Wang, Zhaojun & Zou, Changliang, 2016. "A simpler spatial-sign-based two-sample test for high-dimensional data," Journal of Multivariate Analysis, Elsevier, vol. 149(C), pages 192-198.
    33. Cuevas, Antonio & Febrero, Manuel & Fraiman, Ricardo, 2004. "An anova test for functional data," Computational Statistics & Data Analysis, Elsevier, vol. 47(1), pages 111-122, August.
    34. Schott, James R., 2007. "Some high-dimensional tests for a one-way MANOVA," Journal of Multivariate Analysis, Elsevier, vol. 98(9), pages 1825-1839, October.
    35. Florentina Bunea & Andrada E. Ivanescu & Marten H. Wegkamp, 2011. "Adaptive inference for the mean of a Gaussian process in functional data," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 73(4), pages 531-538, September.
    36. Yamada, Takayuki & Himeno, Tetsuto, 2015. "Testing homogeneity of mean vectors under heteroscedasticity in high-dimension," Journal of Multivariate Analysis, Elsevier, vol. 139(C), pages 7-27.
    37. Saha, Enakshi & Sarkar, Soham & Ghosh, Anil K., 2017. "Some high-dimensional one-sample tests based on functions of interpoint distances," Journal of Multivariate Analysis, Elsevier, vol. 161(C), pages 83-95.
    38. Spitzner D.J. & Marron J.S. & Essick G.K., 2003. "Mixed-Model Functional ANOVA for Studying Human Tactile Perception," Journal of the American Statistical Association, American Statistical Association, vol. 98, pages 263-272, January.
    39. Gina-Maria Pomann & Ana-Maria Staicu & Sujit Ghosh, 2016. "A two-sample distribution-free test for functional data with application to a diffusion tensor imaging study of multiple sclerosis," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 65(3), pages 395-414, April.
    40. Ferraty, Frederic & Van Keilegom, Ingrid & Vieu, Philippe, 2010. "On the Validity of the Bootstrap in Non-Parametric Functional Regression," LIDAM Reprints ISBA 2010018, Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA).
    41. Hao Chen & Jerome H. Friedman, 2017. "A New Graph-Based Two-Sample Test for Multivariate and Object Data," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(517), pages 397-409, January.
    42. J. Cuesta-Albertos & M. Febrero-Bande, 2010. "A simple multiway ANOVA for functional data," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 19(3), pages 537-557, November.
    43. Guanqun Cao & Lijian Yang & David Todem, 2012. "Simultaneous inference for the mean function based on dense functional data," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 24(2), pages 359-377.
    44. Peter Hall, 2002. "Permutation tests for equality of distributions in high-dimensional settings," Biometrika, Biometrika Trust, vol. 89(2), pages 359-374, June.
    45. Srivastava, Muni S. & Katayama, Shota & Kano, Yutaka, 2013. "A two sample test in high dimensional data," Journal of Multivariate Analysis, Elsevier, vol. 114(C), pages 349-358.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Kraus, David, 2019. "Inferential procedures for partially observed functional data," Journal of Multivariate Analysis, Elsevier, vol. 173(C), pages 583-603.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Qiu, Zhiping & Chen, Jianwei & Zhang, Jin-Ting, 2021. "Two-sample tests for multivariate functional data with applications," Computational Statistics & Data Analysis, Elsevier, vol. 157(C).
    2. Harrar, Solomon W. & Kong, Xiaoli, 2022. "Recent developments in high-dimensional inference for multivariate data: Parametric, semiparametric and nonparametric approaches," Journal of Multivariate Analysis, Elsevier, vol. 188(C).
    3. Pini, Alessia & Spreafico, Lorenzo & Vantini, Simone & Vietti, Alessandro, 2019. "Multi-aspect local inference for functional data: Analysis of ultrasound tongue profiles," Journal of Multivariate Analysis, Elsevier, vol. 170(C), pages 162-185.
    4. Pini, Alessia & Sørensen, Helle & Tolver, Anders & Vantini, Simone, 2023. "Local inference for functional linear mixed models," Computational Statistics & Data Analysis, Elsevier, vol. 181(C).
    5. Zhang, Jin-Ting & Guo, Jia & Zhou, Bu, 2017. "Linear hypothesis testing in high-dimensional one-way MANOVA," Journal of Multivariate Analysis, Elsevier, vol. 155(C), pages 200-216.
    6. A. Pini & S. Vantini, 2017. "Interval-wise testing for functional data," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 29(2), pages 407-424, April.
    7. Shin-ichi Tsukada, 2019. "High dimensional two-sample test based on the inter-point distance," Computational Statistics, Springer, vol. 34(2), pages 599-615, June.
    8. Zhang, Jin-Ting & Zhu, Tianming, 2022. "A new normal reference test for linear hypothesis testing in high-dimensional heteroscedastic one-way MANOVA," Computational Statistics & Data Analysis, Elsevier, vol. 168(C).
    9. Jiang, Qing & Hušková, Marie & Meintanis, Simos G. & Zhu, Lixing, 2019. "Asymptotics, finite-sample comparisons and applications for two-sample tests with functional data," Journal of Multivariate Analysis, Elsevier, vol. 170(C), pages 202-220.
    10. Huang, Yuan & Li, Changcheng & Li, Runze & Yang, Songshan, 2022. "An overview of tests on high-dimensional means," Journal of Multivariate Analysis, Elsevier, vol. 188(C).
    11. Zhao, Junguang & Xu, Xingzhong, 2016. "A generalized likelihood ratio test for normal mean when p is greater than n," Computational Statistics & Data Analysis, Elsevier, vol. 99(C), pages 91-104.
    12. Jin-Ting Zhang & Bu Zhou & Jia Guo, 2022. "Testing high-dimensional mean vector with applications," Statistical Papers, Springer, vol. 63(4), pages 1105-1137, August.
    13. Paul, Biplab & De, Shyamal K. & Ghosh, Anil K., 2022. "Some clustering-based exact distribution-free k-sample tests applicable to high dimension, low sample size data," Journal of Multivariate Analysis, Elsevier, vol. 190(C).
    14. Rafael Meléndez & Ramón Giraldo & Víctor Leiva, 2020. "Sign, Wilcoxon and Mann-Whitney Tests for Functional Data: An Approach Based on Random Projections," Mathematics, MDPI, vol. 9(1), pages 1-11, December.
    15. Konrad Abramowicz & Alessia Pini & Lina Schelin & Sara Sjöstedt de Luna & Aymeric Stamm & Simone Vantini, 2023. "Domain selection and familywise error rate for functional data: A unified framework," Biometrics, The International Biometric Society, vol. 79(2), pages 1119-1132, June.
    16. Jiang Hu & Zhidong Bai & Chen Wang & Wei Wang, 2017. "On testing the equality of high dimensional mean vectors with unequal covariance matrices," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 69(2), pages 365-387, April.
    17. Kraus, David, 2019. "Inferential procedures for partially observed functional data," Journal of Multivariate Analysis, Elsevier, vol. 173(C), pages 583-603.
    18. Mondal, Pronoy K. & Biswas, Munmun & Ghosh, Anil K., 2015. "On high dimensional two-sample tests based on nearest neighbors," Journal of Multivariate Analysis, Elsevier, vol. 141(C), pages 168-178.
    19. Amanda Plunkett & Junyong Park, 2019. "Two-sample test for sparse high-dimensional multinomial distributions," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 28(3), pages 804-826, September.
    20. Yuanyuan Jiang & Xingzhong Xu, 2022. "A Two-Sample Test of High Dimensional Means Based on Posterior Bayes Factor," Mathematics, MDPI, vol. 10(10), pages 1-23, May.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:jmvana:v:167:y:2018:i:c:p:284-305. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/wps/find/journaldescription.cws_home/622892/description#description .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.