IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0211044.html
   My bibliography  Save this article

A novel scale-space approach for multinormality testing and the k-sample problem in the high dimension low sample size scenario

Author

Listed:
  • Kristian Hindberg
  • Jan Hannig
  • Fred Godtliebsen

Abstract

Two classical multivariate statistical problems, testing of multivariate normality and the k-sample problem, are explored by a novel analysis on several resolutions simultaneously. The presented methods do not invert any estimated covariance matrix. Thereby, the methods work in the High Dimension Low Sample Size situation, i.e. when n ≤ p. The output, a significance map, is produced by doing a one-dimensional test for all possible resolution/position pairs. The significance map shows for which resolution/position pairs the null hypothesis is rejected. For the testing of multinormality, the Anderson-Darling test is utilized to detect potential departures from multinormality at different combinations of resolutions and positions. In the k-sample case, it is tested whether k data sets can be said to originate from the same unspecified discrete or continuous multivariate distribution. This is done by testing the k vectors corresponding to the same resolution/position pair of the k different data sets through the k-sample Anderson-Darling test. Successful demonstrations of the new methodology on artificial and real data sets are presented, and a feature selection scheme is demonstrated.

Suggested Citation

  • Kristian Hindberg & Jan Hannig & Fred Godtliebsen, 2019. "A novel scale-space approach for multinormality testing and the k-sample problem in the high dimension low sample size scenario," PLOS ONE, Public Library of Science, vol. 14(1), pages 1-20, January.
  • Handle: RePEc:plo:pone00:0211044
    DOI: 10.1371/journal.pone.0211044
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0211044
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0211044&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0211044?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. D. R. Cox & Nanny Wermuth, 1994. "Tests of Linearity, Multivariate Normality and the Adequacy of Linear Scores," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 43(2), pages 347-355, June.
    2. Lasse Holmström & Leena Pasanen, 2017. "Statistical Scale Space Methods," International Statistical Review, International Statistical Institute, vol. 85(1), pages 1-30, April.
    3. Marsaglia, George & Marsaglia, John, 2004. "Evaluating the Anderson-Darling Distribution," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 9(i02).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. BenSaïda, Ahmed & Slim, Skander, 2016. "Highly flexible distributions to fit multiple frequency financial returns," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 442(C), pages 203-213.
    2. Fernández de Marcos Giménez de los Galanes, Alberto & García Portugués, Eduardo, 2022. "Data-driven stabilizations of goodness-of-fit tests," DES - Working Papers. Statistics and Econometrics. WS 35324, Universidad Carlos III de Madrid. Departamento de Estadística.
    3. Shibin Zhang & Xin M. Tu, 2022. "Tests for comparing time‐invariant and time‐varying spectra based on the Anderson–Darling statistic," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 76(3), pages 254-282, August.
    4. Zinoviy Landsman & Udi Makov & Tomer Shushi, 2017. "Extended Generalized Skew-Elliptical Distributions and their Moments," Sankhya A: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 79(1), pages 76-100, February.
    5. Hocine Khelifa & Eric Vagnon & Abderrahmane Beroual, 2023. "Effect of Fullerene and Graphene Nanoparticles on the AC Dielectric Strength of Natural Ester," Energies, MDPI, vol. 16(4), pages 1-11, February.
    6. Nanny Wermuth & Kayvan Sadeghi, 2012. "Sequences of regressions and their independences," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 21(2), pages 215-252, June.
    7. Fernández-de-Marcos, Alberto & García-Portugués, Eduardo, 2023. "Data-driven stabilizations of goodness-of-fit tests," Computational Statistics & Data Analysis, Elsevier, vol. 179(C).
    8. Dobric, Jadran & Schmid, Friedrich, 2007. "A goodness of fit test for copulas based on Rosenblatt's transformation," Computational Statistics & Data Analysis, Elsevier, vol. 51(9), pages 4633-4642, May.
    9. Sung Ik Kim, 2022. "ARMA–GARCH model with fractional generalized hyperbolic innovations," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 8(1), pages 1-25, December.
    10. Daniele Coin, 2017. "A goodness-of-fit test for Generalized Error Distribution," Temi di discussione (Economic working papers) 1096, Bank of Italy, Economic Research and International Relations Area.
    11. Riccardo Lucchetti & Claudia Pigini, 2013. "A test for bivariate normality with applications in microeconometric models," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 22(4), pages 535-572, November.
    12. Norbert Henze, 2002. "Invariant tests for multivariate normality: a critical review," Statistical Papers, Springer, vol. 43(4), pages 467-506, October.
    13. Vuollo, Ville & Holmström, Lasse, 2018. "A scale space approach for exploring structure in spherical data," Computational Statistics & Data Analysis, Elsevier, vol. 125(C), pages 57-69.
    14. Konstantinos Leptokaropoulos & Catherine A. Rychert & Nicholas Harmon & David Schlaphorst & Ingo Grevemeyer & John-Michael Kendall & Satish C. Singh, 2023. "Broad fault zones enable deep fluid transport and limit earthquake magnitudes," Nature Communications, Nature, vol. 14(1), pages 1-11, December.
    15. Jurgen A. Doornik & Henrik Hansen, 2008. "An Omnibus Test for Univariate and Multivariate Normality," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 70(s1), pages 927-939, December.
    16. Gomes-Gonçalves, Erika & Gzyl, Henryk & Mayoral, Silvia, 2015. "Two maxentropic approaches to determine the probability density of compound risk losses," Insurance: Mathematics and Economics, Elsevier, vol. 62(C), pages 42-53.
    17. Ttofi, Maria M. & Farrington, David P. & Piquero, Alex R. & Lösel, Friedrich & DeLisi, Matthew & Murray, Joseph, 2016. "Intelligence as a protective factor against offending: A meta-analytic review of prospective longitudinal studies," Journal of Criminal Justice, Elsevier, vol. 45(C), pages 4-18.
    18. Asmerilda Hitaj & Lorenzo Mercuri & Edit Rroji, 2019. "Sensitivity analysis of Mixed Tempered Stable parameters with implications in portfolio optimization," Computational Management Science, Springer, vol. 16(1), pages 71-95, February.
    19. Riccardo LUCCHETTI & Claudia PIGINI, 2011. "Conditional Moment Tests for Normality in Bivariate Limited Dependent Variable Models: a Monte Carlo Study," Working Papers 357, Universita' Politecnica delle Marche (I), Dipartimento di Scienze Economiche e Sociali.
    20. Foraita, Ronja & Klasen, Stephan & Pigeot, Iris, 2008. "Using graphical chain models to analyze differences in structural correlates of undernutrition in Benin and Bangladesh," Economics & Human Biology, Elsevier, vol. 6(3), pages 398-419, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0211044. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.