IDEAS home Printed from https://ideas.repec.org/a/bla/istatr/v88y2020i3p698-714.html
   My bibliography  Save this article

Graphical Comparison of High‐Dimensional Distributions

Author

Listed:
  • Reza Modarres

Abstract

We consider groups of observations in Rd and present a simultaneous plot of the empirical cumulative distribution functions of the within and between interpoint distances to visualise and examine the equality of the underlying distribution functions of the observations. We provide several examples to illustrate how such plots can be utilised to envision and canvass the relationship between the two distributions under location, scale, dependence and shape changes. We suggest new statistics for testing the equality of k distributions and extend the simultaneous plots to visualise them. We compare the new statistics to existing tests based on the interpoint distances.

Suggested Citation

  • Reza Modarres, 2020. "Graphical Comparison of High‐Dimensional Distributions," International Statistical Review, International Statistical Institute, vol. 88(3), pages 698-714, December.
  • Handle: RePEc:bla:istatr:v:88:y:2020:i:3:p:698-714
    DOI: 10.1111/insr.12358
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/insr.12358
    Download Restriction: no

    File URL: https://libkey.io/10.1111/insr.12358?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Biswas, Munmun & Ghosh, Anil K., 2014. "A nonparametric two-sample test applicable to high dimensional data," Journal of Multivariate Analysis, Elsevier, vol. 123(C), pages 160-171.
    2. Paul R. Rosenbaum, 2005. "An exact distribution‐free test comparing two multivariate distributions based on adjacency," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 67(4), pages 515-530, September.
    3. Zhenyu Liu & Reza Modarres, 2011. "A triangle test for equality of distribution functions in high dimensions," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 23(3), pages 605-615.
    4. Yu Song & Reza Modarres, 2019. "Interpoint Distance Test of Homogeneity for Multivariate Mixture Models," International Statistical Review, International Statistical Institute, vol. 87(3), pages 613-638, December.
    5. D. Zhan & J. D. Hart, 2014. "Testing equality of a large number of densities," Biometrika, Biometrika Trust, vol. 101(2), pages 449-464.
    6. Giovagnoli, Alessandra & Wynn, H. P., 1995. "Multivariate dispersion orderings," Statistics & Probability Letters, Elsevier, vol. 22(4), pages 325-332, March.
    7. Peter Hall, 2002. "Permutation tests for equality of distributions in high-dimensional settings," Biometrika, Biometrika Trust, vol. 89(2), pages 359-374, June.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Modarres, Reza, 2022. "A high dimensional dissimilarity measure," Computational Statistics & Data Analysis, Elsevier, vol. 175(C).
    2. Modarres, Reza, 2023. "Analysis of distance matrices," Statistics & Probability Letters, Elsevier, vol. 193(C).
    3. Paul, Biplab & De, Shyamal K. & Ghosh, Anil K., 2022. "Some clustering-based exact distribution-free k-sample tests applicable to high dimension, low sample size data," Journal of Multivariate Analysis, Elsevier, vol. 190(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Shin-ichi Tsukada, 2019. "High dimensional two-sample test based on the inter-point distance," Computational Statistics, Springer, vol. 34(2), pages 599-615, June.
    2. Paul, Biplab & De, Shyamal K. & Ghosh, Anil K., 2022. "Some clustering-based exact distribution-free k-sample tests applicable to high dimension, low sample size data," Journal of Multivariate Analysis, Elsevier, vol. 190(C).
    3. Mondal, Pronoy K. & Biswas, Munmun & Ghosh, Anil K., 2015. "On high dimensional two-sample tests based on nearest neighbors," Journal of Multivariate Analysis, Elsevier, vol. 141(C), pages 168-178.
    4. Lovato, Ilenia & Pini, Alessia & Stamm, Aymeric & Vantini, Simone, 2020. "Model-free two-sample test for network-valued data," Computational Statistics & Data Analysis, Elsevier, vol. 144(C).
    5. Biswas, Munmun & Ghosh, Anil K., 2014. "A nonparametric two-sample test applicable to high dimensional data," Journal of Multivariate Analysis, Elsevier, vol. 123(C), pages 160-171.
    6. Reza Modarres & Yu Song, 2020. "Multivariate power series interpoint distances," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 29(4), pages 955-982, December.
    7. Cousido-Rocha, Marta & de Uña-Álvarez, Jacobo & Hart, Jeffrey D., 2019. "A two-sample test for the equality of univariate marginal distributions for high-dimensional data," Journal of Multivariate Analysis, Elsevier, vol. 174(C).
    8. Liu, Zhi & Xia, Xiaochao & Zhou, Wang, 2015. "A test for equality of two distributions via jackknife empirical likelihood and characteristic functions," Computational Statistics & Data Analysis, Elsevier, vol. 92(C), pages 97-114.
    9. Nicolas Städler & Sach Mukherjee, 2017. "Two-sample testing in high dimensions," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 79(1), pages 225-246, January.
    10. Modarres, Reza, 2014. "On the interpoint distances of Bernoulli vectors," Statistics & Probability Letters, Elsevier, vol. 84(C), pages 215-222.
    11. W. Lok & Stephen Lee, 2011. "A new statistical depth function with applications to multimodal data," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 23(3), pages 617-631.
    12. Lingzhe Guo & Reza Modarres, 2020. "Testing the equality of matrix distributions," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 29(2), pages 289-307, June.
    13. Luai Al-Labadi & Forough Fazeli Asl & Zahra Saberi, 2022. "A Bayesian nonparametric multi-sample test in any dimension," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 106(2), pages 217-242, June.
    14. Ruiyi Zhang & R. Todd Ogden & Martin Picard & Anuj Srivastava, 2022. "Nonparametric k‐sample test on shape spaces with applications to mitochondrial shape analysis," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 71(1), pages 51-69, January.
    15. Pini, Alessia & Stamm, Aymeric & Vantini, Simone, 2018. "Hotelling’s T2 in separable Hilbert spaces," Journal of Multivariate Analysis, Elsevier, vol. 167(C), pages 284-305.
    16. Petrie, Adam, 2016. "Graph-theoretic multisample tests of equality in distribution for high dimensional data," Computational Statistics & Data Analysis, Elsevier, vol. 96(C), pages 145-158.
    17. Romanazzi, Mario, 2009. "Data depth, random simplices and multivariate dispersion," Statistics & Probability Letters, Elsevier, vol. 79(12), pages 1473-1479, June.
    18. Federico A. Bugni & Joel L. Horowitz, 2021. "Permutation tests for equality of distributions of functional data," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 36(7), pages 861-877, November.
    19. Zhi Peng Ong & Aixiang Andy Chen & Tianming Zhu & Jin-Ting Zhang, 2023. "Testing Equality of Several Distributions at High Dimensions: A Maximum-Mean-Discrepancy-Based Approach," Mathematics, MDPI, vol. 11(20), pages 1-21, October.
    20. Guillermo Ayala & María Concepción López-Díaz & Miguel López-Díaz & Lucía Martínez-Costa, 2015. "Methods and Algorithms to Test the Hausdorff and Simplex Dispersion Orders with an R Package," Methodology and Computing in Applied Probability, Springer, vol. 17(3), pages 661-675, September.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:istatr:v:88:y:2020:i:3:p:698-714. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: https://edirc.repec.org/data/isiiinl.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.