IDEAS home Printed from https://ideas.repec.org/a/eee/jmvana/v205y2025ics0047259x2400085x.html
   My bibliography  Save this article

A conditional distribution function-based measure for independence and K-sample tests in multivariate data

Author

Listed:
  • Wang, Li
  • Zhou, Hongyi
  • Ma, Weidong
  • Yang, Ying

Abstract

We introduce a new index to measure the degree of dependence and test for independence between two random vectors. The index is obtained by generalizing the Cramér–von Mises distances between the conditional and marginal distribution functions via the projection-averaging technique. If one of the random vectors is categorical with K categories, we propose slicing estimators to estimate our index. We conduct an asymptotic analysis for the slicing estimators, considering both situations where K is fixed and where K is allowed to increase with the sample size. When both random vectors are continuous, we introduce a kernel regression estimator for the proposed index, demonstrating that its asymptotic null distribution follows a normal distribution and conducting a local power analysis for the kernel estimator-based independence test. The proposed tests are studied via simulations, with a real data application presented to illustrate our methods.

Suggested Citation

  • Wang, Li & Zhou, Hongyi & Ma, Weidong & Yang, Ying, 2025. "A conditional distribution function-based measure for independence and K-sample tests in multivariate data," Journal of Multivariate Analysis, Elsevier, vol. 205(C).
  • Handle: RePEc:eee:jmvana:v:205:y:2025:i:c:s0047259x2400085x
    DOI: 10.1016/j.jmva.2024.105378
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0047259X2400085X
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.jmva.2024.105378?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Escanciano, J. Carlos, 2006. "A Consistent Diagnostic Test For Regression Models Using Projections," Econometric Theory, Cambridge University Press, vol. 22(6), pages 1030-1051, December.
    2. Su, Liangjun & White, Halbert, 2007. "A consistent characteristic function-based test for conditional independence," Journal of Econometrics, Elsevier, vol. 141(2), pages 807-834, December.
    3. Chenlu Ke & Xiangrong Yin, 2020. "Expected Conditional Characteristic Function-based Measures for Testing Independence," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 115(530), pages 985-996, April.
    4. Holger Dette & Karl F. Siburg & Pavel A. Stoimenov, 2013. "A Copula-Based Non-parametric Measure of Regression Dependence," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 40(1), pages 21-41, March.
    5. Cui, Hengjian & Zhong, Wei, 2019. "A distribution-free test of independence based on mean variance index," Computational Statistics & Data Analysis, Elsevier, vol. 139(C), pages 117-133.
    6. Fan, Yanqin & Li, Qi, 1996. "Consistent Model Specification Tests: Omitted Variables and Semiparametric Functional Forms," Econometrica, Econometric Society, vol. 64(4), pages 865-890, July.
    7. Jin, Ze & Matteson, David S., 2018. "Generalizing distance covariance to measure and test multivariate mutual dependence via complete and incomplete V-statistics," Journal of Multivariate Analysis, Elsevier, vol. 168(C), pages 304-322.
    8. Zhanrui Cai & Jing Lei & Kathryn Roeder, 2024. "Asymptotic Distribution-Free Independence Test for High-Dimension Data," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 119(547), pages 1794-1804, July.
    9. Runze Li & Kai Xu & Yeqing Zhou & Liping Zhu, 2023. "Testing the Effects of High-Dimensional Covariates via Aggregating Cumulative Covariances," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 118(543), pages 2184-2194, July.
    10. Hengjian Cui & Runze Li & Wei Zhong, 2015. "Model-Free Feature Screening for Ultrahigh Dimensional Discriminant Analysis," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(510), pages 630-641, June.
    11. Hart, Jeffrey D., 2016. "A nonparametric test of stationarity for independent data," Statistics & Probability Letters, Elsevier, vol. 108(C), pages 40-44.
    12. M. D. Jiménez-Gamero & M. Cousido-Rocha & M. V. Alba-Fernández & F. Jiménez-Jiménez, 2022. "Testing the equality of a large number of populations," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 31(1), pages 1-21, March.
    13. Xin Chen & R. Dennis Cook & Changliang Zou, 2015. "Diagnostic studies in sufficient dimension reduction," Biometrika, Biometrika Trust, vol. 102(3), pages 545-558.
    14. Efang Kong & Yingcun Xia & Wei Zhong, 2019. "Composite Coefficient of Determination and Its Application in Ultrahigh Dimensional Variable Screening," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 114(528), pages 1740-1751, October.
    15. Nabarun Deb & Bodhisattva Sen, 2023. "Multivariate Rank-Based Distribution-Free Nonparametric Testing Using Measure Transportation," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 118(541), pages 192-207, January.
    16. Niklas Pfister & Peter Bühlmann & Bernhard Schölkopf & Jonas Peters, 2018. "Kernel‐based tests for joint independence," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 80(1), pages 5-31, January.
    17. Dette, Holger & Dörnemann, Nina, 2020. "Likelihood ratio tests for many groups in high dimensions," Journal of Multivariate Analysis, Elsevier, vol. 178(C).
    18. Sourav Chatterjee, 2021. "A New Coefficient of Correlation," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 116(536), pages 2009-2022, October.
    19. Liping Zhu & Kai Xu & Runze Li & Wei Zhong, 2017. "Projection correlation between two random vectors," Biometrika, Biometrika Trust, vol. 104(4), pages 829-843.
    20. Hongjian Shi & Mathias Drton & Fang Han, 2022. "Distribution-Free Consistent Independence Tests via Center-Outward Ranks and Signs," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 117(537), pages 395-410, January.
    21. D. Zhan & J. D. Hart, 2014. "Testing equality of a large number of densities," Biometrika, Biometrika Trust, vol. 101(2), pages 449-464.
    22. Xueqin Wang & Wenliang Pan & Wenhao Hu & Yuan Tian & Heping Zhang, 2015. "Conditional Distance Correlation," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(512), pages 1726-1734, December.
    23. Yaowu Zhang & Liping Zhu, 2024. "Projective independence tests in high dimensions: the curses and the cures," Biometrika, Biometrika Trust, vol. 111(3), pages 1013-1027.
    24. Marta Cousido-Rocha & Jacobo Uña-Álvarez & Jeffrey D. Hart, 2019. "Testing equality of a large number of densities under mixing conditions," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 28(4), pages 1203-1228, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ke, Chenlu & Yang, Wei & Yuan, Qingcong & Li, Lu, 2023. "Partial sufficient variable screening with categorical controls," Computational Statistics & Data Analysis, Elsevier, vol. 187(C).
    2. Dai, Shengtao & Song, Xiaojun, 2025. "Consistent tests for semiparametric conditional independence," Statistics & Probability Letters, Elsevier, vol. 216(C).
    3. Xu, Kai & Cheng, Qing, 2024. "Test of conditional independence in factor models via Hilbert–Schmidt independence criterion," Journal of Multivariate Analysis, Elsevier, vol. 199(C).
    4. Zhang, Qingyang, 2023. "On the asymptotic null distribution of the symmetrized Chatterjee’s correlation coefficient," Statistics & Probability Letters, Elsevier, vol. 194(C).
    5. Zhang, Wei & Gao, Wei & Ng, Hon Keung Tony, 2023. "Multivariate tests of independence based on a new class of measures of independence in Reproducing Kernel Hilbert Space," Journal of Multivariate Analysis, Elsevier, vol. 195(C).
    6. Zhou, Yeqing & Liu, Jingyuan & Zhu, Liping, 2020. "Test for conditional independence with application to conditional screening," Journal of Multivariate Analysis, Elsevier, vol. 175(C).
    7. Xuehu Zhu & Jun Lu & Jun Zhang & Lixing Zhu, 2021. "Testing for conditional independence: A groupwise dimension reduction‐based adaptive‐to‐model approach," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 48(2), pages 549-576, June.
    8. Zhang, Yaowu & Zhou, Yeqing & Zhu, Liping, 2024. "A post-screening diagnostic study for ultrahigh dimensional data," Journal of Econometrics, Elsevier, vol. 239(2).
    9. Liu, Jicai & Si, Yuefeng & Niu, Yong & Zhang, Riquan, 2022. "Projection quantile correlation and its use in high-dimensional grouped variable screening," Computational Statistics & Data Analysis, Elsevier, vol. 167(C).
    10. Xiaohong Chen & Sokbae Lee & Myung Hwan Seo & Myunghyun Song, 2020. "Inference for parameters identified by conditional moment restrictions using a generalized Bierens maximum statistic," Papers 2008.11140, arXiv.org, revised Oct 2024.
    11. Wang, Xuexin, 2015. "A Note on Consistent Conditional Moment Tests," MPRA Paper 69005, University Library of Munich, Germany.
    12. Juan Carlos Escanciano & Kyungchul Song, 2007. "Asymptotically Optimal Tests for Single-Index Restrictions with a Focus on Average Partial Effects," PIER Working Paper Archive 07-005, Penn Institute for Economic Research, Department of Economics, University of Pennsylvania.
    13. M. D. Jiménez-Gamero & M. Cousido-Rocha & M. V. Alba-Fernández & F. Jiménez-Jiménez, 2022. "Testing the equality of a large number of populations," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 31(1), pages 1-21, March.
    14. Zhang, Qingyang, 2019. "Independence test for large sparse contingency tables based on distance correlation," Statistics & Probability Letters, Elsevier, vol. 148(C), pages 17-22.
    15. M. D. Jiménez-Gamero, 2024. "Testing normality of a large number of populations," Statistical Papers, Springer, vol. 65(1), pages 435-465, February.
    16. Azadkia, Mona & Chatterjee, Sourav, 2021. "A simple measure of conditional dependence," LSE Research Online Documents on Economics 125584, London School of Economics and Political Science, LSE Library.
    17. Escanciano, Juan Carlos & Song, Kyungchul, 2010. "Testing single-index restrictions with a focus on average derivatives," Journal of Econometrics, Elsevier, vol. 156(2), pages 377-391, June.
    18. Fang Han, 2024. "An Introduction to Permutation Processes (version 0.5)," Papers 2407.09664, arXiv.org.
    19. Fan, Jianqing & Feng, Yang & Xia, Lucy, 2020. "A projection-based conditional dependence measure with applications to high-dimensional undirected graphical models," Journal of Econometrics, Elsevier, vol. 218(1), pages 119-139.
    20. Shi, Chengchun & Xu, Tianlin & Bergsma, Wicher & Li, Lexin, 2021. "Double generative adversarial networks for conditional independence testing," LSE Research Online Documents on Economics 112550, London School of Economics and Political Science, LSE Library.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:jmvana:v:205:y:2025:i:c:s0047259x2400085x. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/wps/find/journaldescription.cws_home/622892/description#description .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.