IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2210.08149.html

Distance and Kernel-Based Measures for Global and Local Two-Sample Conditional Distribution Testing

Author

Listed:
  • Jian Yan
  • Zhuoxi Li
  • Xianyang Zhang

Abstract

Testing the equality of two conditional distributions is crucial in various modern applications, including transfer learning and causal inference. Despite its importance, this fundamental problem has received surprisingly little attention in the literature, with existing works focusing exclusively on global two-sample conditional distribution testing. Based on distance and kernel methods, this paper presents the first unified framework for both global and local two-sample conditional distribution testing. To this end, we introduce distance and kernel-based measures that characterize the homogeneity of two conditional distributions. Drawing from the concept of conditional U-statistics, we propose consistent estimators for these measures. Theoretically, we derive the convergence rates and the asymptotic distributions of the estimators under both the null and alternative hypotheses. Utilizing these measures, along with a local bootstrap approach, we develop global and local tests that can detect discrepancies between two conditional distributions at global and local levels, respectively. Our tests demonstrate reliable performance through simulations and real data analysis.

Suggested Citation

  • Jian Yan & Zhuoxi Li & Xianyang Zhang, 2022. "Distance and Kernel-Based Measures for Global and Local Two-Sample Conditional Distribution Testing," Papers 2210.08149, arXiv.org, revised Aug 2025.
  • Handle: RePEc:arx:papers:2210.08149
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2210.08149
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Guido W. Imbens & Jeffrey M. Wooldridge, 2009. "Recent Developments in the Econometrics of Program Evaluation," Journal of Economic Literature, American Economic Association, vol. 47(1), pages 5-86, March.
    2. Minsu Chang & Sokbae Lee & Yoon‐Jae Whang, 2015. "Nonparametric tests of conditional treatment effects with an application to single‐sex schooling on academic achievements," Econometrics Journal, Royal Economic Society, vol. 18(3), pages 307-346, October.
    3. Taamouti, Abderrahim & Bouezmarni, Taoufik & El Ghouch, Anouar, 2014. "Nonparametric estimation and inference for conditional density based Granger causality measures," Journal of Econometrics, Elsevier, vol. 180(2), pages 251-264.
    4. Wenceslao González-Manteiga & Rosa Crujeiras, 2013. "Rejoinder on: An updated review of Goodness-of-Fit tests for regression models," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 22(3), pages 442-447, September.
    5. Wenceslao González-Manteiga & Rosa Crujeiras, 2013. "An updated review of Goodness-of-Fit tests for regression models," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 22(3), pages 361-411, September.
    6. Liangjun Su & Martin Spindler, 2013. "Nonparametric Testing for Asymmetric Information," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 31(2), pages 208-225, April.
    7. Xueqin Wang & Wenliang Pan & Wenhao Hu & Yuan Tian & Heping Zhang, 2015. "Conditional Distance Correlation," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(512), pages 1726-1734, December.
    8. Chenlu Ke & Xiangrong Yin, 2020. "Expected Conditional Characteristic Function-based Measures for Testing Independence," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 115(530), pages 985-996, April.
    9. Shubhadeep Chakraborty & Xianyang Zhang, 2019. "Distance Metrics for Measuring Joint Dependence with Application to Causal Inference," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 114(528), pages 1638-1650, October.
    10. Xiaoyu Hu & Jing Lei, 2024. "A Two-Sample Conditional Distribution Test Using Conformal Prediction and Weighted Rank Sum," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 119(546), pages 1136-1154, April.
    11. Tarn Duong, 2013. "Local significant differences from nonparametric two-sample tests," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 25(3), pages 635-645, September.
    12. Sebastian Calonico & Matias D. Cattaneo & Rocio Titiunik, 2014. "Robust Nonparametric Confidence Intervals for Regression‐Discontinuity Designs," Econometrica, Econometric Society, vol. 82, pages 2295-2326, November.
    13. Richard K. Crump & V. Joseph Hotz & Guido W. Imbens & Oscar A. Mitnik, 2008. "Nonparametric Tests for Treatment Effect Heterogeneity," The Review of Economics and Statistics, MIT Press, vol. 90(3), pages 389-405, August.
    14. Su, Liangjun & White, Halbert, 2008. "A Nonparametric Hellinger Metric Test For Conditional Independence," Econometric Theory, Cambridge University Press, vol. 24(4), pages 829-864, August.
    15. repec:taf:jnlbes:v:30:y:2012:i:2:p:275-287 is not listed on IDEAS
    16. Federico A. Bugni & Ivan A. Canay & Deborah Kim, 2025. "Testing Conditional Stochastic Dominance at Target Points," Papers 2503.14747, arXiv.org, revised Nov 2025.
    17. Efstathios Paparoditis & Dimitris Politis, 2000. "The Local Bootstrap for Kernel Estimators under General Dependence Conditions," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 52(1), pages 139-159, March.
    18. Lavergne, Pascal, 2001. "An equality test across nonparametric regressions," Journal of Econometrics, Elsevier, vol. 103(1-2), pages 307-344, July.
    19. Jian Yan & Xianyang Zhang, 2023. "Kernel two-sample tests in high dimensions: interplay between moment discrepancy and dimension-and-sample orders," Biometrika, Biometrika Trust, vol. 110(2), pages 411-430.
    20. Szekely, Gábor J. & Rizzo, Maria L., 2005. "A new test for multivariate normality," Journal of Multivariate Analysis, Elsevier, vol. 93(1), pages 58-80, March.
    21. Shu Shen & Xiaohan Zhang, 2016. "Distributional Tests for Regression Discontinuity: Theory and Empirical Examples," The Review of Economics and Statistics, MIT Press, vol. 98(4), pages 685-700, October.
    22. Myoung‐jae Lee, 2009. "Non‐parametric tests for distributional treatment effect for randomly censored responses," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 71(1), pages 243-264, January.
    23. Fan, Yanqin & Li, Qi, 1996. "Consistent Model Specification Tests: Omitted Variables and Semiparametric Functional Forms," Econometrica, Econometric Society, vol. 64(4), pages 865-890, July.
    24. Hall, Peter, 1984. "Central limit theorem for integrated square error of multivariate nonparametric density estimators," Journal of Multivariate Analysis, Elsevier, vol. 14(1), pages 1-16, February.
    25. Su, Liangjun & White, Halbert, 2007. "A consistent characteristic function-based test for conditional independence," Journal of Econometrics, Elsevier, vol. 141(2), pages 807-834, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jian Yan & Zhuoxi Li & Yang Ning & Yong Chen, 2025. "Machine-Learning-Assisted Comparison of Regression Functions," Papers 2510.24714, arXiv.org.
    2. Taoufik Bouezmarni & Jeroen V.K. Rombouts & Abderrahim Taamouti, 2011. "Nonparametric Copula-Based Test for Conditional Independence with Applications to Granger Causality," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 30(2), pages 275-287, October.
    3. Zhou, Niwen & Guo, Xu & Zhu, Lixing, 2024. "Significance test for semiparametric conditional average treatment effects and other structural functions," Computational Statistics & Data Analysis, Elsevier, vol. 189(C).
    4. Pedro H. C. Sant’Anna, 2021. "Nonparametric Tests for Treatment Effect Heterogeneity With Duration Outcomes," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 39(3), pages 816-832, July.
    5. Sokbae (Simon) Lee & Yoon-Jae Whang, 2009. "Nonparametric tests of conditional treatment effects," CeMMAP working papers CWP36/09, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    6. Ruiz-Castillo, Javier, 2012. "From the “European Paradox” to a European Drama in citation impact," UC3M Working papers. Economics we1211, Universidad Carlos III de Madrid. Departamento de Economía.
    7. Su, Liangjun & White, Halbert, 2014. "Testing conditional independence via empirical likelihood," Journal of Econometrics, Elsevier, vol. 182(1), pages 27-44.
    8. Wang, Li & Zhou, Hongyi & Ma, Weidong & Yang, Ying, 2025. "A conditional distribution function-based measure for independence and K-sample tests in multivariate data," Journal of Multivariate Analysis, Elsevier, vol. 205(C).
    9. Taoufik Bouezmarni & Abderrahim Taamouti, 2014. "Nonparametric tests for conditional independence using conditional distributions," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 26(4), pages 697-719, December.
    10. Xuehu Zhu & Jun Lu & Jun Zhang & Lixing Zhu, 2021. "Testing for conditional independence: A groupwise dimension reduction‐based adaptive‐to‐model approach," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 48(2), pages 549-576, June.
    11. Sant’Anna, Pedro H.C. & Song, Xiaojun, 2019. "Specification tests for the propensity score," Journal of Econometrics, Elsevier, vol. 210(2), pages 379-404.
    12. Taoufik Bouezmarni & Mohamed Doukali & Abderrahim Taamouti, 2024. "Testing Granger non-causality in expectiles," Econometric Reviews, Taylor & Francis Journals, vol. 43(1), pages 30-51, January.
    13. Dai, Shengtao & Song, Xiaojun, 2025. "Consistent tests for semiparametric conditional independence," Statistics & Probability Letters, Elsevier, vol. 216(C).
    14. Yongzhen Feng & Jie Li & Xiaojun Song, 2025. "Testing linearity in semi-functional partially linear regression models," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 34(3), pages 786-814, September.
    15. Fan, Jianqing & Feng, Yang & Xia, Lucy, 2020. "A projection-based conditional dependence measure with applications to high-dimensional undirected graphical models," Journal of Econometrics, Elsevier, vol. 218(1), pages 119-139.
    16. Wei Huang & Oliver Linton & Zheng Zhang, 2022. "A Unified Framework for Specification Tests of Continuous Treatment Effect Models," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 40(4), pages 1817-1830, October.
    17. Dong, Hao & Taylor, Luke, 2022. "Nonparametric Significance Testing In Measurement Error Models," Econometric Theory, Cambridge University Press, vol. 38(3), pages 454-496, June.
    18. Xu Guo & Gao-Rong Li & Michael McAleer & Wing-Keung Wong, 2018. "Specification Testing of Production in a Stochastic Frontier Model," Sustainability, MDPI, vol. 10(9), pages 1-10, August.
    19. Xu Guo & Wangli Xu & Lixing Zhu, 2015. "Model checking for parametric regressions with response missing at random," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 67(2), pages 229-259, April.
    20. Zongwu Cai & Ying Fang & Ming Lin & Shengfang Tang, 2020. "Testing Unconfoundedness Assumption Using Auxiliary Variables," WORKING PAPERS SERIES IN THEORETICAL AND APPLIED ECONOMICS 202004, University of Kansas, Department of Economics, revised Feb 2020.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2210.08149. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.