IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2508.09040.html
   My bibliography  Save this paper

Bias correction for Chatterjee's graph-based correlation coefficient

Author

Listed:
  • Mona Azadkia
  • Leihao Chen
  • Fang Han

Abstract

Azadkia and Chatterjee (2021) recently introduced a simple nearest neighbor (NN) graph-based correlation coefficient that consistently detects both independence and functional dependence. Specifically, it approximates a measure of dependence that equals 0 if and only if the variables are independent, and 1 if and only if they are functionally dependent. However, this NN estimator includes a bias term that may vanish at a rate slower than root-$n$, preventing root-$n$ consistency in general. In this article, we propose a bias correction approach that overcomes this limitation, yielding an NN-based estimator that is both root-$n$ consistent and asymptotically normal.

Suggested Citation

  • Mona Azadkia & Leihao Chen & Fang Han, 2025. "Bias correction for Chatterjee's graph-based correlation coefficient," Papers 2508.09040, arXiv.org.
  • Handle: RePEc:arx:papers:2508.09040
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2508.09040
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. H Shi & M Drton & F Han, 2022. "On the power of Chatterjee’s rank correlation [Adaptive test of independence based on HSIC measures]," Biometrika, Biometrika Trust, vol. 109(2), pages 317-333.
    2. Zhexiao Lin & Fang Han, 2024. "On the failure of the bootstrap for Chatterjee’s rank correlation," Biometrika, Biometrika Trust, vol. 111(3), pages 1063-1070.
    3. Belloni, Alexandre & Chernozhukov, Victor & Chetverikov, Denis & Kato, Kengo, 2015. "Some new asymptotic theory for least squares series: Pointwise and uniform results," Journal of Econometrics, Elsevier, vol. 186(2), pages 345-366.
    4. Zhexiao Lin & Peng Ding & Fang Han, 2023. "Estimation Based on Nearest Neighbor Matching: From Density Ratio to Average Treatment Effect," Econometrica, Econometric Society, vol. 91(6), pages 2187-2217, November.
    5. Xiaohong Chen & Timothy M. Christensen, 2018. "Optimal sup‐norm rates and uniform inference on nonlinear functionals of nonparametric IV regression," Quantitative Economics, Econometric Society, vol. 9(1), pages 39-84, March.
    6. Alberto Abadie & Guido W. Imbens, 2011. "Bias-Corrected Matching Estimators for Average Treatment Effects," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 29(1), pages 1-11, January.
    7. Holger Dette & Karl F. Siburg & Pavel A. Stoimenov, 2013. "A Copula-Based Non-parametric Measure of Regression Dependence," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 40(1), pages 21-41, March.
    8. Sourav Chatterjee, 2021. "A New Coefficient of Correlation," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 116(536), pages 2009-2022, October.
    9. H Dette & M Kroll, 2025. "A simple bootstrap for Chatterjee’s rank correlation," Biometrika, Biometrika Trust, vol. 112(1), pages 3070-3102.
    10. Azadkia, Mona & Chatterjee, Sourav, 2021. "A simple measure of conditional dependence," LSE Research Online Documents on Economics 125584, London School of Economics and Political Science, LSE Library.
    11. Newey, Whitney K., 1997. "Convergence rates and asymptotic normality for series estimators," Journal of Econometrics, Elsevier, vol. 79(1), pages 147-168, July.
    12. Mauricio Olivares & Tomasz Olma & Daniel Wilhelm, 2025. "A Powerful Bootstrap Test of Independence in High Dimensions," Papers 2503.21715, arXiv.org, revised Apr 2025.
    13. Lin, Zhexiao & Han, Fang, 2025. "On regression-adjusted imputation estimators of average treatment effects," Journal of Econometrics, Elsevier, vol. 251(C).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Fang Han, 2024. "An Introduction to Permutation Processes (version 0.5)," Papers 2407.09664, arXiv.org.
    2. Ziming Lin & Fang Han, 2024. "On the consistency of bootstrap for matching estimators," Papers 2410.23525, arXiv.org, revised Nov 2024.
    3. Matias D. Cattaneo & Fang Han & Zhexiao Lin, 2023. "On Rosenbaum's Rank-based Matching Estimator," Papers 2312.07683, arXiv.org, revised Jan 2024.
    4. Christoph Breunig & Peter Haan, 2018. "Nonparametric Regression with Selectively Missing Covariates," Papers 1810.00411, arXiv.org, revised Oct 2020.
    5. Nan Liu & Yanbo Liu & Yuya Sasaki, 2024. "Estimation and Inference for Causal Functions with Multiway Clustered Data," Papers 2409.06654, arXiv.org.
    6. Difang Huang & Jiti Gao & Tatsushi Oka, 2025. "Semiparametric single-index estimation for average treatment effects," Econometric Reviews, Taylor & Francis Journals, vol. 44(6), pages 843-885, July.
    7. Zhexiao Lin & Fang Han, 2023. "On the failure of the bootstrap for Chatterjee's rank correlation," Papers 2303.14088, arXiv.org, revised Apr 2023.
    8. Li, Jia & Liao, Zhipeng, 2020. "Uniform nonparametric inference for time series," Journal of Econometrics, Elsevier, vol. 219(1), pages 38-51.
    9. Breunig, Christoph, 2021. "Varying random coefficient models," Journal of Econometrics, Elsevier, vol. 221(2), pages 381-408.
    10. Zhang, Qingyang, 2023. "On the asymptotic null distribution of the symmetrized Chatterjee’s correlation coefficient," Statistics & Probability Letters, Elsevier, vol. 194(C).
    11. Liu, Lin & Mukherjee, Rajarshi & Robins, James M., 2024. "Assumption-lean falsification tests of rate double-robustness of double-machine-learning estimators," Journal of Econometrics, Elsevier, vol. 240(2).
    12. Breunig, Christoph & Haan, Peter, 2021. "Nonparametric regression with selectively missing covariates," Journal of Econometrics, Elsevier, vol. 223(1), pages 28-52.
    13. Samuele Centorrino & Aman Ullah & Jing Xue, 2019. "Semiparametric Estimation of Correlated Random Coefficient Models without Instrumental Variables," Papers 1911.06857, arXiv.org.
    14. Zheng Fang & Juwon Seo, 2019. "A Projection Framework for Testing Shape Restrictions That Form Convex Cones," Papers 1910.07689, arXiv.org, revised Sep 2021.
    15. Breunig, Christoph & Mammen, Enno & Simoni, Anna, 2018. "Nonparametric estimation in case of endogenous selection," Journal of Econometrics, Elsevier, vol. 202(2), pages 268-285.
    16. Feng, Junlong & Lee, Sokbae, 2025. "Individual welfare analysis: Random quasilinear utility, independence, and confidence bounds," Journal of Econometrics, Elsevier, vol. 247(C).
    17. Michael Jansson & Demian Pouzo, 2017. "Towards a General Large Sample Theory for Regularized Estimators," Papers 1712.07248, arXiv.org, revised Jul 2020.
    18. Christoph Breunig & Xiaohong Chen, 2020. "Adaptive, Rate-Optimal Hypothesis Testing in Nonparametric IV Models," Papers 2006.09587, arXiv.org, revised Nov 2024.
    19. Yihui He & Fang Han, 2023. "On propensity score matching with a diverging number of matches," Papers 2310.14142, arXiv.org, revised Nov 2023.
    20. Christoph Breunig & Stefan Hoderlein, 2016. "Nonparametric Specification Testing in Random Parameter Models," Boston College Working Papers in Economics 897, Boston College Department of Economics.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2508.09040. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.