IDEAS home Printed from https://ideas.repec.org/a/taf/jnlasa/v110y2015i512p1726-1734.html
   My bibliography  Save this article

Conditional Distance Correlation

Author

Listed:
  • Xueqin Wang
  • Wenliang Pan
  • Wenhao Hu
  • Yuan Tian
  • Heping Zhang

Abstract

Statistical inference on conditional dependence is essential in many fields including genetic association studies and graphical models. The classic measures focus on linear conditional correlations and are incapable of characterizing nonlinear conditional relationship including nonmonotonic relationship. To overcome this limitation, we introduce a nonparametric measure of conditional dependence for multivariate random variables with arbitrary dimensions. Our measure possesses the necessary and intuitive properties as a correlation index. Briefly, it is zero almost surely if and only if two multivariate random variables are conditionally independent given a third random variable. More importantly, the sample version of this measure can be expressed elegantly as the root of a V or U-process with random kernels and has desirable theoretical properties. Based on the sample version, we propose a test for conditional independence, which is proven to be more powerful than some recently developed tests through our numerical simulations. The advantage of our test is even greater when the relationship between the multivariate random variables given the third random variable cannot be expressed in a linear or monotonic function of one random variable versus the other. We also show that the sample measure is consistent and weakly convergent, and the test statistic is asymptotically normal. By applying our test in a real data analysis, we are able to identify two conditionally associated gene expressions, which otherwise cannot be revealed. Thus, our measure of conditional dependence is not only an ideal concept, but also has important practical utility. Supplementary materials for this article are available online.

Suggested Citation

  • Xueqin Wang & Wenliang Pan & Wenhao Hu & Yuan Tian & Heping Zhang, 2015. "Conditional Distance Correlation," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(512), pages 1726-1734, December.
  • Handle: RePEc:taf:jnlasa:v:110:y:2015:i:512:p:1726-1734
    DOI: 10.1080/01621459.2014.993081
    as

    Download full text from publisher

    File URL: http://hdl.handle.net/10.1080/01621459.2014.993081
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1080/01621459.2014.993081?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Schick, Anton, 1997. "On U-statistics with random kernels," Statistics & Probability Letters, Elsevier, vol. 34(3), pages 275-283, June.
    2. Su, Liangjun & White, Halbert, 2007. "A consistent characteristic function-based test for conditional independence," Journal of Econometrics, Elsevier, vol. 141(2), pages 807-834, December.
    3. Fan, Yanqin & Li, Qi, 1996. "Consistent Model Specification Tests: Omitted Variables and Semiparametric Functional Forms," Econometrica, Econometric Society, vol. 64(4), pages 865-890, July.
    4. Scutari, Marco, 2010. "Learning Bayesian Networks with the bnlearn R Package," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 35(i03).
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Yi Liu & Qihua Wang, 2018. "Model-free feature screening for ultrahigh-dimensional data conditional on some variables," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 70(2), pages 283-301, April.
    2. Ke, Chenlu & Yang, Wei & Yuan, Qingcong & Li, Lu, 2023. "Partial sufficient variable screening with categorical controls," Computational Statistics & Data Analysis, Elsevier, vol. 187(C).
    3. Shi, Chengchun & Xu, Tianlin & Bergsma, Wicher & Li, Lexin, 2021. "Double generative adversarial networks for conditional independence testing," LSE Research Online Documents on Economics 112550, London School of Economics and Political Science, LSE Library.
    4. Bianchi, Pascal & Elgui, Kevin & Portier, François, 2023. "Conditional independence testing via weighted partial copulas," Journal of Multivariate Analysis, Elsevier, vol. 193(C).
    5. Jun Lu & Lu Lin, 2020. "Model-free conditional screening via conditional distance correlation," Statistical Papers, Springer, vol. 61(1), pages 225-244, February.
    6. Yuan, Qingcong & Chen, Xianyan & Ke, Chenlu & Yin, Xiangrong, 2022. "Independence index sufficient variable screening for categorical responses," Computational Statistics & Data Analysis, Elsevier, vol. 174(C).
    7. Zhou, Yeqing & Liu, Jingyuan & Zhu, Liping, 2020. "Test for conditional independence with application to conditional screening," Journal of Multivariate Analysis, Elsevier, vol. 175(C).
    8. Dominic Edelmann & Thomas Welchowski & Axel Benner, 2022. "A consistent version of distance covariance for right‐censored survival data and its application in hypothesis testing," Biometrics, The International Biometric Society, vol. 78(3), pages 867-879, September.
    9. Cencheng Shen & Joshua T. Vogelstein, 2021. "The exact equivalence of distance and kernel methods in hypothesis testing," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 105(3), pages 385-403, September.
    10. Maomao Ding & Ruosha Li & Jin Qin & Jing Ning, 2023. "A double‐robust test for high‐dimensional gene coexpression networks conditioning on clinical information," Biometrics, The International Biometric Society, vol. 79(4), pages 3227-3238, December.
    11. Xinyi Xu & Jingxiao Zhang, 2020. "Groupwise sufficient dimension reduction via conditional distance clustering," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 83(2), pages 217-242, February.
    12. Soale, Abdul-Nasah, 2023. "Projection expectile regression for sufficient dimension reduction," Computational Statistics & Data Analysis, Elsevier, vol. 180(C).
    13. Xuehu Zhu & Jun Lu & Jun Zhang & Lixing Zhu, 2021. "Testing for conditional independence: A groupwise dimension reduction‐based adaptive‐to‐model approach," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 48(2), pages 549-576, June.
    14. Fan, Jianqing & Feng, Yang & Xia, Lucy, 2020. "A projection-based conditional dependence measure with applications to high-dimensional undirected graphical models," Journal of Econometrics, Elsevier, vol. 218(1), pages 119-139.
    15. Dingke Tang & Dehan Kong & Wenliang Pan & Linbo Wang, 2023. "Ultra‐high dimensional variable selection for doubly robust causal inference," Biometrics, The International Biometric Society, vol. 79(2), pages 903-914, June.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Taoufik Bouezmarni & Jeroen V.K. Rombouts & Abderrahim Taamouti, 2011. "Nonparametric Copula-Based Test for Conditional Independence with Applications to Granger Causality," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 30(2), pages 275-287, October.
    2. Xuehu Zhu & Jun Lu & Jun Zhang & Lixing Zhu, 2021. "Testing for conditional independence: A groupwise dimension reduction‐based adaptive‐to‐model approach," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 48(2), pages 549-576, June.
    3. Nianqing Liu & Quang Vuong & Haiqing Xu, 2012. "Rationalization and Identification of Discrete Games with Correlated Types," Department of Economics Working Papers 130915, The University of Texas at Austin, Department of Economics.
    4. Liu, Nianqing & Vuong, Quang & Xu, Haiqing, 2017. "Rationalization and identification of binary games with correlated types," Journal of Econometrics, Elsevier, vol. 201(2), pages 249-268.
    5. Marcelo Fernandes & Breno Neri, 2010. "Nonparametric Entropy-Based Tests of Independence Between Stochastic Processes," Econometric Reviews, Taylor & Francis Journals, vol. 29(3), pages 276-306.
    6. El Ghouch, Anouar & Genton, Marc G. & Bouezmarni , Taoufik, 2012. "Measuring the Discrepancy of a Parametric Model via Local Polynomial Smoothing," LIDAM Discussion Papers ISBA 2012001, Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA).
    7. Taoufik Bouezmarni & Mohamed Doukali & Abderrahim Taamouti, 2023. "Testing Granger Non-Causality in Expectiles," University of East Anglia School of Economics Working Paper Series 2023-02, School of Economics, University of East Anglia, Norwich, UK..
    8. Jinhyun Lee, 2013. "A Consistent Nonparametric Bootstrap Test of Exogeneity," Discussion Paper Series, School of Economics and Finance 201316, School of Economics and Finance, University of St Andrews.
    9. Dante Amengual & Marine Carrasco & Enrique Sentana, 2017. "Testing Distributional Assumptions Using a Continuum of Moments," Working Papers wp2018_1709, CEMFI.
    10. Prabal Das & D. A. Sachindra & Kironmala Chanda, 2022. "Machine Learning-Based Rainfall Forecasting with Multiple Non-Linear Feature Selection Algorithms," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 36(15), pages 6043-6071, December.
    11. Nishiyama, Yoshihiko & Hitomi, Kohtaro & Kawasaki, Yoshinori & Jeong, Kiho, 2011. "A consistent nonparametric test for nonlinear causality—Specification in time series regression," Journal of Econometrics, Elsevier, vol. 165(1), pages 112-127.
    12. Vuong, Quan-Hoang & La, Viet-Phuong, 2019. "The bayesvl R package. User guide v0.8.1," OSF Preprints w5dx6, Center for Open Science.
    13. F. Cugnata & G. Perucca & S. Salini, 2017. "Bayesian networks and the assessment of universities' value added," Journal of Applied Statistics, Taylor & Francis Journals, vol. 44(10), pages 1785-1806, July.
    14. Patrick Saart & Jiti Gao & Nam Hyun Kim, 2014. "Semiparametric methods in nonlinear time series analysis: a selective review," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 26(1), pages 141-169, March.
    15. Temel, Tugrul T., 2001. "A Nonparametric Hypothesis Test Via The Bootstrap Resampling," 2001 Annual meeting, August 5-8, Chicago, IL 20600, American Agricultural Economics Association (New Name 2008: Agricultural and Applied Economics Association).
    16. E. Zacharias & T. Stengos, 2006. "Intertemporal pricing and price discrimination: a semiparametric hedonic analysis of the personal computer market," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 21(3), pages 371-386.
    17. Oliver Linton & Pedro Gozalo, 1996. "Conditional Independence Restrictions: Testing and Estimation," Cowles Foundation Discussion Papers 1140, Cowles Foundation for Research in Economics, Yale University.
    18. Chen, Bin & Song, Zhaogang, 2013. "Testing whether the underlying continuous-time process follows a diffusion: An infinitesimal operator-based approach," Journal of Econometrics, Elsevier, vol. 173(1), pages 83-107.
    19. Fernandes, Marcelo & Grammig, Joachim, 2005. "Nonparametric specification tests for conditional duration models," Journal of Econometrics, Elsevier, vol. 127(1), pages 35-68, July.
    20. Roland R. Ramsahai, 2020. "Connecting actuarial judgment to probabilistic learning techniques with graph theory," Papers 2007.15475, arXiv.org.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:taf:jnlasa:v:110:y:2015:i:512:p:1726-1734. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Longhurst (email available below). General contact details of provider: http://www.tandfonline.com/UASA20 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.