
Double generative adversarial networks for conditional independence testing

Author

Listed:
  • Shi, Chengchun
  • Xu, Tianlin
  • Bergsma, Wicher
  • Li, Lexin

Abstract

In this article, we study high-dimensional conditional independence testing, a key building block in statistics and machine learning. We propose an inferential procedure based on double generative adversarial networks (GANs). Specifically, we first introduce a double GANs framework to learn two generators of the conditional distributions. We then integrate the two generators to construct a test statistic, which takes the form of the maximum of generalized covariance measures of multiple transformation functions. We also employ data-splitting and cross-fitting to minimize the conditions that the generators must satisfy to achieve the desired asymptotic properties, and employ the multiplier bootstrap to obtain the corresponding p-value. We show that the constructed test statistic is doubly robust, and that the resulting test both controls the type-I error and has power approaching one asymptotically. Notably, we establish these theoretical guarantees under much weaker and practically more feasible conditions than those of existing tests, and our proposal gives a concrete example of how to use state-of-the-art deep learning tools, such as GANs, to help address a classical but challenging statistical problem. We demonstrate the efficacy of our test through both simulations and an application to an anti-cancer drug dataset.
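
The statistic described in the abstract (a maximum of generalized covariance measures over several transformation functions, calibrated by a multiplier bootstrap) can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes that the two GAN generators are replaced by known Gaussian conditional samplers, it omits data-splitting and cross-fitting, and the names `max_gcm_test` and `gaussian_sampler` and the choice of transformation functions are hypothetical.

```python
import numpy as np

def gaussian_sampler(scale):
    # Stand-in for a trained generator (assumption): draws m pseudo-samples
    # per observation from N(z, scale^2). Returns an array of shape (m, n).
    return lambda z, m, rng: z + scale * rng.standard_normal((m, len(z)))

def max_gcm_test(x, y, z, sample_x, sample_y, funcs,
                 n_draws=100, n_boot=1000, seed=0):
    """Max-type generalized covariance test of X independent of Y given Z."""
    rng = np.random.default_rng(seed)
    n = len(x)
    x_gen = sample_x(z, n_draws, rng)  # pseudo-samples of X given Z
    y_gen = sample_y(z, n_draws, rng)  # pseudo-samples of Y given Z
    cols = []
    for h in funcs:
        # Residual of h(X) after removing a Monte Carlo estimate of E[h(X) | Z]
        rx = h(x) - h(x_gen).mean(axis=0)
        for g in funcs:
            ry = g(y) - g(y_gen).mean(axis=0)
            cols.append(rx * ry)
    psi = np.column_stack(cols)        # shape (n, number of function pairs)
    sd = psi.std(axis=0, ddof=1)
    # Observed statistic: largest standardized covariance measure
    stat = np.max(np.abs(np.sqrt(n) * psi.mean(axis=0) / sd))
    # Multiplier bootstrap: reweight centred terms with N(0, 1) multipliers
    centred = (psi - psi.mean(axis=0)) / sd
    mult = rng.standard_normal((n_boot, n))
    boot = np.max(np.abs(mult @ centred) / np.sqrt(n), axis=1)
    p_value = (1 + np.sum(boot >= stat)) / (1 + n_boot)
    return stat, p_value

# Toy data: X and Y both depend on Z; under the alternative Y also
# depends on the part of X that Z does not explain.
rng = np.random.default_rng(1)
n = 500
z = rng.standard_normal(n)
e1, e2 = rng.standard_normal(n), rng.standard_normal(n)
x = z + e1
y_null = z + e2
y_alt = z + 0.8 * e1 + e2

funcs = [lambda t: t, np.tanh, np.sin]  # hypothetical transformation functions
stat_null, p_null = max_gcm_test(x, y_null, z, gaussian_sampler(1.0),
                                 gaussian_sampler(1.0), funcs)
stat_alt, p_alt = max_gcm_test(x, y_alt, z, gaussian_sampler(1.0),
                               gaussian_sampler(np.sqrt(1.64)), funcs)
print("null:", stat_null, p_null)
print("alternative:", stat_alt, p_alt)
```

In this toy setting the samplers are exact by construction, so the sketch says nothing about how the test behaves when the generators are learned from data, which is the case the paper's theory addresses.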

Suggested Citation

  • Shi, Chengchun & Xu, Tianlin & Bergsma, Wicher & Li, Lexin, 2021. "Double generative adversarial networks for conditional independence testing," LSE Research Online Documents on Economics 112550, London School of Economics and Political Science, LSE Library.
  • Handle: RePEc:ehl:lserod:112550

    Download full text from publisher

    File URL: http://eprints.lse.ac.uk/112550/
    File Function: Open access version.
    Download Restriction: no

    More about this item

    Keywords

    conditional independence; double-robustness; generalized covariance measure; generative adversarial networks; multiplier bootstrap

    JEL classification:

    • C1 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General
