IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v13y2022i1d10.1038_s41467-022-34932-z.html
   My bibliography  Save this article

GhostKnockoff inference empowers identification of putative causal variants in genome-wide association studies

Author

Listed:
  • Zihuai He

    (Stanford University
    Stanford University)

  • Linxi Liu

    (University of Pittsburgh)

  • Michael E. Belloy

    (Stanford University)

  • Yann Guen

    (Stanford University
    Institut du Cerveau - Paris Brain Institute - ICM)

  • Aaron Sossin

    (Stanford University)

  • Xiaoxia Liu

    (Stanford University)

  • Xinran Qi

    (Stanford University)

  • Shiyang Ma

    (Columbia University)

  • Prashnna K. Gyawali

    (Stanford University)

  • Tony Wyss-Coray

    (Stanford University)

  • Hua Tang

    (Stanford University)

  • Chiara Sabatti

    (Stanford University)

  • Emmanuel Candès

    (Stanford University
    Stanford University)

  • Michael D. Greicius

    (Stanford University)

  • Iuliana Ionita-Laza

    (Columbia University)

Abstract

Recent advances in genome sequencing and imputation technologies provide an exciting opportunity to comprehensively study the contribution of genetic variants to complex phenotypes. However, our ability to translate genetic discoveries into mechanistic insights remains limited at this point. In this paper, we propose an efficient knockoff-based method, GhostKnockoff, for genome-wide association studies (GWAS) that leads to improved power and ability to prioritize putative causal variants relative to conventional GWAS approaches. The method requires only Z-scores from conventional GWAS and hence can be easily applied to enhance existing and future studies. The method can also be applied to meta-analysis of multiple GWAS allowing for arbitrary sample overlap. We demonstrate its performance using empirical simulations and two applications: (1) a meta-analysis for Alzheimer’s disease comprising nine overlapping large-scale GWAS, whole-exome and whole-genome sequencing studies and (2) analysis of 1403 binary phenotypes from the UK Biobank data in 408,961 samples of European ancestry. Our results demonstrate that GhostKnockoff can identify putatively functional variants with weaker statistical effects that are missed by conventional association tests.

Suggested Citation

  • Zihuai He & Linxi Liu & Michael E. Belloy & Yann Guen & Aaron Sossin & Xiaoxia Liu & Xinran Qi & Shiyang Ma & Prashnna K. Gyawali & Tony Wyss-Coray & Hua Tang & Chiara Sabatti & Emmanuel Candès & Mich, 2022. "GhostKnockoff inference empowers identification of putative causal variants in genome-wide association studies," Nature Communications, Nature, vol. 13(1), pages 1-16, December.
  • Handle: RePEc:nat:natcom:v:13:y:2022:i:1:d:10.1038_s41467-022-34932-z
    DOI: 10.1038/s41467-022-34932-z
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-022-34932-z
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-022-34932-z?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Matteo Sesia & Eugene Katsevich & Stephen Bates & Emmanuel Candès & Chiara Sabatti, 2020. "Multi-resolution localization of causal variants across the genome," Nature Communications, Nature, vol. 11(1), pages 1-10, December.
    2. M Sesia & C Sabatti & E J Candès, 2019. "Gene hunting with hidden Markov model knockoffs," Biometrika, Biometrika Trust, vol. 106(1), pages 1-18.
    3. Konrad J. Karczewski & Laurent C. Francioli & Grace Tiao & Beryl B. Cummings & Jessica Alföldi & Qingbo Wang & Ryan L. Collins & Kristen M. Laricchia & Andrea Ganna & Daniel P. Birnbaum & Laura D. Gau, 2020. "The mutational constraint spectrum quantified from variation in 141,456 humans," Nature, Nature, vol. 581(7809), pages 434-443, May.
    4. Alex Wells & David Heckerman & Ali Torkamani & Li Yin & Jonathan Sebat & Bing Ren & Amalio Telenti & Julia Iulio, 2019. "Ranking of non-coding pathogenic variants and putative essential regions of the human genome," Nature Communications, Nature, vol. 10(1), pages 1-9, December.
    5. Emmanuel Candès & Yingying Fan & Lucas Janson & Jinchi Lv, 2018. "Panning for gold: ‘model‐X’ knockoffs for high dimensional controlled variable selection," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 80(3), pages 551-577, June.
    6. M Sesia & C Sabatti & E J Candès, 2019. "Rejoinder: ‘Gene hunting with hidden Markov model knockoffs’," Biometrika, Biometrika Trust, vol. 106(1), pages 35-45.
    7. Daniel Taliun & Daniel N. Harris & Michael D. Kessler & Jedidiah Carlson & Zachary A. Szpiech & Raul Torres & Sarah A. Gagliano Taliun & André Corvelo & Stephanie M. Gogarten & Hyun Min Kang & Achille, 2021. "Sequencing of 53,831 diverse genomes from the NHLBI TOPMed Program," Nature, Nature, vol. 590(7845), pages 290-299, February.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Emmanuel Candès & Chiara Sabatti, 2020. "Discussion of the Paper “Prediction, Estimation, and Attribution” by B. Efron," International Statistical Review, International Statistical Institute, vol. 88(S1), pages 60-63, December.
    2. Nikolaos Ignatiadis & Wolfgang Huber, 2021. "Covariate powered cross‐weighted multiple testing," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 83(4), pages 720-751, September.
    3. Ruth Heller, 2020. "Comments on: Hierarchical inference for genome-wide association studies: a view on methodology with software," Computational Statistics, Springer, vol. 35(1), pages 51-55, March.
    4. Quan Sun & Bryce T. Rowland & Jiawen Chen & Anna V. Mikhaylova & Christy Avery & Ulrike Peters & Jessica Lundin & Tara Matise & Steve Buyske & Ran Tao & Rasika A. Mathias & Alexander P. Reiner & Paul , 2024. "Improving polygenic risk prediction in admixed populations by explicitly modeling ancestral-differential effects via GAUDI," Nature Communications, Nature, vol. 15(1), pages 1-14, December.
    5. L Bottolo & S Richardson, 2019. "Discussion of ‘Gene hunting with hidden Markov model knockoffs’," Biometrika, Biometrika Trust, vol. 106(1), pages 19-22.
    6. Asmundur Oddsson & Patrick Sulem & Gardar Sveinbjornsson & Gudny A. Arnadottir & Valgerdur Steinthorsdottir & Gisli H. Halldorsson & Bjarni A. Atlason & Gudjon R. Oskarsson & Hannes Helgason & Henriet, 2023. "Deficit of homozygosity among 1.52 million individuals and genetic causes of recessive lethality," Nature Communications, Nature, vol. 14(1), pages 1-15, December.
    7. Vincent Michaud & Eulalie Lasseaux & David J. Green & Dave T. Gerrard & Claudio Plaisant & Tomas Fitzgerald & Ewan Birney & Benoît Arveiler & Graeme C. Black & Panagiotis I. Sergouniotis, 2022. "The contribution of common regulatory and protein-coding TYR variants to the genetic architecture of albinism," Nature Communications, Nature, vol. 13(1), pages 1-8, December.
    8. Laura M. Mueller & Abigail Isaacson & Heather Wilson & Anna Salowka & Isabel Tay & Maolian Gong & Nancy Samir Elbarbary & Klemens Raile & Francesca M. Spagnoli, 2024. "Heterozygous missense variant in GLI2 impairs human endocrine pancreas development," Nature Communications, Nature, vol. 15(1), pages 1-13, December.
    9. Emre Demirkaya & Yang Feng & Pallavi Basu & Jinchi Lv, 2022. "Large-scale model selection in misspecified generalized linear models [Information theory and an extension of the maximum likelihood principle]," Biometrika, Biometrika Trust, vol. 109(1), pages 123-136.
    10. Mathias Seviiri & Matthew H. Law & Jue-Sheng Ong & Puya Gharahkhani & Pierre Fontanillas & Catherine M. Olsen & David C. Whiteman & Stuart MacGregor, 2022. "A multi-phenotype analysis reveals 19 susceptibility loci for basal cell carcinoma and 15 for squamous cell carcinoma," Nature Communications, Nature, vol. 13(1), pages 1-14, December.
    11. Alexendar R. Perez & Laura Sala & Richard K. Perez & Joana A. Vidigal, 2021. "CSC software corrects off-target mediated gRNA depletion in CRISPR-Cas9 essentiality screens," Nature Communications, Nature, vol. 12(1), pages 1-11, December.
    12. Elena V. Feofanova & Michael R. Brown & Taryn Alkis & Astrid M. Manuel & Xihao Li & Usman A. Tahir & Zilin Li & Kevin M. Mendez & Rachel S. Kelly & Qibin Qi & Han Chen & Martin G. Larson & Rozenn N. L, 2023. "Whole-Genome Sequencing Analysis of Human Metabolome in Multi-Ethnic Populations," Nature Communications, Nature, vol. 14(1), pages 1-12, December.
    13. Gaëlle Odelin & Adèle Faucherre & Damien Marchese & Amélie Pinard & Hager Jaouadi & Solena Scouarnec & Raphaël Chiarelli & Younes Achouri & Emilie Faure & Marine Herbane & Alexis Théron & Jean-Françoi, 2023. "Variations in the poly-histidine repeat motif of HOXA1 contribute to bicuspid aortic valve in mouse and zebrafish," Nature Communications, Nature, vol. 14(1), pages 1-17, December.
    14. Matthew Tegtmeyer & Jatin Arora & Samira Asgari & Beth A. Cimini & Ajay Nadig & Emily Peirent & Dhara Liyanage & Gregory P. Way & Erin Weisbart & Aparna Nathan & Tiffany Amariuta & Kevin Eggan & Marzi, 2024. "High-dimensional phenotyping to define the genetic basis of cellular morphology," Nature Communications, Nature, vol. 15(1), pages 1-12, December.
    15. Erik Schoenmakers & Federica Marelli & Helle F. Jørgensen & W. Edward Visser & Carla Moran & Stefan Groeneweg & Carolina Avalos & Sean J. Jurgens & Nichola Figg & Alison Finigan & Neha Wali & Maura Ag, 2023. "Selenoprotein deficiency disorder predisposes to aortic aneurysm formation," Nature Communications, Nature, vol. 14(1), pages 1-14, December.
    16. Sarah E. Garnish & Katherine R. Martin & Maria Kauppi & Victoria E. Jackson & Rebecca Ambrose & Vik Ven Eng & Shene Chiou & Yanxiang Meng & Daniel Frank & Emma C. Tovey Crutchfield & Komal M. Patel & , 2023. "A common human MLKL polymorphism confers resistance to negative regulation by phosphorylation," Nature Communications, Nature, vol. 14(1), pages 1-17, December.
    17. Xiaoyi Raymond Gao & Marion Chiariglione & Alexander J. Arch, 2022. "Whole-exome sequencing study identifies rare variants and genes associated with intraocular pressure and glaucoma," Nature Communications, Nature, vol. 13(1), pages 1-10, December.
    18. Shaan Khurshid & Julieta Lazarte & James P. Pirruccello & Lu-Chen Weng & Seung Hoan Choi & Amelia W. Hall & Xin Wang & Samuel F. Friedman & Victor Nauffal & Kiran J. Biddinger & Krishna G. Aragam & Pu, 2023. "Clinical and genetic associations of deep learning-derived cardiac magnetic resonance-based left ventricular mass," Nature Communications, Nature, vol. 14(1), pages 1-11, December.
    19. Javier Rodríguez-Ubreva & Anna Arutyunyan & Marc Jan Bonder & Lucía Del Pino-Molina & Stephen J. Clark & Carlos de la Calle-Fabregat & Luz Garcia-Alonso & Louis-François Handfield & Laura Ciudad & Edu, 2022. "Single-cell Atlas of common variable immunodeficiency shows germinal center-associated epigenetic dysregulation in B-cell responses," Nature Communications, Nature, vol. 13(1), pages 1-18, December.
    20. Michel S. Naslavsky & Marilia O. Scliar & Guilherme L. Yamamoto & Jaqueline Yu Ting Wang & Stepanka Zverinova & Tatiana Karp & Kelly Nunes & José Ricardo Magliocco Ceroni & Diego Lima Carvalho & Carlo, 2022. "Whole-genome sequencing of 1,171 elderly admixed individuals from Brazil," Nature Communications, Nature, vol. 13(1), pages 1-11, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:13:y:2022:i:1:d:10.1038_s41467-022-34932-z. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.