IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v12y2021i1d10.1038_s41467-021-26114-0.html
   My bibliography  Save this article

Calibrated rare variant genetic risk scores for complex disease prediction using large exome sequence repositories

Author

Listed:
  • Ricky Lali

    (Vascular and Stroke Research Institute
    Faculty of Health Sciences)

  • Michael Chong

    (Vascular and Stroke Research Institute
    Faculty of Health Sciences)

  • Arghavan Omidi

    (Vascular and Stroke Research Institute)

  • Pedrum Mohammadi-Shemirani

    (Vascular and Stroke Research Institute
    Faculty of Health Sciences)

  • Ann Le

    (Vascular and Stroke Research Institute
    Faculty of Health Sciences)

  • Edward Cui

    (Vascular and Stroke Research Institute)

  • Guillaume Paré

    (Vascular and Stroke Research Institute
    Faculty of Health Sciences
    Faculty of Health Sciences
    Faculty of Health Sciences)

Abstract

Rare variants are collectively numerous and may underlie a considerable proportion of complex disease risk. However, identifying genuine rare variant associations is challenging due to small effect sizes, presence of technical artefacts, and heterogeneity in population structure. We hypothesize that rare variant burden over a large number of genes can be combined into a predictive rare variant genetic risk score (RVGRS). We propose a method (RV-EXCALIBER) that leverages summary-level data from a large public exome sequencing database (gnomAD) as controls and robustly calibrates rare variant burden to account for the aforementioned biases. A calibrated RVGRS strongly associates with coronary artery disease (CAD) in European and South Asian populations by capturing the aggregate effect of rare variants through a polygenic model of inheritance. The RVGRS identifies 1.5% of the population with substantial risk of early CAD and confers risk even when adjusting for known Mendelian CAD genes, clinical risk factors, and a common variant genetic risk score.

Suggested Citation

  • Ricky Lali & Michael Chong & Arghavan Omidi & Pedrum Mohammadi-Shemirani & Ann Le & Edward Cui & Guillaume Paré, 2021. "Calibrated rare variant genetic risk scores for complex disease prediction using large exome sequence repositories," Nature Communications, Nature, vol. 12(1), pages 1-15, December.
  • Handle: RePEc:nat:natcom:v:12:y:2021:i:1:d:10.1038_s41467-021-26114-0
    DOI: 10.1038/s41467-021-26114-0
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-021-26114-0
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-021-26114-0?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. van Buuren, Stef & Groothuis-Oudshoorn, Karin, 2011. "mice: Multivariate Imputation by Chained Equations in R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 45(i03).
    2. Konrad J. Karczewski & Laurent C. Francioli & Grace Tiao & Beryl B. Cummings & Jessica Alföldi & Qingbo Wang & Ryan L. Collins & Kristen M. Laricchia & Andrea Ganna & Daniel P. Birnbaum & Laura D. Gau, 2020. "The mutational constraint spectrum quantified from variation in 141,456 humans," Nature, Nature, vol. 581(7809), pages 434-443, May.
    3. Ron Do & Nathan O. Stitziel & Hong-Hee Won & Anders Berg Jørgensen & Stefano Duga & Pier Angelica Merlini & Adam Kiezun & Martin Farrall & Anuj Goel & Or Zuk & Illaria Guella & Rosanna Asselta & Lesli, 2015. "Exome sequencing identifies rare LDLR and APOA5 alleles conferring risk for myocardial infarction," Nature, Nature, vol. 518(7537), pages 102-106, February.
    4. Monkol Lek & Konrad J. Karczewski & Eric V. Minikel & Kaitlin E. Samocha & Eric Banks & Timothy Fennell & Anne H. O’Donnell-Luria & James S. Ware & Andrew J. Hill & Beryl B. Cummings & Taru Tukiainen , 2016. "Analysis of protein-coding genetic variation in 60,706 humans," Nature, Nature, vol. 536(7616), pages 285-291, August.
    5. Audrey E Hendricks & Stephen C Billups & Hamish N C Pike & I Sadaf Farooqi & Eleftheria Zeggini & Stephanie A Santorico & Inês Barroso & Josée Dupuis, 2018. "ProxECAT: Proxy External Controls Association Test. A new case-control gene region association test using allele frequencies from public controls," PLOS Genetics, Public Library of Science, vol. 14(10), pages 1-14, October.
    6. L. Duncan & H. Shen & B. Gelaye & J. Meijsen & K. Ressler & M. Feldman & R. Peterson & B. Domingue, 2019. "Analysis of polygenic risk score usage and performance in diverse human populations," Nature Communications, Nature, vol. 10(1), pages 1-9, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Kian Hong Kock & Patrick K. Kimes & Stephen S. Gisselbrecht & Sachi Inukai & Sabrina K. Phanor & James T. Anderson & Gayatri Ramakrishnan & Colin H. Lipper & Dongyuan Song & Jesse V. Kurland & Julia M, 2024. "DNA binding analysis of rare variants in homeodomains reveals homeodomain specificity-determining residues," Nature Communications, Nature, vol. 15(1), pages 1-19, December.
    2. Ruoyu Tian & Tian Ge & Hyeokmoon Kweon & Daniel B. Rocha & Max Lam & Jimmy Z. Liu & Kritika Singh & Daniel F. Levey & Joel Gelernter & Murray B. Stein & Ellen A. Tsai & Hailiang Huang & Christopher F., 2024. "Whole-exome sequencing in UK Biobank reveals rare genetic architecture for depression," Nature Communications, Nature, vol. 15(1), pages 1-12, December.
    3. Iker Núñez-Carpintero & Maria Rigau & Mattia Bosio & Emily O’Connor & Sally Spendiff & Yoshiteru Azuma & Ana Topf & Rachel Thompson & Peter A. C. ’t Hoen & Teodora Chamova & Ivailo Tournev & Velina Gu, 2024. "Rare disease research workflow using multilayer networks elucidates the molecular determinants of severity in Congenital Myasthenic Syndromes," Nature Communications, Nature, vol. 15(1), pages 1-15, December.
    4. Young Jin Kim & Sanghoon Moon & Mi Yeong Hwang & Sohee Han & Hye-Mi Jang & Jinhwa Kong & Dong Mun Shin & Kyungheon Yoon & Sung Min Kim & Jong-Eun Lee & Anubha Mahajan & Hyun-Young Park & Mark I. McCar, 2022. "The contribution of common and rare genetic variants to variation in metabolic traits in 288,137 East Asians," Nature Communications, Nature, vol. 13(1), pages 1-13, December.
    5. Noah Dukler & Mehreen R. Mughal & Ritika Ramani & Yi-Fei Huang & Adam Siepel, 2022. "Extreme purifying selection against point mutations in the human genome," Nature Communications, Nature, vol. 13(1), pages 1-12, December.
    6. Sally J. Adua & Anna Arnal-Estapé & Minghui Zhao & Bowen Qi & Zongzhi Z. Liu & Carolyn Kravitz & Heather Hulme & Nicole Strittmatter & Francesc López-Giráldez & Sampada Chande & Alexandra E. Albert & , 2022. "Brain metastatic outgrowth and osimertinib resistance are potentiated by RhoA in EGFR-mutant lung cancer," Nature Communications, Nature, vol. 13(1), pages 1-17, December.
    7. Scott D. Findlay & Lindsay Romo & Christopher B. Burge, 2024. "Quantifying negative selection in human 3ʹ UTRs uncovers constrained targets of RNA-binding proteins," Nature Communications, Nature, vol. 15(1), pages 1-15, December.
    8. H. Serhat Tetikol & Deniz Turgut & Kubra Narci & Gungor Budak & Ozem Kalay & Elif Arslan & Sinem Demirkaya-Budak & Alexey Dolgoborodov & Duygu Kabakci-Zorlu & Vladimir Semenyuk & Amit Jain & Brandi N., 2022. "Pan-African genome demonstrates how population-specific genome graphs improve high-throughput sequencing data analysis," Nature Communications, Nature, vol. 13(1), pages 1-11, December.
    9. Jeffrey D. Wall & J. Fah Sathirapongsasuti & Ravi Gupta & Asif Rasheed & Radha Venkatesan & Saurabh Belsare & Ramesh Menon & Sameer Phalke & Anuradha Mittal & John Fang & Deepak Tanneeru & Manjari Des, 2023. "South Asian medical cohorts reveal strong founder effects and high rates of homozygosity," Nature Communications, Nature, vol. 14(1), pages 1-11, December.
    10. Ada J. S. Chan & Worrawat Engchuan & Miriam S. Reuter & Zhuozhi Wang & Bhooma Thiruvahindrapuram & Brett Trost & Thomas Nalpathamkalam & Carol Negrijn & Sylvia Lamoureux & Giovanna Pellecchia & Rohan , 2022. "Genome-wide rare variant score associates with morphological subtypes of autism spectrum disorder," Nature Communications, Nature, vol. 13(1), pages 1-16, December.
    11. Wenan Chen & Shuoguo Wang & Saima Sultana Tithi & David W. Ellison & Daniel J. Schaid & Gang Wu, 2022. "A rare variant analysis framework using public genotype summary counts to prioritize disease-predisposition genes," Nature Communications, Nature, vol. 13(1), pages 1-18, December.
    12. Stephanie O. Erjavec & Sahar Gelfman & Alexa R. Abdelaziz & Eunice Y. Lee & Isha Monga & Anna Alkelai & Iuliana Ionita-Laza & Lynn Petukhova & Angela M. Christiano, 2022. "Whole exome sequencing in Alopecia Areata identifies rare variants in KRT82," Nature Communications, Nature, vol. 13(1), pages 1-13, December.
    13. Ananyo Choudhury & Jean-Tristan Brandenburg & Tinashe Chikowore & Dhriti Sengupta & Palwende Romuald Boua & Nigel J. Crowther & Godfred Agongo & Gershim Asiki & F. Xavier Gómez-Olivé & Isaac Kisiangan, 2022. "Meta-analysis of sub-Saharan African studies provides insights into genetic architecture of lipid traits," Nature Communications, Nature, vol. 13(1), pages 1-13, December.
    14. Gudny A. Arnadottir & Asmundur Oddsson & Brynjar O. Jensson & Svanborg Gisladottir & Mariella T. Simon & Asgeir O. Arnthorsson & Hildigunnur Katrinardottir & Run Fridriksdottir & Erna V. Ivarsdottir &, 2022. "Population-level deficit of homozygosity unveils CPSF3 as an intellectual disability syndrome gene," Nature Communications, Nature, vol. 13(1), pages 1-9, December.
    15. Bian Li & Dan M. Roden & John A. Capra, 2022. "The 3D mutational constraint on amino acid sites in the human proteome," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    16. Saaket Agrawal & Minxian Wang & Marcus D. R. Klarqvist & Kirk Smith & Joseph Shin & Hesam Dashti & Nathaniel Diamant & Seung Hoan Choi & Sean J. Jurgens & Patrick T. Ellinor & Anthony Philippakis & Me, 2022. "Inherited basis of visceral, abdominal subcutaneous and gluteofemoral fat depots," Nature Communications, Nature, vol. 13(1), pages 1-17, December.
    17. Noémi Kreif & Richard Grieve & Iván Díaz & David Harrison, 2015. "Evaluation of the Effect of a Continuous Treatment: A Machine Learning Approach with an Application to Treatment for Traumatic Brain Injury," Health Economics, John Wiley & Sons, Ltd., vol. 24(9), pages 1213-1228, September.
    18. Abhilash Bandam & Eedris Busari & Chloi Syranidou & Jochen Linssen & Detlef Stolten, 2022. "Classification of Building Types in Germany: A Data-Driven Modeling Approach," Data, MDPI, vol. 7(4), pages 1-23, April.
    19. Asmundur Oddsson & Patrick Sulem & Gardar Sveinbjornsson & Gudny A. Arnadottir & Valgerdur Steinthorsdottir & Gisli H. Halldorsson & Bjarni A. Atlason & Gudjon R. Oskarsson & Hannes Helgason & Henriet, 2023. "Deficit of homozygosity among 1.52 million individuals and genetic causes of recessive lethality," Nature Communications, Nature, vol. 14(1), pages 1-15, December.
    20. Boonstra Philip S. & Little Roderick J.A. & West Brady T. & Andridge Rebecca R. & Alvarado-Leiton Fernanda, 2021. "A Simulation Study of Diagnostics for Selection Bias," Journal of Official Statistics, Sciendo, vol. 37(3), pages 751-769, September.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:12:y:2021:i:1:d:10.1038_s41467-021-26114-0. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.