IDEAS home Printed from https://ideas.repec.org/a/plo/pgen00/1011412.html
   My bibliography  Save this article

Prioritizing disease-related rare variants by integrating gene expression data

Author

Listed:
  • Hanmin Guo
  • Alexander Eckehart Urban
  • Wing Hung Wong

Abstract

Rare variants, comprising the vast majority of human genetic variations, are likely to have more deleterious impact in the context of human diseases compared to common variants. Here we present carrier statistic, a statistical framework to prioritize disease-related rare variants by integrating gene expression data. By quantifying the impact of rare variants on gene expression, carrier statistic can prioritize those rare variants that have large functional consequence in the patients. Through simulation studies and analyzing real multi-omics dataset, we demonstrated that carrier statistic is applicable in studies with limited sample size (a few hundreds) and achieves substantially higher sensitivity than existing rare variants association methods. Application to Alzheimer’s disease reveals 16 rare variants within 15 genes with extreme carrier statistics. We also found strong excess of rare variants among the top prioritized genes in patients compared to that in healthy individuals. The carrier statistic method can be applied to various rare variant types and is adaptable to other omics data modalities, offering a powerful tool for investigating the molecular mechanisms underlying complex diseases.Author summary: Existing rare variants association methods often lack statistical power when sample sizes are small. Here we propose a novel integrative statistical framework, the carrier statistic, which can leverage paired genotype and gene expression data to quantify the functional impact of rare variants and enhance detection power of rare variants responsible for disease. Extensive simulations demonstrate that carrier statistic provides well-calibrated false discovery rates, shows substantially higher sensitivity compared to existing methods, and remains robust under unbalanced case-control ratios. Through analyzing real multi-omics dataset for Alzheimer’s disease, we identified 16 rare variants within 15 genes with extreme carrier statistics. We hope that the results presented in this paper can highlight the promise of the carrier statistic approach and will encourage future disease studies to collect both genotype and gene expression data for the same individuals. As multi-omics and genome sequencing data continue to expand, we anticipate that carrier statistics will be a valuable tool for elucidating the molecular mechanism underlying human complex diseases.

Suggested Citation

  • Hanmin Guo & Alexander Eckehart Urban & Wing Hung Wong, 2024. "Prioritizing disease-related rare variants by integrating gene expression data," PLOS Genetics, Public Library of Science, vol. 20(9), pages 1-16, September.
  • Handle: RePEc:plo:pgen00:1011412
    DOI: 10.1371/journal.pgen.1011412
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosgenetics/article?id=10.1371/journal.pgen.1011412
    Download Restriction: no

    File URL: https://journals.plos.org/plosgenetics/article/file?id=10.1371/journal.pgen.1011412&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pgen.1011412?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Claudia Giambartolomei & Damjan Vukcevic & Eric E Schadt & Lude Franke & Aroon D Hingorani & Chris Wallace & Vincent Plagnol, 2014. "Bayesian Test for Colocalisation between Pairs of Genetic Association Studies Using Summary Statistics," PLOS Genetics, Public Library of Science, vol. 10(5), pages 1-15, May.
    2. Benjamin M Neale & Manuel A Rivas & Benjamin F Voight & David Altshuler & Bernie Devlin & Marju Orho-Melander & Sekar Kathiresan & Shaun M Purcell & Kathryn Roeder & Mark J Daly, 2011. "Testing for an Unusual Distribution of Rare Variants," PLOS Genetics, Public Library of Science, vol. 7(3), pages 1-8, March.
    3. Chris Wallace, 2021. "A more accurate method for colocalisation analysis allowing for multiple causal variants," PLOS Genetics, Public Library of Science, vol. 17(9), pages 1-11, September.
    4. Christopher DeBoever & Yosuke Tanigawa & Malene E. Lindholm & Greg McInnes & Adam Lavertu & Erik Ingelsson & Chris Chang & Euan A. Ashley & Carlos D. Bustamante & Mark J. Daly & Manuel A. Rivas, 2018. "Medical relevance of protein-truncating variants across 337,205 individuals in the UK Biobank study," Nature Communications, Nature, vol. 9(1), pages 1-10, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ichcha Manipur & Guillermo Reales & Jae Hoon Sul & Myung Kyun Shin & Simonne Longerich & Adrian Cortes & Chris Wallace, 2024. "CoPheScan: phenome-wide association studies accounting for linkage disequilibrium," Nature Communications, Nature, vol. 15(1), pages 1-13, December.
    2. Wesley L Crouse & Gregory R Keele & Madeleine S Gastonguay & Gary A Churchill & William Valdar, 2022. "A Bayesian model selection approach to mediation analysis," PLOS Genetics, Public Library of Science, vol. 18(5), pages 1-33, May.
    3. Valur Emilsson & Elias F. Gudmundsson & Thorarinn Jonmundsson & Brynjolfur G. Jonsson & Michael Twarog & Valborg Gudmundsdottir & Zhiguang Li & Nancy Finkel & Stephen Poor & Xin Liu & Robert Esterberg, 2022. "A proteogenomic signature of age-related macular degeneration in blood," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    4. V. E. Jackson & Y. Wu & R. Bonelli & J. P. Owen & L. W. Scott & S. Farashi & Y. Kihara & M. L. Gantner & C. Egan & K. M. Williams & B. R. E. Ansell & A. Tufail & A. Y. Lee & M. Bahlo, 2025. "Multi-omic spatial effects on high-resolution AI-derived retinal thickness," Nature Communications, Nature, vol. 16(1), pages 1-19, December.
    5. Matthew T. Patrick & Qinmengge Li & Rachael Wasikowski & Nehal Mehta & Johann E. Gudjonsson & James T. Elder & Xiang Zhou & Lam C. Tsoi, 2022. "Shared genetic risk factors and causal association between psoriasis and coronary artery disease," Nature Communications, Nature, vol. 13(1), pages 1-12, December.
    6. Amil M. Shah & Peder L. Myhre & Victoria Arthur & Pranav Dorbala & Humaira Rasheed & Leo F. Buckley & Brian Claggett & Guning Liu & Jianzhong Ma & Ngoc Quynh Nguyen & Kunihiro Matsushita & Chiadi Ndum, 2024. "Large scale plasma proteomics identifies novel proteins and protein networks associated with heart failure development," Nature Communications, Nature, vol. 15(1), pages 1-18, December.
    7. Maik Pietzner & Eleanor Wheeler & Julia Carrasco-Zanini & Nicola D. Kerrison & Erin Oerton & Mine Koprulu & Jian’an Luan & Aroon D. Hingorani & Steve A. Williams & Nicholas J. Wareham & Claudia Langen, 2021. "Synergistic insights into human health from aptamer- and antibody-based proteomic profiling," Nature Communications, Nature, vol. 12(1), pages 1-13, December.
    8. Lucas A. Mavromatis & Daniel B. Rosoff & Andrew S. Bell & Jeesun Jung & Josephin Wagner & Falk W. Lohoff, 2023. "Multi-omic underpinnings of epigenetic aging and human longevity," Nature Communications, Nature, vol. 14(1), pages 1-15, December.
    9. Karl Smith-Byrne & Åsa Hedman & Marios Dimitriou & Trishna Desai & Alexandr V. Sokolov & Helgi B. Schioth & Mine Koprulu & Maik Pietzner & Claudia Langenberg & Joshua Atkins & Ricardo Cortez Penha & J, 2024. "Identifying therapeutic targets for cancer among 2074 circulating proteins and risk of nine cancers," Nature Communications, Nature, vol. 15(1), pages 1-11, December.
    10. Bryan R. Gorman & Sun-Gou Ji & Michael Francis & Anoop K. Sendamarai & Yunling Shi & Poornima Devineni & Uma Saxena & Elizabeth Partan & Andrea K. DeVito & Jinyoung Byun & Younghun Han & Xiangjun Xiao, 2024. "Multi-ancestry GWAS meta-analyses of lung cancer reveal susceptibility loci and elucidate smoking-independent genetic risk," Nature Communications, Nature, vol. 15(1), pages 1-13, December.
    11. Jacob Joseph & Chang Liu & Qin Hui & Krishna Aragam & Zeyuan Wang & Brian Charest & Jennifer E. Huffman & Jacob M. Keaton & Todd L. Edwards & Serkalem Demissie & Luc Djousse & Juan P. Casas & J. Micha, 2022. "Genetic architecture of heart failure with preserved versus reduced ejection fraction," Nature Communications, Nature, vol. 13(1), pages 1-14, December.
    12. Natalie DeForest & Yuqi Wang & Zhiyi Zhu & Jacqueline S. Dron & Ryan Koesterer & Pradeep Natarajan & Jason Flannick & Tiffany Amariuta & Gina M. Peloso & Amit R. Majithia, 2024. "Genome-wide discovery and integrative genomic characterization of insulin resistance loci using serum triglycerides to HDL-cholesterol ratio as a proxy," Nature Communications, Nature, vol. 15(1), pages 1-17, December.
    13. Julia Schröder & Vitalia Schüller & Andrea May & Christian Gerges & Mario Anders & Jessica Becker & Timo Hess & Nicole Kreuser & René Thieme & Kerstin U Ludwig & Tania Noder & Marino Venerito & Lothar, 2019. "Identification of loci of functional relevance to Barrett’s esophagus and esophageal adenocarcinoma: Cross-referencing of expression quantitative trait loci data from disease-relevant tissues with gen," PLOS ONE, Public Library of Science, vol. 14(12), pages 1-12, December.
    14. Lili Liu & Atlas Khan & Elena Sanchez-Rodriguez & Francesca Zanoni & Yifu Li & Nicholas Steers & Olivia Balderes & Junying Zhang & Priya Krithivasan & Robert A. LeDesma & Clara Fischman & Scott J. Heb, 2022. "Genetic regulation of serum IgA levels and susceptibility to common immune, infectious, kidney, and cardio-metabolic traits," Nature Communications, Nature, vol. 13(1), pages 1-17, December.
    15. Marta Alcalde-Herraiz & JunQing Xie & Danielle Newby & Clara Prats & Dipender Gill & María Gordillo-Marañón & Daniel Prieto-Alhambra & Martí Català & Albert Prats-Uribe, 2024. "Effect of genetically predicted sclerostin on cardiovascular biomarkers, risk factors, and disease outcomes," Nature Communications, Nature, vol. 15(1), pages 1-12, December.
    16. Sylvia Hartmann & Summaira Yasmeen & Benjamin M. Jacobs & Spiros Denaxas & Munir Pirmohamed & Eric R. Gamazon & Mark J. Caulfield & Harry Hemingway & Maik Pietzner & Claudia Langenberg, 2023. "ADRA2A and IRX1 are putative risk genes for Raynaud’s phenomenon," Nature Communications, Nature, vol. 14(1), pages 1-11, December.
    17. Brittany L. Mitchell & Jake R. Saklatvala & Nick Dand & Fiona A. Hagenbeek & Xin Li & Josine L. Min & Laurent Thomas & Meike Bartels & Jouke Hottenga & Michelle K. Lupton & Dorret I. Boomsma & Xianjun, 2022. "Genome-wide association meta-analysis identifies 29 new acne susceptibility loci," Nature Communications, Nature, vol. 13(1), pages 1-9, December.
    18. Elizabeth C. Goode & Laura Fachal & Nikolaos Panousis & Loukas Moutsianas & Rebecca E. McIntyre & Benjamin Yu Hang Bai & Norihito Kawasaki & Alexandra Wittmann & Tim Raine & Simon M. Rushbrook & Carl , 2024. "Fine-mapping and molecular characterisation of primary sclerosing cholangitis genetic risk loci," Nature Communications, Nature, vol. 15(1), pages 1-14, December.
    19. Zichen Zhang & Ye Eun Bae & Jonathan R. Bradley & Lang Wu & Chong Wu, 2022. "SUMMIT: An integrative approach for better transcriptomic data imputation improves causal gene identification," Nature Communications, Nature, vol. 13(1), pages 1-12, December.
    20. Pietro Demela & Nicola Pirastu & Blagoje Soskic, 2023. "Cross-disorder genetic analysis of immune diseases reveals distinct gene associations that converge on common pathways," Nature Communications, Nature, vol. 14(1), pages 1-12, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pgen00:1011412. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosgenetics (email available below). General contact details of provider: https://journals.plos.org/plosgenetics/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.