IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v15y2024i1d10.1038_s41467-024-45135-z.html
   My bibliography  Save this article

Improving polygenic risk prediction in admixed populations by explicitly modeling ancestral-differential effects via GAUDI

Author

Listed:
  • Quan Sun

    (University of North Carolina at Chapel Hill)

  • Bryce T. Rowland

    (University of North Carolina at Chapel Hill)

  • Jiawen Chen

    (University of North Carolina at Chapel Hill)

  • Anna V. Mikhaylova

    (University of Washington)

  • Christy Avery

    (University of North Carolina at Chapel Hill)

  • Ulrike Peters

    (Fred Hutchinson Cancer Center)

  • Jessica Lundin

    (Fred Hutchinson Cancer Center)

  • Tara Matise

    (Rutgers University)

  • Steve Buyske

    (Rutgers University)

  • Ran Tao

    (Vanderbilt University Medical Center
    Vanderbilt University Medical Center)

  • Rasika A. Mathias

    (Johns Hopkins University)

  • Alexander P. Reiner

    (University of Washington)

  • Paul L. Auer

    (and Cancer Center, Medical College of Wisconsin)

  • Nancy J. Cox

    (Vanderbilt University Medical Center
    Vanderbilt University Medical Center)

  • Charles Kooperberg

    (Fred Hutchinson Cancer Center)

  • Timothy A. Thornton

    (University of Washington)

  • Laura M. Raffield

    (University of North Carolina at Chapel Hill)

  • Yun Li

    (University of North Carolina at Chapel Hill
    University of North Carolina at Chapel Hill)

Abstract

Polygenic risk scores (PRS) have shown successes in clinics, but most PRS methods focus only on participants with distinct primary continental ancestry without accommodating recently-admixed individuals with mosaic continental ancestry backgrounds for different segments of their genomes. Here, we develop GAUDI, a novel penalized-regression-based method specifically designed for admixed individuals. GAUDI explicitly models ancestry-differential effects while borrowing information across segments with shared ancestry in admixed genomes. We demonstrate marked advantages of GAUDI over other methods through comprehensive simulation and real data analyses for traits with associated variants exhibiting ancestral-differential effects. Leveraging data from the Women’s Health Initiative study, we show that GAUDI improves PRS prediction of white blood cell count and C-reactive protein in African Americans by > 64% compared to alternative methods, and even outperforms PRS-CSx with large European GWAS for some scenarios. We believe GAUDI will be a valuable tool to mitigate disparities in PRS performance in admixed individuals.

Suggested Citation

  • Quan Sun & Bryce T. Rowland & Jiawen Chen & Anna V. Mikhaylova & Christy Avery & Ulrike Peters & Jessica Lundin & Tara Matise & Steve Buyske & Ran Tao & Rasika A. Mathias & Alexander P. Reiner & Paul , 2024. "Improving polygenic risk prediction in admixed populations by explicitly modeling ancestral-differential effects via GAUDI," Nature Communications, Nature, vol. 15(1), pages 1-14, December.
  • Handle: RePEc:nat:natcom:v:15:y:2024:i:1:d:10.1038_s41467-024-45135-z
    DOI: 10.1038/s41467-024-45135-z
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-024-45135-z
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-024-45135-z?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Konrad J. Karczewski & Laurent C. Francioli & Grace Tiao & Beryl B. Cummings & Jessica Alföldi & Qingbo Wang & Ryan L. Collins & Kristen M. Laricchia & Andrea Ganna & Daniel P. Birnbaum & Laura D. Gau, 2020. "The mutational constraint spectrum quantified from variation in 141,456 humans," Nature, Nature, vol. 581(7809), pages 434-443, May.
    2. Saredo Said & Raha Pazoki & Ville Karhunen & Urmo Võsa & Symen Ligthart & Barbara Bodinier & Fotios Koskeridis & Paul Welsh & Behrooz Z. Alizadeh & Daniel I. Chasman & Naveed Sattar & Marc Chadeau-Hya, 2022. "Genetic analysis of over half a million people characterises C-reactive protein loci," Nature Communications, Nature, vol. 13(1), pages 1-10, December.
    3. Genevieve L. Wojcik & Mariaelisa Graff & Katherine K. Nishimura & Ran Tao & Jeffrey Haessler & Christopher R. Gignoux & Heather M. Highland & Yesha M. Patel & Elena P. Sorokin & Christy L. Avery & Gil, 2019. "Genetic analyses of diverse populations improves discovery for complex traits," Nature, Nature, vol. 570(7762), pages 514-518, June.
    4. Davide Marnetto & Katri Pärna & Kristi Läll & Ludovica Molinaro & Francesco Montinaro & Toomas Haller & Mait Metspalu & Reedik Mägi & Krista Fischer & Luca Pagani, 2020. "Ancestry deconvolution and partial polygenic score can improve susceptibility predictions in recently admixed individuals," Nature Communications, Nature, vol. 11(1), pages 1-9, December.
    5. Robert Tibshirani & Michael Saunders & Saharon Rosset & Ji Zhu & Keith Knight, 2005. "Sparsity and smoothness via the fused lasso," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 67(1), pages 91-108, February.
    6. Daniel Taliun & Daniel N. Harris & Michael D. Kessler & Jedidiah Carlson & Zachary A. Szpiech & Raul Torres & Sarah A. Gagliano Taliun & André Corvelo & Stephanie M. Gogarten & Hyun Min Kang & Achille, 2021. "Sequencing of 53,831 diverse genomes from the NHLBI TOPMed Program," Nature, Nature, vol. 590(7845), pages 290-299, February.
    7. Jiacheng Miao & Hanmin Guo & Gefei Song & Zijie Zhao & Lin Hou & Qiongshi Lu, 2023. "Quantifying portable genetic effects and improving cross-ancestry genetic prediction with GWAS summary statistics," Nature Communications, Nature, vol. 14(1), pages 1-13, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Shaan Khurshid & Julieta Lazarte & James P. Pirruccello & Lu-Chen Weng & Seung Hoan Choi & Amelia W. Hall & Xin Wang & Samuel F. Friedman & Victor Nauffal & Kiran J. Biddinger & Krishna G. Aragam & Pu, 2023. "Clinical and genetic associations of deep learning-derived cardiac magnetic resonance-based left ventricular mass," Nature Communications, Nature, vol. 14(1), pages 1-11, December.
    2. Jingning Zhang & Jianan Zhan & Jin Jin & Cheng Ma & Ruzhang Zhao & Jared O’Connell & Yunxuan Jiang & Bertram L. Koelsch & Haoyu Zhang & Nilanjan Chatterjee, 2024. "An ensemble penalized regression method for multi-ancestry polygenic risk prediction," Nature Communications, Nature, vol. 15(1), pages 1-14, December.
    3. Zihuai He & Linxi Liu & Michael E. Belloy & Yann Guen & Aaron Sossin & Xiaoxia Liu & Xinran Qi & Shiyang Ma & Prashnna K. Gyawali & Tony Wyss-Coray & Hua Tang & Chiara Sabatti & Emmanuel Candès & Mich, 2022. "GhostKnockoff inference empowers identification of putative causal variants in genome-wide association studies," Nature Communications, Nature, vol. 13(1), pages 1-16, December.
    4. Ananyo Choudhury & Jean-Tristan Brandenburg & Tinashe Chikowore & Dhriti Sengupta & Palwende Romuald Boua & Nigel J. Crowther & Godfred Agongo & Gershim Asiki & F. Xavier Gómez-Olivé & Isaac Kisiangan, 2022. "Meta-analysis of sub-Saharan African studies provides insights into genetic architecture of lipid traits," Nature Communications, Nature, vol. 13(1), pages 1-13, December.
    5. Alexandra Barry & Michelle T. McNulty & Xiaoyuan Jia & Yask Gupta & Hanna Debiec & Yang Luo & China Nagano & Tomoko Horinouchi & Seulgi Jung & Manuela Colucci & Dina F. Ahram & Adele Mitrotti & Aditi , 2023. "Multi-population genome-wide association study implicates immune and non-immune factors in pediatric steroid-sensitive nephrotic syndrome," Nature Communications, Nature, vol. 14(1), pages 1-13, December.
    6. Tutz, Gerhard & Pößnecker, Wolfgang & Uhlmann, Lorenz, 2015. "Variable selection in general multinomial logit models," Computational Statistics & Data Analysis, Elsevier, vol. 82(C), pages 207-222.
    7. Asmundur Oddsson & Patrick Sulem & Gardar Sveinbjornsson & Gudny A. Arnadottir & Valgerdur Steinthorsdottir & Gisli H. Halldorsson & Bjarni A. Atlason & Gudjon R. Oskarsson & Hannes Helgason & Henriet, 2023. "Deficit of homozygosity among 1.52 million individuals and genetic causes of recessive lethality," Nature Communications, Nature, vol. 14(1), pages 1-15, December.
    8. Mkhadri, Abdallah & Ouhourane, Mohamed, 2013. "An extended variable inclusion and shrinkage algorithm for correlated variables," Computational Statistics & Data Analysis, Elsevier, vol. 57(1), pages 631-644.
    9. Vincent Michaud & Eulalie Lasseaux & David J. Green & Dave T. Gerrard & Claudio Plaisant & Tomas Fitzgerald & Ewan Birney & Benoît Arveiler & Graeme C. Black & Panagiotis I. Sergouniotis, 2022. "The contribution of common regulatory and protein-coding TYR variants to the genetic architecture of albinism," Nature Communications, Nature, vol. 13(1), pages 1-8, December.
    10. Yize Zhao & Matthias Chung & Brent A. Johnson & Carlos S. Moreno & Qi Long, 2016. "Hierarchical Feature Selection Incorporating Known and Novel Biological Information: Identifying Genomic Features Related to Prostate Cancer Recurrence," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(516), pages 1427-1439, October.
    11. Francis X. Diebold & Kamil Yilmaz, 2016. "Trans-Atlantic Equity Volatility Connectedness: U.S. and European Financial Institutions, 2004–2014," Journal of Financial Econometrics, Oxford University Press, vol. 14(1), pages 81-127.
    12. Katrina G. Claw & Casey R. Dorr & Erica L. Woodahl, 2024. "Implementing community-engaged pharmacogenomics in Indigenous communities," Nature Communications, Nature, vol. 15(1), pages 1-5, December.
    13. Laura M. Mueller & Abigail Isaacson & Heather Wilson & Anna Salowka & Isabel Tay & Maolian Gong & Nancy Samir Elbarbary & Klemens Raile & Francesca M. Spagnoli, 2024. "Heterozygous missense variant in GLI2 impairs human endocrine pancreas development," Nature Communications, Nature, vol. 15(1), pages 1-13, December.
    14. Jian Guo & Elizaveta Levina & George Michailidis & Ji Zhu, 2010. "Pairwise Variable Selection for High-Dimensional Model-Based Clustering," Biometrics, The International Biometric Society, vol. 66(3), pages 793-804, September.
    15. Franck Rapaport & Christina Leslie, 2010. "Determining Frequent Patterns of Copy Number Alterations in Cancer," PLOS ONE, Public Library of Science, vol. 5(8), pages 1-10, August.
    16. Lu Tang & Ling Zhou & Peter X. K. Song, 2019. "Fusion learning algorithm to combine partially heterogeneous Cox models," Computational Statistics, Springer, vol. 34(1), pages 395-414, March.
    17. Young‐Geun Choi & Lawrence P. Hanrahan & Derek Norton & Ying‐Qi Zhao, 2022. "Simultaneous spatial smoothing and outlier detection using penalized regression, with application to childhood obesity surveillance from electronic health records," Biometrics, The International Biometric Society, vol. 78(1), pages 324-336, March.
    18. Mathias Seviiri & Matthew H. Law & Jue-Sheng Ong & Puya Gharahkhani & Pierre Fontanillas & Catherine M. Olsen & David C. Whiteman & Stuart MacGregor, 2022. "A multi-phenotype analysis reveals 19 susceptibility loci for basal cell carcinoma and 15 for squamous cell carcinoma," Nature Communications, Nature, vol. 13(1), pages 1-14, December.
    19. Molly C. Klanderman & Kathryn B. Newhart & Tzahi Y. Cath & Amanda S. Hering, 2020. "Fault isolation for a complex decentralized waste water treatment facility," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 69(4), pages 931-951, August.
    20. Alexendar R. Perez & Laura Sala & Richard K. Perez & Joana A. Vidigal, 2021. "CSC software corrects off-target mediated gRNA depletion in CRISPR-Cas9 essentiality screens," Nature Communications, Nature, vol. 12(1), pages 1-11, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:15:y:2024:i:1:d:10.1038_s41467-024-45135-z. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.