IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v16y2025i1d10.1038_s41467-025-60056-1.html
   My bibliography  Save this article

Performance of deep-learning-based approaches to improve polygenic scores

Author

Listed:
  • Martin Kelemen

    (University of Cambridge
    University of Cambridge)

  • Yu Xu

    (University of Cambridge
    University of Cambridge)

  • Tao Jiang

    (University of Cambridge
    University of Cambridge)

  • Jing Hua Zhao

    (University of Cambridge
    University of Cambridge)

  • Carl A. Anderson

    (Wellcome Sanger Institute)

  • Chris Wallace

    (University of Cambridge
    University of Cambridge)

  • Adam Butterworth

    (University of Cambridge
    University of Cambridge
    University of Cambridge
    Wellcome Genome Campus and University of Cambridge)

  • Michael Inouye

    (University of Cambridge
    University of Cambridge
    University of Cambridge
    Wellcome Genome Campus and University of Cambridge)

Abstract

Polygenic scores, which estimate an individual’s genetic propensity for a disease or trait, have the potential to become part of genomic healthcare. Neural-network based deep-learning has emerged as a method of intense interest to model complex, nonlinear phenomena, which may be adapted to exploit gene-gene and gene-environment interactions to potentially improve polygenic scores. We fit neural-network models to both simulated and 28 real traits in the UK Biobank. To infer the amount of nonlinearity present in a phenotype, we also present a framework using neural-networks, which controls for the potential confounding effect of linkage disequilibrium. Although we found evidence for small amounts of nonlinear effects, neural-network models were outperformed by linear regression models for both genetic-only and genetic+environmental input scenarios. In this work, we find that the usefulness of neural-networks for generating polygenic scores may currently be limited and confounded by joint tagging effects due to linkage disequilibrium.

Suggested Citation

  • Martin Kelemen & Yu Xu & Tao Jiang & Jing Hua Zhao & Carl A. Anderson & Chris Wallace & Adam Butterworth & Michael Inouye, 2025. "Performance of deep-learning-based approaches to improve polygenic scores," Nature Communications, Nature, vol. 16(1), pages 1-9, December.
  • Handle: RePEc:nat:natcom:v:16:y:2025:i:1:d:10.1038_s41467-025-60056-1
    DOI: 10.1038/s41467-025-60056-1
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-025-60056-1
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-025-60056-1?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Tian Ge & Chia-Yen Chen & Yang Ni & Yen-Chen Anne Feng & Jordan W. Smoller, 2019. "Polygenic prediction via Bayesian regression and continuous shrinkage priors," Nature Communications, Nature, vol. 10(1), pages 1-10, December.
    2. Gad Abraham & Rainer Malik & Ekaterina Yonova-Doing & Agus Salim & Tingting Wang & John Danesh & Adam S. Butterworth & Joanna M. M. Howson & Michael Inouye & Martin Dichgans, 2019. "Genomic risk score offers predictive performance comparable to clinical risk factors for ischaemic stroke," Nature Communications, Nature, vol. 10(1), pages 1-10, December.
    3. Clare Bycroft & Colin Freeman & Desislava Petkova & Gavin Band & Lloyd T. Elliott & Kevin Sharp & Allan Motyer & Damjan Vukcevic & Olivier Delaneau & Jared O’Connell & Adrian Cortes & Samantha Welsh &, 2018. "The UK Biobank resource with deep phenotyping and genomic data," Nature, Nature, vol. 562(7726), pages 203-209, October.
    4. Alexander I. Young & Fabian Wauthier & Peter Donnelly, 2016. "Multiple novel gene-by-environment interactions modify the effect of FTO variants on body mass index," Nature Communications, Nature, vol. 7(1), pages 1-12, November.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Hou, Wenyan & Liu, Yuxin & Hao, Xingjie & Qi, Jike & Jiang, Yuchen & Huang, Shuiping & Zeng, Ping, 2025. "Relatively independent and complementary roles of family history and polygenic risk score in age at onset and incident cases of 12 common diseases," Social Science & Medicine, Elsevier, vol. 371(C).
    2. Ruoyu Tian & Tian Ge & Hyeokmoon Kweon & Daniel B. Rocha & Max Lam & Jimmy Z. Liu & Kritika Singh & Daniel F. Levey & Joel Gelernter & Murray B. Stein & Ellen A. Tsai & Hailiang Huang & Christopher F., 2024. "Whole-exome sequencing in UK Biobank reveals rare genetic architecture for depression," Nature Communications, Nature, vol. 15(1), pages 1-12, December.
    3. Magdalena Zimoń & Yunfeng Huang & Anthi Trasta & Aliaksandr Halavatyi & Jimmy Z. Liu & Chia-Yen Chen & Peter Blattmann & Bernd Klaus & Christopher D. Whelan & David Sexton & Sally John & Wolfgang Hube, 2021. "Pairwise effects between lipid GWAS genes modulate lipid plasma levels and cellular uptake," Nature Communications, Nature, vol. 12(1), pages 1-16, December.
    4. Rikifumi Ohta & Yosuke Tanigawa & Yuta Suzuki & Manolis Kellis & Shinichi Morishita, 2024. "A polygenic score method boosted by non-additive models," Nature Communications, Nature, vol. 15(1), pages 1-13, December.
    5. Carla Márquez-Luna & Steven Gazal & Po-Ru Loh & Samuel S. Kim & Nicholas Furlotte & Adam Auton & Alkes L. Price, 2021. "Incorporating functional priors improves polygenic prediction accuracy in UK Biobank and 23andMe data sets," Nature Communications, Nature, vol. 12(1), pages 1-11, December.
    6. Injeong Shim & Hiroyuki Kuwahara & NingNing Chen & Mais O. Hashem & Lama AlAbdi & Mohamed Abouelhoda & Hong-Hee Won & Pradeep Natarajan & Patrick T. Ellinor & Amit V. Khera & Xin Gao & Fowzan S. Alkur, 2023. "Clinical utility of polygenic scores for cardiometabolic disease in Arabs," Nature Communications, Nature, vol. 14(1), pages 1-11, December.
    7. Yuji Yamamoto & Yuya Shirai & Kyuto Sonehara & Shinichi Namba & Takafumi Ojima & Kenichi Yamamoto & Ryuya Edahiro & Ken Suzuki & Akinori Kanai & Yoshiya Oda & Yutaka Suzuki & Takayuki Morisaki & Akira, 2025. "Dissecting cross-population polygenic heterogeneity across respiratory and cardiometabolic diseases," Nature Communications, Nature, vol. 16(1), pages 1-18, December.
    8. Andras Gezsi & Sandra Auwera & Hannu Mäkinen & Nora Eszlari & Gabor Hullam & Tamas Nagy & Sarah Bonk & Rubèn González-Colom & Xenia Gonda & Linda Garvert & Teemu Paajanen & Zsofia Gal & Kevin Kirchner, 2024. "Unique genetic and risk-factor profiles in clusters of major depressive disorder-related multimorbidity trajectories," Nature Communications, Nature, vol. 15(1), pages 1-18, December.
    9. Alexander T. Williams & Jing Chen & Kayesha Coley & Chiara Batini & Abril Izquierdo & Richard Packer & Erik Abner & Stavroula Kanoni & David J. Shepherd & Robert C. Free & Edward J. Hollox & Nigel J. , 2023. "Genome-wide association study of thyroid-stimulating hormone highlights new genes, pathways and associations with thyroid disease," Nature Communications, Nature, vol. 14(1), pages 1-14, December.
    10. Clara Albiñana & Zhihong Zhu & Andrew J. Schork & Andrés Ingason & Hugues Aschard & Isabell Brikell & Cynthia M. Bulik & Liselotte V. Petersen & Esben Agerbo & Jakob Grove & Merete Nordentoft & David , 2023. "Multi-PGS enhances polygenic prediction by combining 937 polygenic scores," Nature Communications, Nature, vol. 14(1), pages 1-11, December.
    11. Kenichi Yamamoto & Shinichi Namba & Kyuto Sonehara & Ken Suzuki & Saori Sakaue & Niall P. Cooke & Shinichi Higashiue & Shuzo Kobayashi & Hisaaki Afuso & Kosho Matsuura & Yojiro Mitsumoto & Yasuhiko Fu, 2024. "Genetic legacy of ancient hunter-gatherer Jomon in Japanese populations," Nature Communications, Nature, vol. 15(1), pages 1-15, December.
    12. Wei Jiang & Ling Chen & Matthew J. Girgenti & Hongyu Zhao, 2024. "Tuning parameters for polygenic risk score methods using GWAS summary statistics from training data," Nature Communications, Nature, vol. 15(1), pages 1-15, December.
    13. James P. Pirruccello & Paolo Achille & Seung Hoan Choi & Joel T. Rämö & Shaan Khurshid & Mahan Nekoui & Sean J. Jurgens & Victor Nauffal & Shinwan Kany & Kenney Ng & Samuel F. Friedman & Puneet Batra , 2024. "Deep learning of left atrial structure and function provides link to atrial fibrillation risk," Nature Communications, Nature, vol. 15(1), pages 1-17, December.
    14. Bingxin Zhao & Yujue Li & Zirui Fan & Zhenyi Wu & Juan Shu & Xiaochen Yang & Yilin Yang & Xifeng Wang & Bingxuan Li & Xiyao Wang & Carlos Copana & Yue Yang & Jinjie Lin & Yun Li & Jason L. Stein & Joa, 2024. "Eye-brain connections revealed by multimodal retinal and brain imaging genetics," Nature Communications, Nature, vol. 15(1), pages 1-19, December.
    15. Yosuke Tanigawa & Junyang Qian & Guhan Venkataraman & Johanne Marie Justesen & Ruilin Li & Robert Tibshirani & Trevor Hastie & Manuel A Rivas, 2022. "Significant sparse polygenic risk scores across 813 traits in UK Biobank," PLOS Genetics, Public Library of Science, vol. 18(3), pages 1-21, March.
    16. Young Jin Kim & Sanghoon Moon & Mi Yeong Hwang & Sohee Han & Hye-Mi Jang & Jinhwa Kong & Dong Mun Shin & Kyungheon Yoon & Sung Min Kim & Jong-Eun Lee & Anubha Mahajan & Hyun-Young Park & Mark I. McCar, 2022. "The contribution of common and rare genetic variants to variation in metabolic traits in 288,137 East Asians," Nature Communications, Nature, vol. 13(1), pages 1-13, December.
    17. Chamlee Cho & Beomsu Kim & Dan Say Kim & Mi Yeong Hwang & Injeong Shim & Minku Song & Yeong Chan Lee & Sang-Hyuk Jung & Sung Kweon Cho & Woong-Yang Park & Woojae Myung & Bong-Jo Kim & Ron Do & Hyon K., 2024. "Large-scale cross-ancestry genome-wide meta-analysis of serum urate," Nature Communications, Nature, vol. 15(1), pages 1-17, December.
    18. Douglas P. Loesch & Manik Garg & Dorota Matelska & Dimitrios Vitsios & Xiao Jiang & Scott C. Ritchie & Benjamin B. Sun & Heiko Runz & Christopher D. Whelan & Rury R. Holman & Robert J. Mentz & Filipe , 2025. "Identification of plasma proteomic markers underlying polygenic risk of type 2 diabetes and related comorbidities," Nature Communications, Nature, vol. 16(1), pages 1-16, December.
    19. Sang‑Hyuk Jung & Haemin Kim & Young Mi Jung & Manu Shivakumar & Brenda Xiao & Jaeyoung Kim & Beomjin Jang & Jae-Seung Yun & Hong-Hee Won & Chan-Wook Park & Joong Shin Park & Jong Kwan Jun & Dokyoon Ki, 2025. "Healthy lifestyle reduces cardiovascular risk in women with genetic predisposition to hypertensive disorders of pregnancy," Nature Communications, Nature, vol. 16(1), pages 1-11, December.
    20. Junyang Qian & Yosuke Tanigawa & Wenfei Du & Matthew Aguirre & Chris Chang & Robert Tibshirani & Manuel A Rivas & Trevor Hastie, 2020. "A fast and scalable framework for large-scale and ultrahigh-dimensional sparse regression with application to the UK Biobank," PLOS Genetics, Public Library of Science, vol. 16(10), pages 1-30, October.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:16:y:2025:i:1:d:10.1038_s41467-025-60056-1. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.