IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v16y2025i1d10.1038_s41467-025-61891-y.html
   My bibliography  Save this article

Comprehensive interaction modeling with machine learning improves prediction of disease risk in the UK Biobank

Author

Listed:
  • Heli Julkunen

    (Aalto University)

  • Juho Rousu

    (Aalto University)

Abstract

Understanding how risk factors interact to jointly influence disease risk can provide insights into disease development and improve risk prediction. Here we introduce survivalFM, a machine learning extension to the widely used Cox proportional hazards model that enables scalable estimation of all potential pairwise interaction effects on time-to-event outcomes. The method approximates interaction effects using a low-rank factorization, allowing it to overcome the computational and statistical limitations typically associated with high-dimensional interaction modeling. Applied to the UK Biobank dataset across nine disease examples and diverse clinical and omics risk factors, survivalFM improves prediction performance in terms of discrimination, explained variation, and reclassification in 30.6%, 41.7%, and 94.4% of the scenarios tested, respectively. In a clinical cardiovascular risk prediction scenario using the established QRISK3 model, the method adds predictive value by identifying interactions beyond the age interaction effects currently included. These results demonstrate that comprehensive modeling of interactions can facilitate advanced insights into disease development and improve risk predictions.

Suggested Citation

  • Heli Julkunen & Juho Rousu, 2025. "Comprehensive interaction modeling with machine learning improves prediction of disease risk in the UK Biobank," Nature Communications, Nature, vol. 16(1), pages 1-15, December.
  • Handle: RePEc:nat:natcom:v:16:y:2025:i:1:d:10.1038_s41467-025-61891-y
    DOI: 10.1038/s41467-025-61891-y
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-025-61891-y
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-025-61891-y?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Patrick Royston, 2006. "Explained variation for survival models," Stata Journal, StataCorp LLC, vol. 6(1), pages 83-96, March.
    2. Friedman, Jerome H. & Hastie, Trevor & Tibshirani, Rob, 2010. "Regularization Paths for Generalized Linear Models via Coordinate Descent," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 33(i01).
    3. Simon, Noah & Friedman, Jerome H. & Hastie, Trevor & Tibshirani, Rob, 2011. "Regularization Paths for Cox's Proportional Hazards Model via Coordinate Descent," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 39(i05).
    4. Clare Bycroft & Colin Freeman & Desislava Petkova & Gavin Band & Lloyd T. Elliott & Kevin Sharp & Allan Motyer & Damjan Vukcevic & Olivier Delaneau & Jared O’Connell & Adrian Cortes & Samantha Welsh &, 2018. "The UK Biobank resource with deep phenotyping and genomic data," Nature, Nature, vol. 562(7726), pages 203-209, October.
    5. Heli Julkunen & Anna Cichońska & Mika Tiainen & Harri Koskela & Kristian Nybo & Valtteri Mäkelä & Jussi Nokso-Koivisto & Kati Kristiansson & Markus Perola & Veikko Salomaa & Pekka Jousilahti & Annamar, 2023. "Atlas of plasma NMR biomarkers for health and disease in 118,461 individuals from the UK Biobank," Nature Communications, Nature, vol. 14(1), pages 1-15, December.
    6. Shiyu Zhang & Zheng Wang & Yijing Wang & Yixiao Zhu & Qiao Zhou & Xingxing Jian & Guihu Zhao & Jian Qiu & Kun Xia & Beisha Tang & Julian Mutz & Jinchen Li & Bin Li, 2024. "A metabolomic profile of biological aging in 250,341 individuals from the UK Biobank," Nature Communications, Nature, vol. 15(1), pages 1-19, December.
    7. Benjamin B. Sun & Joshua Chiou & Matthew Traylor & Christian Benner & Yi-Hsiang Hsu & Tom G. Richardson & Praveen Surendran & Anubha Mahajan & Chloe Robins & Steven G. Vasquez-Grinnell & Liping Hou & , 2023. "Plasma proteomic associations with genetics and health in the UK Biobank," Nature, Nature, vol. 622(7982), pages 329-338, October.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Filippos Anagnostakis & Sarah Ko & Mehrshad Saadatinia & Jingyue Wang & Christos Davatzikos & Junhao Wen, 2025. "Multi-organ metabolome biological age implicates cardiometabolic conditions and mortality risk," Nature Communications, Nature, vol. 16(1), pages 1-14, December.
    2. Chun Shen & Ruohan Zhang & Jintai Yu & Barbara J. Sahakian & Wei Cheng & Jianfeng Feng, 2025. "Plasma proteomic signatures of social isolation and loneliness associated with morbidity and mortality," Nature Human Behaviour, Nature, vol. 9(3), pages 569-583, March.
    3. Natalie DeForest & Yuqi Wang & Zhiyi Zhu & Jacqueline S. Dron & Ryan Koesterer & Pradeep Natarajan & Jason Flannick & Tiffany Amariuta & Gina M. Peloso & Amit R. Majithia, 2024. "Genome-wide discovery and integrative genomic characterization of insulin resistance loci using serum triglycerides to HDL-cholesterol ratio as a proxy," Nature Communications, Nature, vol. 15(1), pages 1-17, December.
    4. Benedicte Sjo Tislevoll & Monica Hellesøy & Oda Helen Eck Fagerholt & Stein-Erik Gullaksen & Aashish Srivastava & Even Birkeland & Dimitrios Kleftogiannis & Pilar Ayuda-Durán & Laure Piechaczyk & Dagi, 2023. "Early response evaluation by single cell signaling profiling in acute myeloid leukemia," Nature Communications, Nature, vol. 14(1), pages 1-17, December.
    5. Zhening Liu & Hangkai Huang & Jiarong Xie & Yingying Xu & Chengfu Xu, 2024. "Circulating fatty acids and risk of hepatocellular carcinoma and chronic liver disease mortality in the UK Biobank," Nature Communications, Nature, vol. 15(1), pages 1-10, December.
    6. Andreas Groll & Gerhard Tutz, 2017. "Variable selection in discrete survival models including heterogeneity," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 23(2), pages 305-338, April.
    7. Shiyu Zhang & Zheng Wang & Yijing Wang & Yixiao Zhu & Qiao Zhou & Xingxing Jian & Guihu Zhao & Jian Qiu & Kun Xia & Beisha Tang & Julian Mutz & Jinchen Li & Bin Li, 2024. "A metabolomic profile of biological aging in 250,341 individuals from the UK Biobank," Nature Communications, Nature, vol. 15(1), pages 1-19, December.
    8. Matthew F Dixon, 2017. "Sequence Classification of the Limit Order Book using Recurrent Neural Networks," Papers 1707.05642, arXiv.org.
    9. Mine Koprulu & Eleanor Wheeler & Nicola D. Kerrison & Spiros Denaxas & Julia Carrasco-Zanini & Chloe M. Orkin & Harry Hemingway & Nicholas J. Wareham & Maik Pietzner & Claudia Langenberg, 2025. "Sex differences in the genetic regulation of the human plasma proteome," Nature Communications, Nature, vol. 16(1), pages 1-10, December.
    10. Jie Xiong & Zhitong Bing & Yanlin Su & Defeng Deng & Xiaoning Peng, 2014. "An Integrated mRNA and microRNA Expression Signature for Glioblastoma Multiforme Prognosis," PLOS ONE, Public Library of Science, vol. 9(5), pages 1-8, May.
    11. Liao Zhu & Robert A. Jarrow & Martin T. Wells, 2021. "Time-Invariance Coefficients Tests with the Adaptive Multi-Factor Model," Quarterly Journal of Finance (QJF), World Scientific Publishing Co. Pte. Ltd., vol. 11(04), pages 1-30, December.
    12. Yuji Yamamoto & Yuya Shirai & Kyuto Sonehara & Shinichi Namba & Takafumi Ojima & Kenichi Yamamoto & Ryuya Edahiro & Ken Suzuki & Akinori Kanai & Yoshiya Oda & Yutaka Suzuki & Takayuki Morisaki & Akira, 2025. "Dissecting cross-population polygenic heterogeneity across respiratory and cardiometabolic diseases," Nature Communications, Nature, vol. 16(1), pages 1-18, December.
    13. Abolfazl Doostparast Torshizi & Dongnhu T. Truong & Liping Hou & Bart Smets & Christopher D. Whelan & Shuwei Li, 2024. "Proteogenomic network analysis reveals dysregulated mechanisms and potential mediators in Parkinson’s disease," Nature Communications, Nature, vol. 15(1), pages 1-16, December.
    14. Gal Dinstag & David Amar & Erik Ingelsson & Euan Ashley & Ron Shamir, 2019. "Personalized prediction of adverse heart and kidney events using baseline and longitudinal data from SPRINT and ACCORD," PLOS ONE, Public Library of Science, vol. 14(8), pages 1-12, August.
    15. Paul Ghelasi & Florian Ziel, 2024. "From day-ahead to mid and long-term horizons with econometric electricity price forecasting models," Papers 2406.00326, arXiv.org, revised Aug 2024.
    16. Krista Freimann & Anneke Brümmer & Robert Warmerdam & Tarran S. Rupall & Ana Laura Hernández-Ledesma & Joshua Chiou & Emily R. Holzinger & Joseph C. Maranville & Nikolina Nakic & Halit Ongen & Luca St, 2025. "Trans-eQTL mapping prioritises USP18 as a negative regulator of interferon response at a lupus risk locus," Nature Communications, Nature, vol. 16(1), pages 1-15, December.
    17. Shikhar Uttam & Andrew M. Stern & Christopher J. Sevinsky & Samantha Furman & Filippo Pullara & Daniel Spagnolo & Luong Nguyen & Albert Gough & Fiona Ginty & D. Lansing Taylor & S. Chakra Chennubhotla, 2020. "Spatial domain analysis predicts risk of colorectal cancer recurrence and infers associated tumor microenvironment networks," Nature Communications, Nature, vol. 11(1), pages 1-14, December.
    18. Robert A. Jarrow & Rinald Murataj & Martin T. Wells & Liao Zhu, 2023. "The Low-Volatility Anomaly And The Adaptive Multi-Factor Model," International Journal of Theoretical and Applied Finance (IJTAF), World Scientific Publishing Co. Pte. Ltd., vol. 26(04n05), pages 1-33, August.
    19. Matt Fillingim & Christophe Tanguay-Sabourin & Marc Parisien & Azin Zare & Gianluca V. Guglietti & Jax Norman & Bogdan Petre & Andrey Bortsov & Mark Ware & Jordi Perez & Mathieu Roy & Luda Diatchenko , 2025. "Biological markers and psychosocial factors predict chronic pain conditions," Nature Human Behaviour, Nature, vol. 9(8), pages 1710-1725, August.
    20. Liao Zhu, 2021. "The Adaptive Multi-Factor Model and the Financial Market," Papers 2107.14410, arXiv.org, revised Aug 2021.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:16:y:2025:i:1:d:10.1038_s41467-025-61891-y. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.