Printed from https://ideas.repec.org/a/nat/natcom/v15y2024i1d10.1038_s41467-024-46663-4.html

Data-driven identification of predictive risk biomarkers for subgroups of osteoarthritis using interpretable machine learning

Author

Listed:
  • Rikke Linnemann Nielsen

    (Novo Nordisk Research Centre Oxford)

  • Thomas Monfeuga

    (Novo Nordisk Research Centre Oxford)

  • Robert R. Kitchen

    (Novo Nordisk Research Centre Oxford)

  • Line Egerod

    (Novo Nordisk Research Centre Oxford)

  • Luis G. Leal

    (Novo Nordisk Research Centre Oxford)

  • August Thomas Hjortshøj Schreyer

    (Novo Nordisk Research Centre Oxford)

  • Frederik Steensgaard Gade

    (Novo Nordisk A/S)

  • Carol Sun

    (Novo Nordisk Research Centre Oxford)

  • Marianne Helenius

    (Technical University of Denmark)

  • Lotte Simonsen

    (Novo Nordisk A/S)

  • Marianne Willert

    (Novo Nordisk A/S)

  • Abd A. Tahrani

    (Novo Nordisk A/S)

  • Zahra McVey

    (Novo Nordisk Research Centre Oxford)

  • Ramneek Gupta

    (Novo Nordisk Research Centre Oxford)

Abstract

Osteoarthritis (OA) is increasing in prevalence and has a severe impact on patients’ lives. However, our understanding of biomarkers driving OA risk remains limited. We developed a model predicting the five-year risk of OA diagnosis, integrating retrospective clinical, lifestyle and biomarker data from the UK Biobank (19,120 patients with OA, ROC-AUC: 0.72, 95%CI (0.71–0.73)). Higher age, BMI and prescription of non-steroidal anti-inflammatory drugs contributed most to increased OA risk prediction ahead of diagnosis. We identified 14 subgroups of OA risk profiles. These subgroups were validated in an independent set of patients evaluating the 11-year OA risk, with 88% of patients being uniquely assigned to one of the 14 subgroups. Individual OA risk profiles were characterised by personalised biomarkers. Omics integration demonstrated the predictive importance of key OA genes and pathways (e.g., GDF5 and TGF-β signalling) and OA-specific biomarkers (e.g., CRTAC1 and COL9A1). In summary, this work identifies opportunities for personalised OA prevention and insights into its underlying pathogenesis.
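
The abstract reports discrimination as a ROC-AUC with a 95% confidence interval. This listing does not say how the interval was obtained, so the following is only a minimal sketch of one common approach, a percentile bootstrap, and should not be read as the authors' method. The function names (roc_auc, bootstrap_ci), the parameters and the toy data are all hypothetical and chosen purely for illustration.

```python
import random


def roc_auc(labels, scores):
    """ROC-AUC via the rank-sum (Mann-Whitney U) identity; ties get average ranks."""
    order = sorted(range(len(scores)), key=lambda i: scores[i])
    ranks = [0.0] * len(scores)
    i = 0
    while i < len(order):
        # Find the run of tied scores starting at position i.
        j = i
        while j + 1 < len(order) and scores[order[j + 1]] == scores[order[i]]:
            j += 1
        avg_rank = (i + j) / 2 + 1  # 1-based average rank of the tied run
        for k in range(i, j + 1):
            ranks[order[k]] = avg_rank
        i = j + 1
    n_pos = sum(labels)
    n_neg = len(labels) - n_pos
    if n_pos == 0 or n_neg == 0:
        raise ValueError("both classes must be present")
    rank_sum_pos = sum(r for r, y in zip(ranks, labels) if y == 1)
    return (rank_sum_pos - n_pos * (n_pos + 1) / 2) / (n_pos * n_neg)


def bootstrap_ci(labels, scores, n_boot=1000, alpha=0.05, seed=0):
    """Percentile bootstrap interval for ROC-AUC (an assumed method, see lead-in)."""
    rng = random.Random(seed)
    n = len(labels)
    aucs = []
    while len(aucs) < n_boot:
        idx = [rng.randrange(n) for _ in range(n)]
        ys = [labels[i] for i in idx]
        if 0 < sum(ys) < n:  # skip resamples that contain only one class
            aucs.append(roc_auc(ys, [scores[i] for i in idx]))
    aucs.sort()
    lower = aucs[int((alpha / 2) * n_boot)]
    upper = aucs[min(n_boot - 1, int((1 - alpha / 2) * n_boot))]
    return lower, upper


# Toy data, invented for illustration only (not from the study):
# 1 = diagnosed within the follow-up window, 0 = not diagnosed.
toy_labels = [0, 0, 1, 1, 0, 1, 0, 1, 0, 0]
toy_scores = [0.10, 0.40, 0.35, 0.80, 0.20, 0.65, 0.55, 0.70, 0.30, 0.15]
auc = roc_auc(toy_labels, toy_scores)
ci_low, ci_high = bootstrap_ci(toy_labels, toy_scores, n_boot=500)
print(f"ROC-AUC {auc:.2f}, 95% CI ({ci_low:.2f}, {ci_high:.2f})")
```

With a cohort as large as the one described, the interval would be expected to be much narrower than on a ten-row toy set, which is consistent with the tight interval quoted in the abstract.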

Suggested Citation

  • Rikke Linnemann Nielsen & Thomas Monfeuga & Robert R. Kitchen & Line Egerod & Luis G. Leal & August Thomas Hjortshøj Schreyer & Frederik Steensgaard Gade & Carol Sun & Marianne Helenius & Lotte Simons, 2024. "Data-driven identification of predictive risk biomarkers for subgroups of osteoarthritis using interpretable machine learning," Nature Communications, Nature, vol. 15(1), pages 1-17, December.
  • Handle: RePEc:nat:natcom:v:15:y:2024:i:1:d:10.1038_s41467-024-46663-4
    DOI: 10.1038/s41467-024-46663-4

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-024-46663-4
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-024-46663-4?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item

    References listed on IDEAS

    1. Heli Julkunen & Anna Cichońska & Mika Tiainen & Harri Koskela & Kristian Nybo & Valtteri Mäkelä & Jussi Nokso-Koivisto & Kati Kristiansson & Markus Perola & Veikko Salomaa & Pekka Jousilahti & Annamar, 2023. "Atlas of plasma NMR biomarkers for health and disease in 118,461 individuals from the UK Biobank," Nature Communications, Nature, vol. 14(1), pages 1-15, December.
    2. Benjamin B. Sun & Joshua Chiou & Matthew Traylor & Christian Benner & Yi-Hsiang Hsu & Tom G. Richardson & Praveen Surendran & Anubha Mahajan & Chloe Robins & Steven G. Vasquez-Grinnell & Liping Hou & , 2023. "Plasma proteomic associations with genetics and health in the UK Biobank," Nature, Nature, vol. 622(7982), pages 329-338, October.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Filippos Anagnostakis & Sarah Ko & Mehrshad Saadatinia & Jingyue Wang & Christos Davatzikos & Junhao Wen, 2025. "Multi-organ metabolome biological age implicates cardiometabolic conditions and mortality risk," Nature Communications, Nature, vol. 16(1), pages 1-14, December.
    2. Chun Shen & Ruohan Zhang & Jintai Yu & Barbara J. Sahakian & Wei Cheng & Jianfeng Feng, 2025. "Plasma proteomic signatures of social isolation and loneliness associated with morbidity and mortality," Nature Human Behaviour, Nature, vol. 9(3), pages 569-583, March.
    3. Natalie DeForest & Yuqi Wang & Zhiyi Zhu & Jacqueline S. Dron & Ryan Koesterer & Pradeep Natarajan & Jason Flannick & Tiffany Amariuta & Gina M. Peloso & Amit R. Majithia, 2024. "Genome-wide discovery and integrative genomic characterization of insulin resistance loci using serum triglycerides to HDL-cholesterol ratio as a proxy," Nature Communications, Nature, vol. 15(1), pages 1-17, December.
    4. Baihan Wang & Alfred Pozarickij & Mohsen Mazidi & Neil Wright & Pang Yao & Saredo Said & Andri Iona & Christiana Kartsonaki & Hannah Fry & Kuang Lin & Yiping Chen & Huaidong Du & Daniel Avery & Dan Sc, 2025. "Comparative studies of 2168 plasma proteins measured by two affinity-based platforms in 4000 Chinese adults," Nature Communications, Nature, vol. 16(1), pages 1-13, December.
    5. Zhening Liu & Hangkai Huang & Jiarong Xie & Yingying Xu & Chengfu Xu, 2024. "Circulating fatty acids and risk of hepatocellular carcinoma and chronic liver disease mortality in the UK Biobank," Nature Communications, Nature, vol. 15(1), pages 1-10, December.
    6. Yash Patel & Jean Shin & Eeva Sliz & Ariana Tang & Aniket Mishra & Rui Xia & Edith Hofer & Hema Sekhar Reddy Rajula & Ruiqi Wang & Frauke Beyer & Katrin Horn & Max Riedl & Jing Yu & Henry Völzke & Rob, 2024. "Genetic risk factors underlying white matter hyperintensities and cortical atrophy," Nature Communications, Nature, vol. 15(1), pages 1-11, December.
    7. Shiyu Zhang & Zheng Wang & Yijing Wang & Yixiao Zhu & Qiao Zhou & Xingxing Jian & Guihu Zhao & Jian Qiu & Kun Xia & Beisha Tang & Julian Mutz & Jinchen Li & Bin Li, 2024. "A metabolomic profile of biological aging in 250,341 individuals from the UK Biobank," Nature Communications, Nature, vol. 15(1), pages 1-19, December.
    8. Jing Sun & Yue Liu & Jianhui Zhao & Bin Lu & Siyun Zhou & Wei Lu & Jingsun Wei & Yeting Hu & Xiangxing Kong & Junshun Gao & Hong Guan & Junli Gao & Qian Xiao & Xue Li, 2024. "Plasma proteomic and polygenic profiling improve risk stratification and personalized screening for colorectal cancer," Nature Communications, Nature, vol. 15(1), pages 1-10, December.
    9. Naman S. Shetty & Mokshad Gaonkar & Nirav Patel & Akhil Pampana & Nehal Vekariya & Peng Li & Garima Arora & Pankaj Arora, 2024. "Determinants of transthyretin levels and their association with adverse clinical outcomes among UK Biobank participants," Nature Communications, Nature, vol. 15(1), pages 1-7, December.
    10. Mine Koprulu & Eleanor Wheeler & Nicola D. Kerrison & Spiros Denaxas & Julia Carrasco-Zanini & Chloe M. Orkin & Harry Hemingway & Nicholas J. Wareham & Maik Pietzner & Claudia Langenberg, 2025. "Sex differences in the genetic regulation of the human plasma proteome," Nature Communications, Nature, vol. 16(1), pages 1-10, December.
    11. Yuji Yamamoto & Yuya Shirai & Kyuto Sonehara & Shinichi Namba & Takafumi Ojima & Kenichi Yamamoto & Ryuya Edahiro & Ken Suzuki & Akinori Kanai & Yoshiya Oda & Yutaka Suzuki & Takayuki Morisaki & Akira, 2025. "Dissecting cross-population polygenic heterogeneity across respiratory and cardiometabolic diseases," Nature Communications, Nature, vol. 16(1), pages 1-18, December.
    12. Abolfazl Doostparast Torshizi & Dongnhu T. Truong & Liping Hou & Bart Smets & Christopher D. Whelan & Shuwei Li, 2024. "Proteogenomic network analysis reveals dysregulated mechanisms and potential mediators in Parkinson’s disease," Nature Communications, Nature, vol. 15(1), pages 1-16, December.
    13. Yijun Li & Zhaozhong Zhu & Carlos A. Camargo & Janice A. Espinola & Kohei Hasegawa & Liming Liang, 2025. "Epigenomic and proteomic analyses provide insights into early-life immune regulation and asthma development in infants," Nature Communications, Nature, vol. 16(1), pages 1-14, December.
    14. Hongru Li & Jingyi Zhao & Jinglan Dai & Dongfang You & Yang Zhao & David C. Christiani & Feng Chen & Sipeng Shen, 2025. "Multi-ancestry sequencing-based genome-wide association study of C-reactive protein in 513,273 genomes," Nature Communications, Nature, vol. 16(1), pages 1-11, December.
    15. Jia You & Yu Guo & Yi Zhang & Ju-Jiao Kang & Lin-Bo Wang & Jian-Feng Feng & Wei Cheng & Jin-Tai Yu, 2023. "Plasma proteomic profiles predict individual future health risk," Nature Communications, Nature, vol. 14(1), pages 1-13, December.
    16. Javier Botey-Bataller & Hedwig D. Vrijmoeth & Jeanine Ursinus & Bart-Jan Kullberg & Cees C. Wijngaard & Hadewych Hofstede & Ahmed Alaswad & Manoj K. Gupta & Lennart M. Roesner & Jochen Huehn & Thomas , 2024. "A comprehensive genetic map of cytokine responses in Lyme borreliosis," Nature Communications, Nature, vol. 15(1), pages 1-15, December.
    17. Peng Ren & Xiao-He Hou & Zeyu Li & Jia You & Yuzhu Li & Wei Zhang & Weikang Gong & Bei Zhang & Bangsheng Wu & Linbo Wang & Chun Shen & Yujie Zhao & Qing Ma & Jujiao Kang & Yuchao Jiang & Neil Roberts , 2025. "Atlas of proteomic signatures of brain structure and its links to brain disorders," Nature Communications, Nature, vol. 16(1), pages 1-18, December.
    18. Anders Mälarstig & Felix Grassmann & Leo Dahl & Marios Dimitriou & Dianna McLeod & Marike Gabrielson & Karl Smith-Byrne & Cecilia E. Thomas & Tzu-Hsuan Huang & Simon K. G. Forsberg & Per Eriksson & Mi, 2023. "Evaluation of circulating plasma proteins in breast cancer using Mendelian randomisation," Nature Communications, Nature, vol. 14(1), pages 1-9, December.
    19. Wenbo Jiang & Hang Wang & Yiding Geng & Meijuan Guo & Yingdong Zuo & Sijia Han & Zijie Liu & Shuaijun Chen & Shuzhen Fan & Shangying Li & Conghui Qiao & Qianzhu Li & Bai Li & Yunpeng Zhang & Wei Wei &, 2025. "The health impacts and genetic architecture of food liking in cardio-metabolic diseases," Nature Communications, Nature, vol. 16(1), pages 1-20, December.
    20. Dennis Meer & Zillur Rahman & Aigar Ottas & Pravesh Parekh & Gleda Kutrolli & Sara E. Stinson & Maria Koromina & Jaroslav Rokicki & Ida E. Sønderby & Nadine Parker & Markos Tesfaye & Guy Hindley & Lin, 2025. "Pleiotropic and sex-specific genetic mechanisms of circulating metabolic markers," Nature Communications, Nature, vol. 16(1), pages 1-12, December.
