IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v16y2025i1d10.1038_s41467-025-62318-4.html
   My bibliography  Save this article

PreMode predicts mode-of-action of missense variants by deep graph representation learning of protein sequence and structural context

Author

Listed:
  • Guojie Zhong

    (Columbia University Irving Medical Center
    Columbia University)

  • Yige Zhao

    (Columbia University Irving Medical Center
    Columbia University)

  • Demi Zhuang

    (Columbia University Irving Medical Center)

  • Wendy K. Chung

    (Boston Children’s Hospital and Harvard Medical School)

  • Yufeng Shen

    (Columbia University Irving Medical Center
    Columbia University Irving Medical Center
    Columbia University)

Abstract

Accurate prediction of the functional impact of missense variants is important for disease gene discovery, clinical genetic diagnostics, therapeutic strategies, and protein engineering. Previous efforts have focused on predicting a binary pathogenicity classification, but the functional impact of missense variants is multi-dimensional. Pathogenic missense variants in the same gene may act through different modes of action (i.e., gain/loss-of-function) by affecting different aspects of protein function. They may result in distinct clinical conditions that require different treatments. We develop a new method, PreMode, to perform gene-specific mode-of-action predictions. PreMode models effects of coding sequence variants using SE(3)-equivariant graph neural networks on protein sequences and structures. Using the largest-to-date set of missense variants with known modes of action, we show that PreMode reaches state-of-the-art performance in multiple types of mode-of-action predictions by efficient transfer-learning. Additionally, PreMode’s prediction of G/LoF variants in a kinase is consistent with inactive-active conformation transition energy changes. Finally, we show that PreMode enables efficient study design of deep mutational scans and can be expanded to fitness optimization of non-human proteins with active learning.

Suggested Citation

  • Guojie Zhong & Yige Zhao & Demi Zhuang & Wendy K. Chung & Yufeng Shen, 2025. "PreMode predicts mode-of-action of missense variants by deep graph representation learning of protein sequence and structural context," Nature Communications, Nature, vol. 16(1), pages 1-15, December.
  • Handle: RePEc:nat:natcom:v:16:y:2025:i:1:d:10.1038_s41467-025-62318-4
    DOI: 10.1038/s41467-025-62318-4
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-025-62318-4
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-025-62318-4?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Lukas Gerasimavicius & Benjamin J. Livesey & Joseph A. Marsh, 2022. "Loss-of-function, gain-of-function and dominant-negative mutations have profoundly different effects on protein structure," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    2. Jonathan Frazer & Pascal Notin & Mafalda Dias & Aidan Gomez & Joseph K. Min & Kelly Brock & Yarin Gal & Debora S. Marks, 2021. "Disease variant prediction with deep generative models of evolutionary data," Nature, Nature, vol. 599(7883), pages 91-95, November.
    3. Timothy H. Tran & Albert H. Chan & Lucy C. Young & Lakshman Bindu & Chris Neale & Simon Messing & Srisathiyanarayanan Dharmaiah & Troy Taylor & John-Paul Denson & Dominic Esposito & Dwight V. Nissley , 2021. "KRAS interaction with RAF1 RAS-binding domain and cysteine-rich domain provides insights into RAS-mediated RAF activation," Nature Communications, Nature, vol. 12(1), pages 1-16, December.
    4. Juliana A. Martinez Fiesco & David E. Durrant & Deborah K. Morrison & Ping Zhang, 2022. "Structural insights into the BRAF monomer-to-dimer transition mediated by RAS binding," Nature Communications, Nature, vol. 13(1), pages 1-14, December.
    5. John Jumper & Richard Evans & Alexander Pritzel & Tim Green & Michael Figurnov & Olaf Ronneberger & Kathryn Tunyasuvunakool & Russ Bates & Augustin Žídek & Anna Potapenko & Alex Bridgland & Clemens Me, 2021. "Highly accurate protein structure prediction with AlphaFold," Nature, Nature, vol. 596(7873), pages 583-589, August.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Kian Hong Kock & Patrick K. Kimes & Stephen S. Gisselbrecht & Sachi Inukai & Sabrina K. Phanor & James T. Anderson & Gayatri Ramakrishnan & Colin H. Lipper & Dongyuan Song & Jesse V. Kurland & Julia M, 2024. "DNA binding analysis of rare variants in homeodomains reveals homeodomain specificity-determining residues," Nature Communications, Nature, vol. 15(1), pages 1-19, December.
    2. Eunyoung Park & Shaun Rawson & Anna Schmoker & Byeong-Won Kim & Sehee Oh & Kangkang Song & Hyesung Jeon & Michael J. Eck, 2023. "Cryo-EM structure of a RAS/RAF recruitment complex," Nature Communications, Nature, vol. 14(1), pages 1-11, December.
    3. Daniel J. Diaz & Chengyue Gong & Jeffrey Ouyang-Zhang & James M. Loy & Jordan Wells & David Yang & Andrew D. Ellington & Alexandros G. Dimakis & Adam R. Klivans, 2024. "Stability Oracle: a structure-based graph-transformer framework for identifying stabilizing mutations," Nature Communications, Nature, vol. 15(1), pages 1-15, December.
    4. Yige Zhao & Tian Lan & Guojie Zhong & Jake Hagen & Hongbing Pan & Wendy K. Chung & Yufeng Shen, 2025. "A probabilistic graphical model for estimating selection coefficients of nonsynonymous variants from human population sequence data," Nature Communications, Nature, vol. 16(1), pages 1-12, December.
    5. Dong Man Jang & Kayla Boxer & Byung Hak Ha & Emre Tkacik & Talya Levitz & Shaun Rawson & Rebecca J. Metivier & Anna Schmoker & Hyesung Jeon & Michael J. Eck, 2025. "Cryo-EM structures of CRAF/MEK1/14-3-3 complexes in autoinhibited and open-monomer states reveal features of RAF regulation," Nature Communications, Nature, vol. 16(1), pages 1-14, December.
    6. Mihaly Badonyi & Joseph A. Marsh, 2025. "Prevalence of loss-of-function, gain-of-function and dominant-negative mechanisms across genetic disease phenotypes," Nature Communications, Nature, vol. 16(1), pages 1-17, December.
    7. Guoling Li & Xue Dong & Jiamin Luo & Tanglong Yuan & Tong Li & Guoli Zhao & Hainan Zhang & Jingxing Zhou & Zhenhai Zeng & Shuna Cui & Haoqiang Wang & Yin Wang & Yuyang Yu & Yuan Yuan & Erwei Zuo & Chu, 2024. "Engineering TadA ortholog-derived cytosine base editor without motif preference and adenosine activity limitation," Nature Communications, Nature, vol. 15(1), pages 1-10, December.
    8. Ziyi Zhou & Liang Zhang & Yuanxi Yu & Banghao Wu & Mingchen Li & Liang Hong & Pan Tan, 2024. "Enhancing efficiency of protein language models with minimal wet-lab data through few-shot learning," Nature Communications, Nature, vol. 15(1), pages 1-13, December.
    9. Yinglu Cui & Yanchun Chen & Jinyuan Sun & Tong Zhu & Hua Pang & Chunli Li & Wen-Chao Geng & Bian Wu, 2024. "Computational redesign of a hydrolase for nearly complete PET depolymerization at industrially relevant high-solids loading," Nature Communications, Nature, vol. 15(1), pages 1-12, December.
    10. Mofan Feng & Xiaoxi Wei & Xi Zheng & Liangjie Liu & Lin Lin & Manying Xia & Guang He & Yi Shi & Qing Lu, 2024. "Decoding Missense Variants by Incorporating Phase Separation via Machine Learning," Nature Communications, Nature, vol. 15(1), pages 1-17, December.
    11. Lasse M. Blaabjerg & Nicolas Jonsson & Wouter Boomsma & Amelie Stein & Kresten Lindorff-Larsen, 2024. "SSEmb: A joint embedding of protein sequence and structure enables robust variant effect predictions," Nature Communications, Nature, vol. 15(1), pages 1-9, December.
    12. Yaan J. Jang & Qi-Qi Qin & Si-Yu Huang & Arun T. John Peter & Xue-Ming Ding & Benoît Kornmann, 2024. "Accurate prediction of protein function using statistics-informed graph networks," Nature Communications, Nature, vol. 15(1), pages 1-12, December.
    13. Erik Abner & Kanwal Batool & Nele Taba & Tiit Nikopensius & Kristi Läll & Anastasiia Alekseienko & Anders Eriksson & Joel Rämö & Hele Haapaniemi & Hanna Maria Kariis & Liis Haljasmägi & Urmo Võsa & Ta, 2025. "Characterization of prevalent genetic variants in the Estonian Biobank body-mass index GWAS," Nature Communications, Nature, vol. 16(1), pages 1-14, December.
    14. Cheyenne Ziegler & Jonathan Martin & Claude Sinner & Faruck Morcos, 2023. "Latent generative landscapes as maps of functional diversity in protein sequence space," Nature Communications, Nature, vol. 14(1), pages 1-15, December.
    15. Brandon M. Murphy & Elizabeth M. Terrell & Venkat R. Chirasani & Tirzah J. Weiss & Rachel E. Lew & Andrea M. Holderbaum & Aastha Dhakal & Valentina Posada & Marie Fort & Michael S. Bodnar & Leiah M. C, 2022. "Enhanced BRAF engagement by NRAS mutants capable of promoting melanoma initiation," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    16. Eli Fritz McDonald & Kathryn E Oliver & Jonathan P Schlebach & Jens Meiler & Lars Plate, 2024. "Benchmarking AlphaMissense pathogenicity predictions against cystic fibrosis variants," PLOS ONE, Public Library of Science, vol. 19(1), pages 1-20, January.
    17. Ye Yuan & Lei Chen & Kexu Song & Miaomiao Cheng & Ling Fang & Lingfei Kong & Lanlan Yu & Ruonan Wang & Zhendong Fu & Minmin Sun & Qian Wang & Chengjun Cui & Haojue Wang & Jiuyang He & Xiaonan Wang & Y, 2024. "Stable peptide-assembled nanozyme mimicking dual antifungal actions," Nature Communications, Nature, vol. 15(1), pages 1-17, December.
    18. Ivica Odorčić & Mohamed Belal Hamed & Sam Lismont & Lucía Chávez-Gutiérrez & Rouslan G. Efremov, 2024. "Apo and Aβ46-bound γ-secretase structures provide insights into amyloid-β processing by the APH-1B isoform," Nature Communications, Nature, vol. 15(1), pages 1-14, December.
    19. Léon Faure & Bastien Mollet & Wolfram Liebermeister & Jean-Loup Faulon, 2023. "A neural-mechanistic hybrid approach improving the predictive power of genome-scale metabolic models," Nature Communications, Nature, vol. 14(1), pages 1-14, December.
    20. Pantelis Livanos & Choy Kriechbaum & Sophia Remers & Arvid Herrmann & Sabine Müller, 2025. "Kinesin-12 POK2 polarization is a prerequisite for a fully functional division site and aids cell plate positioning," Nature Communications, Nature, vol. 16(1), pages 1-17, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:16:y:2025:i:1:d:10.1038_s41467-025-62318-4. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.