IDEAS home Printed from https://ideas.repec.org/a/plo/pcbi00/1013268.html
   My bibliography  Save this article

EUP: Enhanced cross-species prediction of ubiquitination sites via a conditional variational autoencoder network based on ESM2

Author

Listed:
  • Junhao Liu
  • Zeyu Luo
  • Rui Wang
  • Xin Li
  • Yawen Sun
  • Zongqing Chen
  • Yu-Juan Zhang

Abstract

Ubiquitination is critical in biomedical research. Predicting ubiquitination sites based on deep learning model have advanced the study of ubiquitination. However, traditional supervised model limits in the scenarios where labels are scarcity across species. To address this issue, we introduce EUP, an online webserver for ubiquitination prediction and model interpretation for multi-species. EUP is constructed by extracting lysine site-dependent features from pretrained language model ESM2. Then, utilizing conditional variational inference to reduce the ESM2 features to a lower-dimensional latent representation. By constructing downstream models built on this latent feature representation, EUP exhibited superior performance in predicting ubiquitination sites across species, while maintaining low inference latency. Furthermore, key features for predicting ubiquitination sites were identified across animals, plants, and microbes. The identification of shared key features that capture evolutionarily conserved traits enhances the interpretability of the EUP model for ubiquitination prediction. EUP is free and available at (https://eup.aibtit.com/).Author summary: We present EUP, an AI-powered web-based tool designed to predict ubiquitination sites on protein sequences. Ubiquitination is a critical regulatory process in cells, where small ubiquitin molecules are attached to specific lysine residues to control protein stability and function. Identifying these modification sites is essential for understanding cellular mechanisms, but experimental detection is time-consuming and often unavailable for many non-model organisms. To address this challenge, EUP leverages a pretrained protein language model to extract informative features from sequences and applies variational inference to predict likely ubiquitination sites. The tool supports prediction across animals, plants, and microbes, and highlights both conserved and species-specific patterns. This provides users with interpretable insights into how ubiquitination may vary across evolution. EUP is freely available at (https://eup.aibtit.com) and offers a practical solution for researchers studying protein regulation across diverse biological systems.

Suggested Citation

  • Junhao Liu & Zeyu Luo & Rui Wang & Xin Li & Yawen Sun & Zongqing Chen & Yu-Juan Zhang, 2025. "EUP: Enhanced cross-species prediction of ubiquitination sites via a conditional variational autoencoder network based on ESM2," PLOS Computational Biology, Public Library of Science, vol. 21(7), pages 1-23, July.
  • Handle: RePEc:plo:pcbi00:1013268
    DOI: 10.1371/journal.pcbi.1013268
    as

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1013268
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1013268&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pcbi.1013268?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1013268. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.