IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v16y2025i1d10.1038_s41467-025-63412-3.html
   My bibliography  Save this article

LassoESM a tailored language model for enhanced lasso peptide property prediction

Author

Listed:
  • Xuenan Mi

    (University of Illinois Urbana-Champaign)

  • Susanna E. Barrett

    (University of Illinois Urbana-Champaign
    University of Illinois Urbana-Champaign)

  • Douglas A. Mitchell

    (Vanderbilt University School of Medicine
    Vanderbilt University)

  • Diwakar Shukla

    (University of Illinois Urbana-Champaign
    University of Illinois Urbana-Champaign
    University of Illinois Urbana-Champaign
    University of Illinois Urbana-Champaign)

Abstract

Ribosomally synthesized and post-translationally modified peptides (RiPPs) are a diverse group of natural products. The lasso peptide class of RiPPs adopt a unique [1]rotaxane conformation formed by a lasso cyclase, conferring diverse bioactivities and remarkable stability. The prediction of lasso peptide properties, such as substrate compatibility with a particular lasso cyclase or desired biological activity, remains challenging due to limited experimental data and the complexity of substrate fitness landscapes. Here, we develop LassoESM, a tailored language model that improves lasso peptide property prediction. LassoESM embeddings enable accurate prediction of substrate compatibility, facilitate identification of novel non-cognate cyclase–substrate pairs, and enhance prediction of RNA polymerase inhibitory activity, a biological activity of several known lasso peptides. We anticipate that LassoESM and future iterations will be instrumental in the rational design and discovery of lasso peptides with tailored functions.

Suggested Citation

  • Xuenan Mi & Susanna E. Barrett & Douglas A. Mitchell & Diwakar Shukla, 2025. "LassoESM a tailored language model for enhanced lasso peptide property prediction," Nature Communications, Nature, vol. 16(1), pages 1-13, December.
  • Handle: RePEc:nat:natcom:v:16:y:2025:i:1:d:10.1038_s41467-025-63412-3
    DOI: 10.1038/s41467-025-63412-3
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-025-63412-3
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-025-63412-3?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Wenwu Zeng & Yutao Dou & Liangrui Pan & Liwen Xu & Shaoliang Peng, 2024. "Improving prediction performance of general protein language model by domain-adaptive pretraining on DNA-binding protein," Nature Communications, Nature, vol. 15(1), pages 1-18, December.
    2. Yunan Luo & Guangde Jiang & Tianhao Yu & Yang Liu & Lam Vo & Hantian Ding & Yufeng Su & Wesley Wei Qian & Huimin Zhao & Jian Peng, 2021. "ECNet is an evolutionary context-integrated deep learning framework for protein engineering," Nature Communications, Nature, vol. 12(1), pages 1-14, December.
    3. Xingyu Ouyang & Xinchun Ran & Han Xu & Runeem Al-Abssi & Yi-Lei Zhao & A. James Link & Zhongyue J. Yang, 2025. "LassoPred: a tool to predict the 3D structure of lasso peptides," Nature Communications, Nature, vol. 16(1), pages 1-13, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Yinghui Chen & Yunxin Xu & Di Liu & Yaoguang Xing & Haipeng Gong, 2024. "An end-to-end framework for the prediction of protein structure and fitness from single sequence," Nature Communications, Nature, vol. 15(1), pages 1-17, December.
    2. Nan Zheng & Yongchao Cai & Zehua Zhang & Huimin Zhou & Yu Deng & Shuang Du & Mai Tu & Wei Fang & Xiaole Xia, 2025. "Tailoring industrial enzymes for thermostability and activity evolution by the machine learning-based iCASE strategy," Nature Communications, Nature, vol. 16(1), pages 1-13, December.
    3. Ziyi Zhou & Liang Zhang & Yuanxi Yu & Banghao Wu & Mingchen Li & Liang Hong & Pan Tan, 2024. "Enhancing efficiency of protein language models with minimal wet-lab data through few-shot learning," Nature Communications, Nature, vol. 15(1), pages 1-13, December.
    4. Kerr Ding & Jiaqi Luo & Yunan Luo, 2024. "Leveraging conformal prediction to annotate enzyme function space with limited false positives," PLOS Computational Biology, Public Library of Science, vol. 20(5), pages 1-21, May.
    5. Kerr Ding & Michael Chin & Yunlong Zhao & Wei Huang & Binh Khanh Mai & Huanan Wang & Peng Liu & Yang Yang & Yunan Luo, 2024. "Machine learning-guided co-optimization of fitness and diversity facilitates combinatorial library design in enzyme engineering," Nature Communications, Nature, vol. 15(1), pages 1-13, December.
    6. Zixuan Fan & Yan Xu, 2024. "Predicting the Functional Changes in Protein Mutations Through the Application of BiLSTM and the Self-Attention Mechanism," Annals of Data Science, Springer, vol. 11(3), pages 1077-1094, June.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:16:y:2025:i:1:d:10.1038_s41467-025-63412-3. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.