IDEAS home Printed from https://ideas.repec.org/a/nat/nature/v602y2022i7897d10.1038_s41586-021-04383-5.html
   My bibliography  Save this article

A backbone-centred energy function of neural networks for protein design

Author

Listed:
  • Bin Huang

    (University of Science and Technology of China)

  • Yang Xu

    (University of Science and Technology of China)

  • Xiuhong Hu

    (University of Science and Technology of China)

  • Yongrui Liu

    (University of Science and Technology of China)

  • Shanhui Liao

    (University of Science and Technology of China)

  • Jiahai Zhang

    (University of Science and Technology of China)

  • Chengdong Huang

    (University of Science and Technology of China
    University of Science and Technology of China)

  • Jingjun Hong

    (University of Science and Technology of China)

  • Quan Chen

    (University of Science and Technology of China
    University of Science and Technology of China)

  • Haiyan Liu

    (University of Science and Technology of China
    University of Science and Technology of China
    University of Science and Technology of China)

Abstract

A protein backbone structure is designable if a substantial number of amino acid sequences exist that autonomously fold into it1,2. It has been suggested that the designability of backbones is governed mainly by side chain-independent or side chain type-insensitive molecular interactions3–5, indicating an approach for designing new backbones (ready for amino acid selection) based on continuous sampling and optimization of the backbone-centred energy surface. However, a sufficiently comprehensive and precise energy function has yet to be established for this purpose. Here we show that this goal is met by a statistical model named SCUBA (for Side Chain-Unknown Backbone Arrangement) that uses neural network-form energy terms. These terms are learned with a two-step approach that comprises kernel density estimation followed by neural network training and can analytically represent multidimensional, high-order correlations in known protein structures. We report the crystal structures of nine de novo proteins whose backbones were designed to high precision using SCUBA, four of which have novel, non-natural overall architectures. By eschewing use of fragments from existing protein structures, SCUBA-driven structure design facilitates far-reaching exploration of the designable backbone space, thus extending the novelty and diversity of the proteins amenable to de novo design.

Suggested Citation

  • Bin Huang & Yang Xu & Xiuhong Hu & Yongrui Liu & Shanhui Liao & Jiahai Zhang & Chengdong Huang & Jingjun Hong & Quan Chen & Haiyan Liu, 2022. "A backbone-centred energy function of neural networks for protein design," Nature, Nature, vol. 602(7897), pages 523-528, February.
  • Handle: RePEc:nat:nature:v:602:y:2022:i:7897:d:10.1038_s41586-021-04383-5
    DOI: 10.1038/s41586-021-04383-5
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41586-021-04383-5
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1038/s41586-021-04383-5?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Shuangjia Zheng & Tao Zeng & Chengtao Li & Binghong Chen & Connor W. Coley & Yuedong Yang & Ruibo Wu, 2022. "Deep learning driven biosynthetic pathways navigation for natural products with BioNavi-NP," Nature Communications, Nature, vol. 13(1), pages 1-9, December.
    2. Peicong Lin & Yumeng Yan & Huanyu Tao & Sheng-You Huang, 2023. "Deep transfer learning for inter-chain contact predictions of transmembrane protein complexes," Nature Communications, Nature, vol. 14(1), pages 1-16, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:nature:v:602:y:2022:i:7897:d:10.1038_s41586-021-04383-5. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.