IDEAS home Printed from https://ideas.repec.org/a/plo/pcbi00/1003298.html
   My bibliography  Save this article

An Evolution-Based Approach to De Novo Protein Design and Case Study on Mycobacterium tuberculosis

Author

Listed:
  • Pralay Mitra
  • David Shultis
  • Jeffrey R Brender
  • Jeff Czajka
  • David Marsh
  • Felicia Gray
  • Tomasz Cierpicki
  • Yang Zhang

Abstract

Computational protein design is a reverse procedure of protein folding and structure prediction, where constructing structures from evolutionarily related proteins has been demonstrated to be the most reliable method for protein 3-dimensional structure prediction. Following this spirit, we developed a novel method to design new protein sequences based on evolutionarily related protein families. For a given target structure, a set of proteins having similar fold are identified from the PDB library by structural alignments. A structural profile is then constructed from the protein templates and used to guide the conformational search of amino acid sequence space, where physicochemical packing is accommodated by single-sequence based solvation, torsion angle, and secondary structure predictions. The method was tested on a computational folding experiment based on a large set of 87 protein structures covering different fold classes, which showed that the evolution-based design significantly enhances the foldability and biological functionality of the designed sequences compared to the traditional physics-based force field methods. Without using homologous proteins, the designed sequences can be folded with an average root-mean-square-deviation of 2.1 Å to the target. As a case study, the method is extended to redesign all 243 structurally resolved proteins in the pathogenic bacteria Mycobacterium tuberculosis, which is the second leading cause of death from infectious disease. On a smaller scale, five sequences were randomly selected from the design pool and subjected to experimental validation. The results showed that all the designed proteins are soluble with distinct secondary structure and three have well ordered tertiary structure, as demonstrated by circular dichroism and NMR spectroscopy. Together, these results demonstrate a new avenue in computational protein design that uses knowledge of evolutionary conservation from protein structural families to engineer new protein molecules of improved fold stability and biological functionality.Author Summary: The goal of computational protein design is to create new protein sequences of desirable structure and biological function. Most protein design methods are developed to search for sequences with the lowest free-energy based on physics-based force fields following Anfinsen's thermodynamic hypothesis. A major obstacle of such approaches is the inaccuracy of the force-field design, which cannot accurately describe atomic interactions or correctly recognize protein folds. We propose a novel method which uses evolutionary information, in the form of sequence profiles from structure families, to guide the sequence design. Since sequence profiles are generally more accurate than physics-based potentials in protein fold recognition, a unique advantage lies on that it targets the design procedure to a family of protein sequence profiles to enhance the robustness of designed sequences. The method was tested on 87 proteins and the designed sequences can be folded by I-TASSER to models with an average RMSD 2.1 Å. As a case study of large-scale application, the method is extended to redesign all structurally resolved proteins in the human pathogenic bacteria, Mycobacterium tuberculosis. Five sequences varying in fold and sizes were characterized by circular dichroism and NMR spectroscopy experiments and three were shown to have ordered tertiary structure.

Suggested Citation

  • Pralay Mitra & David Shultis & Jeffrey R Brender & Jeff Czajka & David Marsh & Felicia Gray & Tomasz Cierpicki & Yang Zhang, 2013. "An Evolution-Based Approach to De Novo Protein Design and Case Study on Mycobacterium tuberculosis," PLOS Computational Biology, Public Library of Science, vol. 9(10), pages 1-18, October.
  • Handle: RePEc:plo:pcbi00:1003298
    DOI: 10.1371/journal.pcbi.1003298
    as

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1003298
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1003298&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pcbi.1003298?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Nobuyasu Koga & Rie Tatsumi-Koga & Gaohua Liu & Rong Xiao & Thomas B. Acton & Gaetano T. Montelione & David Baker, 2012. "Principles for designing ideal protein structures," Nature, Nature, vol. 491(7423), pages 222-227, November.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jorge Roel-Touris & Marta Nadal & Enrique Marcos, 2023. "Single-chain dimers from de novo immunoglobulins as robust scaffolds for multiple binding loops," Nature Communications, Nature, vol. 14(1), pages 1-15, December.
    2. Anindya Roy & Lei Shi & Ashley Chang & Xianchi Dong & Andres Fernandez & John C. Kraft & Jing Li & Viet Q. Le & Rebecca Viazzo Winegar & Gerald Maxwell Cherf & Dean Slocum & P. Daniel Poulson & Garret, 2023. "De novo design of highly selective miniprotein inhibitors of integrins αvβ6 and αvβ8," Nature Communications, Nature, vol. 14(1), pages 1-18, December.
    3. Hiroto Murata & Hayao Imakawa & Nobuyasu Koga & George Chikenji, 2021. "The register shift rules for βαβ-motifs for de novo protein design," PLOS ONE, Public Library of Science, vol. 16(8), pages 1-24, August.
    4. Jaume Bonet & Sarah Wehrle & Karen Schriever & Che Yang & Anne Billet & Fabian Sesterhenn & Andreas Scheck & Freyr Sverrisson & Barbora Veselkova & Sabrina Vollers & Roxanne Lourman & Mélanie Villard , 2018. "Rosetta FunFolDes – A general framework for the computational design of functional proteins," PLOS Computational Biology, Public Library of Science, vol. 14(11), pages 1-30, November.
    5. Sagar D Khare & Timothy A Whitehead, 2015. "Introduction to the Rosetta Special Collection," PLOS ONE, Public Library of Science, vol. 10(12), pages 1-5, December.
    6. Willow Coyote-Maestas & David Nedrud & Antonio Suma & Yungui He & Kenneth A. Matreyek & Douglas M. Fowler & Vincenzo Carnevale & Chad L. Myers & Daniel Schmidt, 2021. "Probing ion channel functional architecture and domain recombination compatibility by massively parallel domain insertion profiling," Nature Communications, Nature, vol. 12(1), pages 1-16, December.
    7. Thomas W. Linsky & Kyle Noble & Autumn R. Tobin & Rachel Crow & Lauren Carter & Jeffrey L. Urbauer & David Baker & Eva-Maria Strauch, 2022. "Sampling of structure and sequence space of small protein folds," Nature Communications, Nature, vol. 13(1), pages 1-11, December.
    8. Marc Corrales & Pol Cuscó & Dinara R Usmanova & Heng-Chang Chen & Natalya S Bogatyreva & Guillaume J Filion & Dmitry N Ivankov, 2015. "Machine Learning: How Much Does It Tell about Protein Folding Rates?," PLOS ONE, Public Library of Science, vol. 10(11), pages 1-12, November.
    9. Zsolt Fazekas & Dóra K. Menyhárd & András Perczel, 2024. "LoCoHD: a metric for comparing local environments of proteins," Nature Communications, Nature, vol. 15(1), pages 1-14, December.
    10. Kozyrev, S.V. & Volovich, I.V., 2014. "Quinary lattice model of secondary structures of polymers," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 393(C), pages 86-95.
    11. Lindsey A. Doyle & Brittany Takushi & Ryan D. Kibler & Lukas F. Milles & Carolina T. Orozco & Jonathan D. Jones & Sophie E. Jackson & Barry L. Stoddard & Philip Bradley, 2023. "De novo design of knotted tandem repeat proteins," Nature Communications, Nature, vol. 14(1), pages 1-17, December.
    12. Tamuka M. Chidyausiku & Soraia R. Mendes & Jason C. Klima & Marta Nadal & Ulrich Eckhard & Jorge Roel-Touris & Scott Houliston & Tibisay Guevara & Hugh K. Haddox & Adam Moyer & Cheryl H. Arrowsmith & , 2022. "De novo design of immunoglobulin-like domains," Nature Communications, Nature, vol. 13(1), pages 1-14, December.
    13. Fabian Sesterhenn & Che Yang & Jaume Bonet & Johannes T. Cramer & Xiaolin Wen & Yimeng Wang & Chi I. Chiang & Luciano Andres Abriata & Iga Kucharska & Giacomo Castoro & Sabrina S. Vollers & Marie Gall, 2020. "De novo protein design enables the precise induction of RSV-neutralizing antibodies," Post-Print hal-02677103, HAL.
    14. Rebecca F Alford & Andrew Leaver-Fay & Lynda Gonzales & Erin L Dolan & Jeffrey J Gray, 2017. "A cyber-linked undergraduate research experience in computational biomolecular structure prediction and design," PLOS Computational Biology, Public Library of Science, vol. 13(12), pages 1-13, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1003298. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.