IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v15y2024i1d10.1038_s41467-024-46203-0.html
   My bibliography  Save this article

Machine learning-aided design and screening of an emergent protein function in synthetic cells

Author

Listed:
  • Shunshi Kohyama

    (Max Planck Institute of Biochemistry)

  • Béla P. Frohn

    (Max Planck Institute of Biochemistry)

  • Leon Babl

    (Max Planck Institute of Biochemistry)

  • Petra Schwille

    (Max Planck Institute of Biochemistry)

Abstract

Recently, utilization of Machine Learning (ML) has led to astonishing progress in computational protein design, bringing into reach the targeted engineering of proteins for industrial and biomedical applications. However, the design of proteins for emergent functions of core relevance to cells, such as the ability to spatiotemporally self-organize and thereby structure the cellular space, is still extremely challenging. While on the generative side conditional generative models and multi-state design are on the rise, for emergent functions there is a lack of tailored screening methods as typically needed in a protein design project, both computational and experimental. Here we describe a proof-of-principle of how such screening, in silico and in vitro, can be achieved for ML-generated variants of a protein that forms intracellular spatiotemporal patterns. For computational screening we use a structure-based divide-and-conquer approach to find the most promising candidates, while for the subsequent in vitro screening we use synthetic cell-mimics as established by Bottom-Up Synthetic Biology. We then show that the best screened candidate can indeed completely substitute the wildtype gene in Escherichia coli. These results raise great hopes for the next level of synthetic biology, where ML-designed synthetic proteins will be used to engineer cellular functions.

Suggested Citation

  • Shunshi Kohyama & Béla P. Frohn & Leon Babl & Petra Schwille, 2024. "Machine learning-aided design and screening of an emergent protein function in synthetic cells," Nature Communications, Nature, vol. 15(1), pages 1-14, December.
  • Handle: RePEc:nat:natcom:v:15:y:2024:i:1:d:10.1038_s41467-024-46203-0
    DOI: 10.1038/s41467-024-46203-0
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-024-46203-0
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-024-46203-0?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Elisa Godino & Jonás Noguera López & David Foschepoth & Céline Cleij & Anne Doerr & Clara Ferrer Castellà & Christophe Danelon, 2019. "De novo synthesized Min proteins drive oscillatory liposome deformation and regulate FtsA-FtsZ cytoskeletal patterns," Nature Communications, Nature, vol. 10(1), pages 1-12, December.
    2. Shunshi Kohyama & Adrián Merino-Salomón & Petra Schwille, 2022. "In vitro assembly, positioning and contraction of a division ring in minimal cells," Nature Communications, Nature, vol. 13(1), pages 1-14, December.
    3. Alex Hawkins-Hooker & Florence Depardieu & Sebastien Baur & Guillaume Couairon & Arthur Chen & David Bikard, 2021. "Generating functional protein variants with variational autoencoders," PLOS Computational Biology, Public Library of Science, vol. 17(2), pages 1-23, February.
    4. Michael Socolich & Steve W. Lockless & William P. Russ & Heather Lee & Kevin H. Gardner & Rama Ranganathan, 2005. "Evolutionary information for specifying a protein fold," Nature, Nature, vol. 437(7058), pages 512-518, September.
    5. Vladimir Gligorijević & P. Douglas Renfrew & Tomasz Kosciolek & Julia Koehler Leman & Daniel Berenberg & Tommi Vatanen & Chris Chandler & Bryn C. Taylor & Ian M. Fisk & Hera Vlamakis & Ramnik J. Xavie, 2021. "Structure-based protein function prediction using graph convolutional networks," Nature Communications, Nature, vol. 12(1), pages 1-14, December.
    6. Po-Ssu Huang & Scott E. Boyken & David Baker, 2016. "The coming of age of de novo protein design," Nature, Nature, vol. 537(7620), pages 320-327, September.
    7. Joseph L. Watson & David Juergens & Nathaniel R. Bennett & Brian L. Trippe & Jason Yim & Helen E. Eisenach & Woody Ahern & Andrew J. Borst & Robert J. Ragotte & Lukas F. Milles & Basile I. M. Wicky & , 2023. "De novo design of protein structure and function with RFdiffusion," Nature, Nature, vol. 620(7976), pages 1089-1100, August.
    8. Hongyuan Lu & Daniel J. Diaz & Natalie J. Czarnecki & Congzhi Zhu & Wantae Kim & Raghav Shroff & Daniel J. Acosta & Bradley R. Alexander & Hannah O. Cole & Yan Zhang & Nathaniel A. Lynd & Andrew D. El, 2022. "Machine learning-aided engineering of hydrolases for PET depolymerization," Nature, Nature, vol. 604(7907), pages 662-667, April.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Noelia Ferruz & Steffen Schmidt & Birte Höcker, 2022. "ProtGPT2 is a deep unsupervised language model for protein design," Nature Communications, Nature, vol. 13(1), pages 1-10, December.
    2. Amir Pandi & David Adam & Amir Zare & Van Tuan Trinh & Stefan L. Schaefer & Marie Burt & Björn Klabunde & Elizaveta Bobkova & Manish Kushwaha & Yeganeh Foroughijabbari & Peter Braun & Christoph Spahn , 2023. "Cell-free biosynthesis combined with deep learning accelerates de novo-development of antimicrobial peptides," Nature Communications, Nature, vol. 14(1), pages 1-14, December.
    3. Simon d’Oelsnitz & Daniel J. Diaz & Wantae Kim & Daniel J. Acosta & Tyler L. Dangerfield & Mason W. Schechter & Matthew B. Minus & James R. Howard & Hannah Do & James M. Loy & Hal S. Alper & Y. Jessie, 2024. "Biosensor and machine learning-aided engineering of an amaryllidaceae enzyme," Nature Communications, Nature, vol. 15(1), pages 1-14, December.
    4. Francisco McGee & Sandro Hauri & Quentin Novinger & Slobodan Vucetic & Ronald M. Levy & Vincenzo Carnevale & Allan Haldane, 2021. "The generative capacity of probabilistic protein sequence models," Nature Communications, Nature, vol. 12(1), pages 1-14, December.
    5. Yasser Roudi & Sheila Nirenberg & Peter E Latham, 2009. "Pairwise Maximum Entropy Models for Studying Large Biological Systems: When They Can Work and When They Can't," PLOS Computational Biology, Public Library of Science, vol. 5(5), pages 1-18, May.
    6. Simeon D. Castle & Michiel Stock & Thomas E. Gorochowski, 2024. "Engineering is evolution: a perspective on design processes to engineer biology," Nature Communications, Nature, vol. 15(1), pages 1-10, December.
    7. Xinlei Wei & Xue Yang & Congcong Hu & Qiangzi Li & Qianqian Liu & Yue Wu & Leipeng Xie & Xiao Ning & Fei Li & Tao Cai & Zhiguang Zhu & Yi-Heng P. Job Zhang & Yanfei Zhang & Xuejun Chen & Chun You, 2024. "ATP-free in vitro biotransformation of starch-derived maltodextrin into poly-3-hydroxybutyrate via acetyl-CoA," Nature Communications, Nature, vol. 15(1), pages 1-14, December.
    8. Agnese I. Curatolo & Ofer Kimchi & Carl P. Goodrich & Ryan K. Krueger & Michael P. Brenner, 2023. "A computational toolbox for the assembly yield of complex and heterogeneous structures," Nature Communications, Nature, vol. 14(1), pages 1-13, December.
    9. Biao Ruan & Yanan He & Yingwei Chen & Eun Jung Choi & Yihong Chen & Dana Motabar & Tsega Solomon & Richard Simmerman & Thomas Kauffman & D. Travis Gallagher & John Orban & Philip N. Bryan, 2023. "Design and characterization of a protein fold switching network," Nature Communications, Nature, vol. 14(1), pages 1-14, December.
    10. Teng Bao & Yuanchao Qian & Yongping Xin & James J. Collins & Ting Lu, 2023. "Engineering microbial division of labor for plastic upcycling," Nature Communications, Nature, vol. 14(1), pages 1-13, December.
    11. Erik van Nimwegen, 2016. "Inferring Contacting Residues within and between Proteins: What Do the Probabilities Mean?," PLOS Computational Biology, Public Library of Science, vol. 12(5), pages 1-10, May.
    12. Hwaseok Hong & Dongwoo Ki & Hogyun Seo & Jiyoung Park & Jaewon Jang & Kyung-Jin Kim, 2023. "Discovery and rational engineering of PET hydrolase with both mesophilic and thermophilic PET hydrolase properties," Nature Communications, Nature, vol. 14(1), pages 1-13, December.
    13. Zhuozhi Chen & Rongdi Duan & Yunjie Xiao & Yi Wei & Hanxiao Zhang & Xinzhao Sun & Shen Wang & Yingying Cheng & Xue Wang & Shanwei Tong & Yunxiao Yao & Cheng Zhu & Haitao Yang & Yanyan Wang & Zefang Wa, 2022. "Biodegradation of highly crystallized poly(ethylene terephthalate) through cell surface codisplay of bacterial PETase and hydrophobin," Nature Communications, Nature, vol. 13(1), pages 1-17, December.
    14. Anton Kocheturov & Panos M. Pardalos & Athanasia Karakitsiou, 2019. "Massive datasets and machine learning for computational biomedicine: trends and challenges," Annals of Operations Research, Springer, vol. 276(1), pages 5-34, May.
    15. Jennifer L Lahti & Adam P Silverman & Jennifer R Cochran, 2009. "Interrogating and Predicting Tolerated Sequence Diversity in Protein Folds: Application to E. elaterium Trypsin Inhibitor-II Cystine-Knot Miniprotein," PLOS Computational Biology, Public Library of Science, vol. 5(9), pages 1-15, September.
    16. Katarzyna Świderek & Susana Velasco-Lozano & Miquel À. Galmés & Ion Olazabal & Haritz Sardon & Fernando López-Gallego & Vicent Moliner, 2023. "Mechanistic studies of a lipase unveil effect of pH on hydrolysis products of small PET modules," Nature Communications, Nature, vol. 14(1), pages 1-10, December.
    17. Anni Li & Yijie Sheng & Haiyang Cui & Minghui Wang & Luxuan Wu & Yibo Song & Rongrong Yang & Xiujuan Li & He Huang, 2023. "Discovery and mechanism-guided engineering of BHET hydrolases for improved PET recycling and upcycling," Nature Communications, Nature, vol. 14(1), pages 1-16, December.
    18. Samuel Miravet-Verde & Rocco Mazzolini & Carolina Segura-Morales & Alicia Broto & Maria Lluch-Senar & Luis Serrano, 2024. "ProTInSeq: transposon insertion tracking by ultra-deep DNA sequencing to identify translated large and small ORFs," Nature Communications, Nature, vol. 15(1), pages 1-17, December.
    19. Fatima A. Davila-Hernandez & Biao Jin & Harley Pyles & Shuai Zhang & Zheming Wang & Timothy F. Huddy & Asim K. Bera & Alex Kang & Chun-Long Chen & James J. Yoreo & David Baker, 2023. "Directing polymorph specific calcium carbonate formation with de novo protein templates," Nature Communications, Nature, vol. 14(1), pages 1-11, December.
    20. Smrithi Krishnan R & Kalyanashis Jana & Amina H. Shaji & Karthika S. Nair & Anjali Devi Das & Devika Vikraman & Harsha Bajaj & Ulrich Kleinekathöfer & Kozhinjampara R. Mahendran, 2022. "Assembly of transmembrane pores from mirror-image peptides," Nature Communications, Nature, vol. 13(1), pages 1-13, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:15:y:2024:i:1:d:10.1038_s41467-024-46203-0. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.