IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v15y2024i1d10.1038_s41467-024-46203-0.html
   My bibliography  Save this article

Machine learning-aided design and screening of an emergent protein function in synthetic cells

Author

Listed:
  • Shunshi Kohyama

    (Max Planck Institute of Biochemistry)

  • Béla P. Frohn

    (Max Planck Institute of Biochemistry)

  • Leon Babl

    (Max Planck Institute of Biochemistry)

  • Petra Schwille

    (Max Planck Institute of Biochemistry)

Abstract

Recently, utilization of Machine Learning (ML) has led to astonishing progress in computational protein design, bringing into reach the targeted engineering of proteins for industrial and biomedical applications. However, the design of proteins for emergent functions of core relevance to cells, such as the ability to spatiotemporally self-organize and thereby structure the cellular space, is still extremely challenging. While on the generative side conditional generative models and multi-state design are on the rise, for emergent functions there is a lack of tailored screening methods as typically needed in a protein design project, both computational and experimental. Here we describe a proof-of-principle of how such screening, in silico and in vitro, can be achieved for ML-generated variants of a protein that forms intracellular spatiotemporal patterns. For computational screening we use a structure-based divide-and-conquer approach to find the most promising candidates, while for the subsequent in vitro screening we use synthetic cell-mimics as established by Bottom-Up Synthetic Biology. We then show that the best screened candidate can indeed completely substitute the wildtype gene in Escherichia coli. These results raise great hopes for the next level of synthetic biology, where ML-designed synthetic proteins will be used to engineer cellular functions.

Suggested Citation

  • Shunshi Kohyama & Béla P. Frohn & Leon Babl & Petra Schwille, 2024. "Machine learning-aided design and screening of an emergent protein function in synthetic cells," Nature Communications, Nature, vol. 15(1), pages 1-14, December.
  • Handle: RePEc:nat:natcom:v:15:y:2024:i:1:d:10.1038_s41467-024-46203-0
    DOI: 10.1038/s41467-024-46203-0
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-024-46203-0
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-024-46203-0?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Elisa Godino & Jonás Noguera López & David Foschepoth & Céline Cleij & Anne Doerr & Clara Ferrer Castellà & Christophe Danelon, 2019. "De novo synthesized Min proteins drive oscillatory liposome deformation and regulate FtsA-FtsZ cytoskeletal patterns," Nature Communications, Nature, vol. 10(1), pages 1-12, December.
    2. Shunshi Kohyama & Adrián Merino-Salomón & Petra Schwille, 2022. "In vitro assembly, positioning and contraction of a division ring in minimal cells," Nature Communications, Nature, vol. 13(1), pages 1-14, December.
    3. Alex Hawkins-Hooker & Florence Depardieu & Sebastien Baur & Guillaume Couairon & Arthur Chen & David Bikard, 2021. "Generating functional protein variants with variational autoencoders," PLOS Computational Biology, Public Library of Science, vol. 17(2), pages 1-23, February.
    4. Michael Socolich & Steve W. Lockless & William P. Russ & Heather Lee & Kevin H. Gardner & Rama Ranganathan, 2005. "Evolutionary information for specifying a protein fold," Nature, Nature, vol. 437(7058), pages 512-518, September.
    5. Vladimir Gligorijević & P. Douglas Renfrew & Tomasz Kosciolek & Julia Koehler Leman & Daniel Berenberg & Tommi Vatanen & Chris Chandler & Bryn C. Taylor & Ian M. Fisk & Hera Vlamakis & Ramnik J. Xavie, 2021. "Structure-based protein function prediction using graph convolutional networks," Nature Communications, Nature, vol. 12(1), pages 1-14, December.
    6. Po-Ssu Huang & Scott E. Boyken & David Baker, 2016. "The coming of age of de novo protein design," Nature, Nature, vol. 537(7620), pages 320-327, September.
    7. Joseph L. Watson & David Juergens & Nathaniel R. Bennett & Brian L. Trippe & Jason Yim & Helen E. Eisenach & Woody Ahern & Andrew J. Borst & Robert J. Ragotte & Lukas F. Milles & Basile I. M. Wicky & , 2023. "De novo design of protein structure and function with RFdiffusion," Nature, Nature, vol. 620(7976), pages 1089-1100, August.
    8. Hongyuan Lu & Daniel J. Diaz & Natalie J. Czarnecki & Congzhi Zhu & Wantae Kim & Raghav Shroff & Daniel J. Acosta & Bradley R. Alexander & Hannah O. Cole & Yan Zhang & Nathaniel A. Lynd & Andrew D. El, 2022. "Machine learning-aided engineering of hydrolases for PET depolymerization," Nature, Nature, vol. 604(7907), pages 662-667, April.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Noelia Ferruz & Steffen Schmidt & Birte Höcker, 2022. "ProtGPT2 is a deep unsupervised language model for protein design," Nature Communications, Nature, vol. 13(1), pages 1-10, December.
    2. Simon d’Oelsnitz & Daniel J. Diaz & Wantae Kim & Daniel J. Acosta & Tyler L. Dangerfield & Mason W. Schechter & Matthew B. Minus & James R. Howard & Hannah Do & James M. Loy & Hal S. Alper & Y. Jessie, 2024. "Biosensor and machine learning-aided engineering of an amaryllidaceae enzyme," Nature Communications, Nature, vol. 15(1), pages 1-14, December.
    3. Francisco McGee & Sandro Hauri & Quentin Novinger & Slobodan Vucetic & Ronald M. Levy & Vincenzo Carnevale & Allan Haldane, 2021. "The generative capacity of probabilistic protein sequence models," Nature Communications, Nature, vol. 12(1), pages 1-14, December.
    4. Amir Pandi & David Adam & Amir Zare & Van Tuan Trinh & Stefan L. Schaefer & Marie Burt & Björn Klabunde & Elizaveta Bobkova & Manish Kushwaha & Yeganeh Foroughijabbari & Peter Braun & Christoph Spahn , 2023. "Cell-free biosynthesis combined with deep learning accelerates de novo-development of antimicrobial peptides," Nature Communications, Nature, vol. 14(1), pages 1-14, December.
    5. Thomas W. Linsky & Kyle Noble & Autumn R. Tobin & Rachel Crow & Lauren Carter & Jeffrey L. Urbauer & David Baker & Eva-Maria Strauch, 2022. "Sampling of structure and sequence space of small protein folds," Nature Communications, Nature, vol. 13(1), pages 1-11, December.
    6. Shubin Li & Yingming Zhao & Shuqi Wu & Xiangxiang Zhang & Boyu Yang & Liangfei Tian & Xiaojun Han, 2023. "Regulation of species metabolism in synthetic community systems by environmental pH oscillations," Nature Communications, Nature, vol. 14(1), pages 1-10, December.
    7. Yasser Roudi & Sheila Nirenberg & Peter E Latham, 2009. "Pairwise Maximum Entropy Models for Studying Large Biological Systems: When They Can Work and When They Can't," PLOS Computational Biology, Public Library of Science, vol. 5(5), pages 1-18, May.
    8. Shou-Wen Wang & Anne-Florence Bitbol & Ned S Wingreen, 2019. "Revealing evolutionary constraints on proteins through sequence analysis," PLOS Computational Biology, Public Library of Science, vol. 15(4), pages 1-16, April.
    9. Ziqi Gao & Chenran Jiang & Jiawen Zhang & Xiaosen Jiang & Lanqing Li & Peilin Zhao & Huanming Yang & Yong Huang & Jia Li, 2023. "Hierarchical graph learning for protein–protein interaction," Nature Communications, Nature, vol. 14(1), pages 1-12, December.
    10. Xinlei Wei & Xue Yang & Congcong Hu & Qiangzi Li & Qianqian Liu & Yue Wu & Leipeng Xie & Xiao Ning & Fei Li & Tao Cai & Zhiguang Zhu & Yi-Heng P. Job Zhang & Yanfei Zhang & Xuejun Chen & Chun You, 2024. "ATP-free in vitro biotransformation of starch-derived maltodextrin into poly-3-hydroxybutyrate via acetyl-CoA," Nature Communications, Nature, vol. 15(1), pages 1-14, December.
    11. Sabrina Meindlhumer & Fridtjof Brauns & Jernej Rudi Finžgar & Jacob Kerssemakers & Cees Dekker & Erwin Frey, 2023. "Directing Min protein patterns with advective bulk flow," Nature Communications, Nature, vol. 14(1), pages 1-10, December.
    12. Youying Mu & Chengzhuo Duan & Xin Li & Yongbo Wu, 2023. "A Monitoring Method for Corporate Environmental Performance Based on Data Fusion in China under the Double Carbon Target," Sustainability, MDPI, vol. 15(12), pages 1-16, June.
    13. P. Konstantin Richter & Paula Blázquez-Sánchez & Ziyue Zhao & Felipe Engelberger & Christian Wiebeler & Georg Künze & Ronny Frank & Dana Krinke & Emanuele Frezzotti & Yuliia Lihanova & Patricia Falken, 2023. "Structure and function of the metagenomic plastic-degrading polyester hydrolase PHL7 bound to its product," Nature Communications, Nature, vol. 14(1), pages 1-11, December.
    14. Jordan Yang & Nandita Naik & Jagdish Suresh Patel & Christopher S Wylie & Wenze Gu & Jessie Huang & F Marty Ytreberg & Mandar T Naik & Daniel M Weinreich & Brenda M Rubenstein, 2020. "Predicting the viability of beta-lactamase: How folding and binding free energies correlate with beta-lactamase fitness," PLOS ONE, Public Library of Science, vol. 15(5), pages 1-26, May.
    15. Agnese I. Curatolo & Ofer Kimchi & Carl P. Goodrich & Ryan K. Krueger & Michael P. Brenner, 2023. "A computational toolbox for the assembly yield of complex and heterogeneous structures," Nature Communications, Nature, vol. 14(1), pages 1-13, December.
    16. Wei Lu & Jixian Zhang & Weifeng Huang & Ziqiao Zhang & Xiangyu Jia & Zhenyu Wang & Leilei Shi & Chengtao Li & Peter G. Wolynes & Shuangjia Zheng, 2024. "DynamicBind: predicting ligand-specific protein-ligand complex structure with a deep equivariant generative model," Nature Communications, Nature, vol. 15(1), pages 1-13, December.
    17. Biao Ruan & Yanan He & Yingwei Chen & Eun Jung Choi & Yihong Chen & Dana Motabar & Tsega Solomon & Richard Simmerman & Thomas Kauffman & D. Travis Gallagher & John Orban & Philip N. Bryan, 2023. "Design and characterization of a protein fold switching network," Nature Communications, Nature, vol. 14(1), pages 1-14, December.
    18. Yinglu Cui & Yanchun Chen & Jinyuan Sun & Tong Zhu & Hua Pang & Chunli Li & Wen-Chao Geng & Bian Wu, 2024. "Computational redesign of a hydrolase for nearly complete PET depolymerization at industrially relevant high-solids loading," Nature Communications, Nature, vol. 15(1), pages 1-12, December.
    19. Teng Bao & Yuanchao Qian & Yongping Xin & James J. Collins & Ting Lu, 2023. "Engineering microbial division of labor for plastic upcycling," Nature Communications, Nature, vol. 14(1), pages 1-13, December.
    20. Pengfei Tian & Robert B Best, 2020. "Exploring the sequence fitness landscape of a bridge between protein folds," PLOS Computational Biology, Public Library of Science, vol. 16(10), pages 1-19, October.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:15:y:2024:i:1:d:10.1038_s41467-024-46203-0. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.