IDEAS home Printed from https://ideas.repec.org/a/plo/pcbi00/1008197.html
   My bibliography  Save this article

Integrating structure-based machine learning and co-evolution to investigate specificity in plant sesquiterpene synthases

Author

Listed:
  • Janani Durairaj
  • Elena Melillo
  • Harro J Bouwmeester
  • Jules Beekwilder
  • Dick de Ridder
  • Aalt D J van Dijk

Abstract

Sesquiterpene synthases (STSs) catalyze the formation of a large class of plant volatiles called sesquiterpenes. While thousands of putative STS sequences from diverse plant species are available, only a small number of them have been functionally characterized. Sequence identity-based screening for desired enzymes, often used in biotechnological applications, is difficult to apply here as STS sequence similarity is strongly affected by species. This calls for more sophisticated computational methods for functionality prediction. We investigate the specificity of precursor cation formation in these elusive enzymes. By inspecting multi-product STSs, we demonstrate that STSs have a strong selectivity towards one precursor cation. We use a machine learning approach combining sequence and structure information to accurately predict precursor cation specificity for STSs across all plant species. We combine this with a co-evolutionary analysis on the wealth of uncharacterized putative STS sequences, to pinpoint residues and distant functional contacts influencing cation formation and reaction pathway selection. These structural factors can be used to predict and engineer enzymes with specific functions, as we demonstrate by predicting and characterizing two novel STSs from Citrus bergamia.Author summary: Predicting enzyme function is a popular problem in the bioinformatics field that grows more pressing with the increase in protein sequences, and more attainable with the increase in experimentally characterized enzymes. Terpenes and terpenoids form the largest classes of natural products and find use in many drugs, flavouring agents, and perfumes. Terpene synthases catalyze the biosynthesis of terpenes via multiple cyclizations and carbocation rearrangements, generating a vast array of product skeletons. In this work, we present a three-pronged computational approach to predict carbocation specificity in sesquiterpene synthases, a subset of terpene synthases with one of the highest diversities of products. Using homology modelling, machine learning and co-evolutionary analysis, our approach combines sparse structural data, large amounts of uncharacterized sequence data, and the current set of experimentally characterized enzymes to provide insight into residues and structural regions that likely play a role in determining product specificity. Similar techniques can be re-purposed for function prediction and enzyme engineering in many other classes of enzymes.

Suggested Citation

  • Janani Durairaj & Elena Melillo & Harro J Bouwmeester & Jules Beekwilder & Dick de Ridder & Aalt D J van Dijk, 2021. "Integrating structure-based machine learning and co-evolution to investigate specificity in plant sesquiterpene synthases," PLOS Computational Biology, Public Library of Science, vol. 17(3), pages 1-21, March.
  • Handle: RePEc:plo:pcbi00:1008197
    DOI: 10.1371/journal.pcbi.1008197
    as

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1008197
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1008197&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pcbi.1008197?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Melissa Salmon & Caroline Laurendon & Maria Vardakou & Jitender Cheema & Marianne Defernez & Sol Green & Juan A. Faraldos & Paul E. O’Maille, 2015. "Emergence of terpene cyclization in Artemisia annua," Nature Communications, Nature, vol. 6(1), pages 1-10, May.
    2. Yasuo Yoshikuni & Thomas E. Ferrin & Jay D. Keasling, 2006. "Designed divergent evolution of enzyme function," Nature, Nature, vol. 440(7087), pages 1078-1082, April.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Codruta Ignea & Morten H. Raadam & Aikaterini Koutsaviti & Yong Zhao & Yao-Tao Duan & Maria Harizani & Karel Miettinen & Panagiota Georgantea & Mads Rosenfeldt & Sara E. Viejo-Ledesma & Mikael A. Pete, 2022. "Expanding the terpene biosynthetic code with non-canonical 16 carbon atom building blocks," Nature Communications, Nature, vol. 13(1), pages 1-16, December.
    2. Boxue Tian & C Dale Poulter & Matthew P Jacobson, 2016. "Defining the Product Chemical Space of Monoterpenoid Synthases," PLOS Computational Biology, Public Library of Science, vol. 12(8), pages 1-13, August.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1008197. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.