Author
Listed:
- Olivier Mailhot
- Vincent Frappier
- François Major
- Rafael J Najmanovich
Abstract
The Elastic Network Contact Model (ENCoM) is a coarse-grained normal mode analysis (NMA) model unique in its all-atom sensitivity to the sequence of the studied macromolecule and thus to the effect of mutations. We adapted ENCoM to simulate the dynamics of ribonucleic acid (RNA) molecules, benchmarked its performance against other popular NMA models and used it to study the 3D structural dynamics of human microRNA miR-125a, leveraging high-throughput experimental maturation efficiency data of over 26 000 sequence variants. We also introduce a novel way of using dynamical information from NMA to train multivariate linear regression models, with the purpose of highlighting the most salient contributions of dynamics to function. ENCoM has a similar performance profile on RNA than on proteins when compared to the Anisotropic Network Model (ANM), the most widely used coarse-grained NMA model; it has the advantage on predicting large-scale motions while ANM performs better on B-factors prediction. A stringent benchmark from the miR-125a maturation dataset, in which the training set contains no sequence information in common with the testing set, reveals that ENCoM is the only tested model able to capture signal beyond the sequence. This ability translates to better predictive power on a second benchmark in which sequence features are shared between the train and test sets. When training the linear regression model using all available data, the dynamical features identified as necessary for miR-125a maturation point to known patterns but also offer new insights into the biogenesis of microRNAs. Our novel approach combining NMA with multivariate linear regression is generalizable to any macromolecule for which relatively high-throughput mutational data is available.Author summary: Ribonucleic acids (RNAs) are biomolecules which play essential roles in the function of all living organisms. These molecules can adopt defined 3D structures in the cell, but they also move around their equilibrium structure. RNA function is intimately related to structural dynamics, which, however, can be costly to simulate. In the present study, we adapt a fast method for the computational study of protein dynamics, called ENCoM, to work on RNA molecules. We benchmark its performance against other similar methods and find that ENCoM has a clear advantage when it comes to predicting large-scale dynamics. Moreover, ENCoM is unique in its ability to predict the effect of mutations on structural dynamics, as was already shown for proteins. This ability extends to RNA: we capture dynamics-function relationships apparent from experimental maturation efficiency data on over 26 000 sequence variants of a human microRNA, miR-125a. These dynamics-function relationships are learned by a novel linear model combining the reduced ENCoM dynamical information and the energy of folding. The low computational cost of this technique opens up the possibility of high-throughput prediction of RNA and protein functional properties from sequence information, if starting structures are known or can be predicted.
Suggested Citation
Olivier Mailhot & Vincent Frappier & François Major & Rafael J Najmanovich, 2022.
"Sequence-sensitive elastic network captures dynamical features necessary for miR-125a maturation,"
PLOS Computational Biology, Public Library of Science, vol. 18(12), pages 1-28, December.
Handle:
RePEc:plo:pcbi00:1010777
DOI: 10.1371/journal.pcbi.1010777
Download full text from publisher
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1010777. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.