IDEAS home Printed from https://ideas.repec.org/a/plo/pcbi00/1000406.html
   My bibliography  Save this article

A Probabilistic Model of RNA Conformational Space

Author

Listed:
  • Jes Frellsen
  • Ida Moltke
  • Martin Thiim
  • Kanti V Mardia
  • Jesper Ferkinghoff-Borg
  • Thomas Hamelryck

Abstract

The increasing importance of non-coding RNA in biology and medicine has led to a growing interest in the problem of RNA 3-D structure prediction. As is the case for proteins, RNA 3-D structure prediction methods require two key ingredients: an accurate energy function and a conformational sampling procedure. Both are only partly solved problems. Here, we focus on the problem of conformational sampling. The current state of the art solution is based on fragment assembly methods, which construct plausible conformations by stringing together short fragments obtained from experimental structures. However, the discrete nature of the fragments necessitates the use of carefully tuned, unphysical energy functions, and their non-probabilistic nature impairs unbiased sampling. We offer a solution to the sampling problem that removes these important limitations: a probabilistic model of RNA structure that allows efficient sampling of RNA conformations in continuous space, and with associated probabilities. We show that the model captures several key features of RNA structure, such as its rotameric nature and the distribution of the helix lengths. Furthermore, the model readily generates native-like 3-D conformations for 9 out of 10 test structures, solely using coarse-grained base-pairing information. In conclusion, the method provides a theoretical and practical solution for a major bottleneck on the way to routine prediction and simulation of RNA structure and dynamics in atomic detail.Author Summary: The importance of RNA in biology and medicine has increased immensely over the last several years, due to the discovery of a wide range of important biological processes that are under the guidance of non-coding RNA. As is the case with proteins, the function of an RNA molecule is encoded in its three-dimensional (3-D) structure, which in turn is determined by the molecule's sequence. Therefore, interest in the computational prediction of the 3-D structure of RNA from sequence is great. One of the main bottlenecks in routine prediction and simulation of RNA structure and dynamics is sampling, the efficient generation of RNA-like conformations, ideally in a mathematically and physically sound way. Current methods require the use of unphysical energy functions to amend the shortcomings of the sampling procedure. We have developed a mathematical model that describes RNA's conformational space in atomic detail, without the shortcomings of other sampling methods. As an illustration of its potential, we describe a simple yet efficient method to sample conformations that are compatible with a given secondary structure. An implementation of the sampling method, called BARNACLE, is freely available.

Suggested Citation

  • Jes Frellsen & Ida Moltke & Martin Thiim & Kanti V Mardia & Jesper Ferkinghoff-Borg & Thomas Hamelryck, 2009. "A Probabilistic Model of RNA Conformational Space," PLOS Computational Biology, Public Library of Science, vol. 5(6), pages 1-11, June.
  • Handle: RePEc:plo:pcbi00:1000406
    DOI: 10.1371/journal.pcbi.1000406
    as

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1000406
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1000406&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pcbi.1000406?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Thomas Hamelryck & John T Kent & Anders Krogh, 2006. "Sampling Realistic Protein Conformations Using Local Structural Bias," PLOS Computational Biology, Public Library of Science, vol. 2(9), pages 1-13, September.
    2. K. V. Mardia, 1999. "Directional statistics and shape analysis," Journal of Applied Statistics, Taylor & Francis Journals, vol. 26(8), pages 949-957.
    3. Marc Parisien & François Major, 2008. "The MC-Fold and MC-Sym pipeline infers RNA structure from sequence data," Nature, Nature, vol. 452(7183), pages 51-55, March.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Jun Li & Wei Zhu & Jun Wang & Wenfei Li & Sheng Gong & Jian Zhang & Wei Wang, 2018. "RNA3DCNN: Local and global quality assessments of RNA 3D structures using 3D deep convolutional neural networks," PLOS Computational Biology, Public Library of Science, vol. 14(11), pages 1-18, November.
    2. Mardia, Kanti V. & Wiechers, Henrik & Eltzner, Benjamin & Huckemann, Stephan F., 2022. "Principal component analysis and clustering on manifolds," Journal of Multivariate Analysis, Elsevier, vol. 188(C).
    3. Mélanie Boudard & Julie Bernauer & Dominique Barth & Johanne Cohen & Alain Denise, 2015. "GARN: Sampling RNA 3D Structure Space with Game Theory and Knowledge-Based Scoring Strategies," PLOS ONE, Public Library of Science, vol. 10(8), pages 1-21, August.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Toshihiro Abe & Arthur Pewsey, 2011. "Sine-skewed circular distributions," Statistical Papers, Springer, vol. 52(3), pages 683-707, August.
    2. Fernández de Marcos Giménez de los Galanes, Alberto & García Portugués, Eduardo, 2022. "Data-driven stabilizations of goodness-of-fit tests," DES - Working Papers. Statistics and Econometrics. WS 35324, Universidad Carlos III de Madrid. Departamento de Estadística.
    3. Kanti V. Mardia, 2021. "Comments on: Recent advances in directional statistics," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 30(1), pages 59-63, March.
    4. Koch, Erwan & Robert, Christian Y., 2019. "Geometric ergodicity for some space–time max-stable Markov chains," Statistics & Probability Letters, Elsevier, vol. 145(C), pages 43-49.
    5. Abe, Toshihiro & Miyata, Yoichi & Shiohama, Takayuki, 2023. "Bayesian estimation for mode and anti-mode preserving circular distributions," Econometrics and Statistics, Elsevier, vol. 27(C), pages 136-160.
    6. M. Jones & Arthur Pewsey & Shogo Kato, 2015. "On a class of circulas: copulas for circular distributions," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 67(5), pages 843-862, October.
    7. Bee, Marco & Benedetti, Roberto & Espa, Giuseppe, 2017. "Approximate maximum likelihood estimation of the Bingham distribution," Computational Statistics & Data Analysis, Elsevier, vol. 108(C), pages 84-96.
    8. Nicosia, Aurélien & Duchesne, Thierry & Rivest, Louis-Paul & Fortin, Daniel, 2017. "A general hidden state random walk model for animal movement," Computational Statistics & Data Analysis, Elsevier, vol. 105(C), pages 76-95.
    9. Liang Liu & Shi-Jie Chen, 2012. "Coarse-Grained Prediction of RNA Loop Structures," PLOS ONE, Public Library of Science, vol. 7(11), pages 1-15, November.
    10. Kienzle, Marco & Sterling, David & Zhou, Shijie & Wang, You-Gan, 2016. "Maximum likelihood estimation of natural mortality and quantification of temperature effects on catchability of brown tiger prawn (Penaeus esculentus) in Moreton Bay (Australia) using logbook data," Ecological Modelling, Elsevier, vol. 322(C), pages 1-9.
    11. Jun Li & Wei Zhu & Jun Wang & Wenfei Li & Sheng Gong & Jian Zhang & Wei Wang, 2018. "RNA3DCNN: Local and global quality assessments of RNA 3D structures using 3D deep convolutional neural networks," PLOS Computational Biology, Public Library of Science, vol. 14(11), pages 1-18, November.
    12. Arthur Pewsey & Eduardo García-Portugués, 2021. "Recent advances in directional statistics," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 30(1), pages 1-58, March.
    13. Ge Han & Yi Xue, 2022. "Rational design of hairpin RNA excited states reveals multi-step transitions," Nature Communications, Nature, vol. 13(1), pages 1-12, December.
    14. Fernández-Durán Juan José & Gregorio-Domínguez MarÍa Mercedes, 2014. "Modeling angles in proteins and circular genomes using multivariate angular distributions based on multiple nonnegative trigonometric sums," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 13(1), pages 1-18, February.
    15. De Iaco, S. & Palma, M. & Posa, D., 2005. "Modeling and prediction of multivariate space-time random fields," Computational Statistics & Data Analysis, Elsevier, vol. 48(3), pages 525-547, March.
    16. Fernández-de-Marcos, Alberto & García-Portugués, Eduardo, 2023. "Data-driven stabilizations of goodness-of-fit tests," Computational Statistics & Data Analysis, Elsevier, vol. 179(C).
    17. Auderset, Claude & Mazza, Christian & Ruh, Ernst A., 2005. "Angular Gaussian and Cauchy estimation," Journal of Multivariate Analysis, Elsevier, vol. 93(1), pages 180-197, March.
    18. Yasuhito Tsuruta & Masahiko Sagae, 2020. "Theoretical properties of bandwidth selectors for kernel density estimation on the circle," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 72(2), pages 511-530, April.
    19. Carnicero, José Antonio & Wiper, Michael Peter & Ausín Olivera, María Concepción, 2010. "Circular Bernstein polynomial distributions," DES - Working Papers. Statistics and Econometrics. WS ws102511, Universidad Carlos III de Madrid. Departamento de Estadística.
    20. Hornik, Kurt & Grün, Bettina, 2014. "On standard conjugate families for natural exponential families with bounded natural parameter space," Journal of Multivariate Analysis, Elsevier, vol. 126(C), pages 14-24.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1000406. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.