IDEAS home Printed from https://ideas.repec.org/a/plo/pcbi00/1013904.html

Hierarchical analysis of RNA secondary structures with pseudoknots based on sections

Author

Listed:
  • Ryota Masuki
  • Donn Liew
  • Ee Hou Yong

Abstract

Predicting RNA structures containing pseudoknots remains computationally challenging due to high processing costs and complexity. While standard methods for pseudoknot prediction require O(N6) time complexity, we present a hierarchical approach that significantly reduces computational cost while maintaining prediction accuracy. Our method analyzes RNA structures by dividing them into contiguous regions of unpaired bases (“sections”) derived from known secondary structures. We examine pseudoknot interactions between sections using a nearest-neighbor energy model with dynamic programming. Our algorithm scales as O(n2ℓ4), offering substantial computational advantages over existing global prediction methods. Analysis of 726 transfer messenger RNA and 454 Ribonuclease P RNA sequences reveals that biologically relevant pseudoknots are highly concentrated among section pairs with large minimum free energy (MFE) gain. Over 90% of connected section pairs appear within just the top 3% of section pairs ranked by MFE gain. For 2-clusters, our method achieves high prediction accuracy with sensitivity exceeding 0.9 and positive predictive value above 0.8. For 3-clusters, we discovered asymmetric behavior where “former” section pairs (formed early in the sequence) are predicted accurately, while “latter” section pairs do not follow local energy predictions. This asymmetry suggests that complex pseudoknot formation follows sequential co-transcriptional folding rather than global energy minimization, providing insights into RNA folding dynamics.Author summary: RNA molecules fold into structures to perform biological functions. However, predicting complex RNA structures known as “pseudoknots” is computationally expensive. Current methods often attempt to calculate the entire structure simultaneously, which requires significant computational resources. In this paper, we introduce a hierarchical approach that simplifies pseudoknot prediction. We break the RNA sequence into smaller “sections” of unpaired bases and calculate the energy required for these sections to bind locally, rather than solving for the global structure. Our analysis shows that strong local interactions are favored by biology; with over 90% of pseudoknots occurring within the top 3% of the most energetically favorable section pairs. This finding allows us to focus computational effort on the small subset of interactions that are most likely to form pseudoknots, rather than testing every possible combination. Our method achieves >90% sensitivity for simple 2-section pseudoknots. However, for complex 3-section pseudoknots, only early-forming connections are predictable. This reveals that RNA does not simply fold into the most stable structure. Instead, folding is sequential, with earlier regions establishing interactions that constrain the final structure before synthesis of the later regions.

Suggested Citation

  • Ryota Masuki & Donn Liew & Ee Hou Yong, 2026. "Hierarchical analysis of RNA secondary structures with pseudoknots based on sections," PLOS Computational Biology, Public Library of Science, vol. 22(1), pages 1-18, January.
  • Handle: RePEc:plo:pcbi00:1013904
    DOI: 10.1371/journal.pcbi.1013904
    as

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1013904
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1013904&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pcbi.1013904?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Mitchell Guttman & John L. Rinn, 2012. "Modular regulatory principles of large non-coding RNAs," Nature, Nature, vol. 482(7385), pages 339-346, February.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Sun, Silin & Xiu, Yuhan & Zhou, Bingwei & Qi, Zixuan & Liu, Bo & Long, Haixia, 2025. "Prediction of lncRNA-disease association based on heterogeneous graph contrastive learning," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 677(C).
    2. Shivali Patel & Alec N. Sexton & Madison S. Strine & Craig B. Wilen & Matthew D. Simon & Anna Marie Pyle, 2023. "Systematic detection of tertiary structural modules in large RNAs and RNP interfaces by Tb-seq," Nature Communications, Nature, vol. 14(1), pages 1-11, December.
    3. Tian Tian & Chunjian Li & Jing Xiao & Yi Shen & Yihua Lu & Liying Jiang & Xun Zhuang & Minjie Chu, 2016. "Quantitative Assessment of the Polymorphisms in the HOTAIR lncRNA and Cancer Risk: A Meta-Analysis of 8 Case-Control Studies," PLOS ONE, Public Library of Science, vol. 11(3), pages 1-11, March.
    4. Lihui Liu & Ziyang Liu & Qinghua Liu & Wei Wu & Peng Lin & Xing Liu & Yuechuan Zhang & Dongpeng Wang & Briana C. Prager & Ryan C. Gimple & Jichuan Yu & Weixi Zhao & Qiulian Wu & Wei Zhang & Erzhong Wu, 2023. "LncRNA INHEG promotes glioma stem cell maintenance and tumorigenicity through regulating rRNA 2’-O-methylation," Nature Communications, Nature, vol. 14(1), pages 1-18, December.
    5. Roberta Esposito & Andrés Lanzós & Tina Uroda & Sunandini Ramnarayanan & Isabel Büchi & Taisia Polidori & Hugo Guillen-Ramirez & Ante Mihaljevic & Bernard Mefi Merlin & Lia Mela & Eugenio Zoni & Lusin, 2023. "Tumour mutations in long noncoding RNAs enhance cell fitness," Nature Communications, Nature, vol. 14(1), pages 1-21, December.
    6. Weiwei Han & Zhenyu Zhang & Bangshun He & Yijun Xu & Jun Zhang & Weijun Cao, 2017. "Integrated analysis of long non-coding RNAs in human gastric cancer: An in silico study," PLOS ONE, Public Library of Science, vol. 12(8), pages 1-12, August.
    7. Christian Much & Erika L. Lasda & Isabela T. Pereira & Tenaya K. Vallery & Daniel Ramirez & Jordan P. Lewandowski & Robin D. Dowell & Michael J. Smallegan & John L. Rinn, 2024. "The temporal dynamics of lncRNA Firre-mediated epigenetic and transcriptional regulation," Nature Communications, Nature, vol. 15(1), pages 1-13, December.
    8. Harshita Sharma & Matthew N. Z. Valentine & Naoko Toki & Hiromi Nishiyori Sueki & Stefano Gustincich & Hazuki Takahashi & Piero Carninci, 2024. "Decryption of sequence, structure, and functional features of SINE repeat elements in SINEUP non-coding RNA-mediated post-transcriptional gene regulation," Nature Communications, Nature, vol. 15(1), pages 1-19, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1013904. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.