Author
Listed:
- Ryota Masuki
- Donn Liew
- Ee Hou Yong
Abstract
Predicting RNA structures containing pseudoknots remains computationally challenging due to high processing costs and complexity. While standard methods for pseudoknot prediction require O(N6) time complexity, we present a hierarchical approach that significantly reduces computational cost while maintaining prediction accuracy. Our method analyzes RNA structures by dividing them into contiguous regions of unpaired bases (“sections”) derived from known secondary structures. We examine pseudoknot interactions between sections using a nearest-neighbor energy model with dynamic programming. Our algorithm scales as O(n2ℓ4), offering substantial computational advantages over existing global prediction methods. Analysis of 726 transfer messenger RNA and 454 Ribonuclease P RNA sequences reveals that biologically relevant pseudoknots are highly concentrated among section pairs with large minimum free energy (MFE) gain. Over 90% of connected section pairs appear within just the top 3% of section pairs ranked by MFE gain. For 2-clusters, our method achieves high prediction accuracy with sensitivity exceeding 0.9 and positive predictive value above 0.8. For 3-clusters, we discovered asymmetric behavior where “former” section pairs (formed early in the sequence) are predicted accurately, while “latter” section pairs do not follow local energy predictions. This asymmetry suggests that complex pseudoknot formation follows sequential co-transcriptional folding rather than global energy minimization, providing insights into RNA folding dynamics.Author summary: RNA molecules fold into structures to perform biological functions. However, predicting complex RNA structures known as “pseudoknots” is computationally expensive. Current methods often attempt to calculate the entire structure simultaneously, which requires significant computational resources. In this paper, we introduce a hierarchical approach that simplifies pseudoknot prediction. We break the RNA sequence into smaller “sections” of unpaired bases and calculate the energy required for these sections to bind locally, rather than solving for the global structure. Our analysis shows that strong local interactions are favored by biology; with over 90% of pseudoknots occurring within the top 3% of the most energetically favorable section pairs. This finding allows us to focus computational effort on the small subset of interactions that are most likely to form pseudoknots, rather than testing every possible combination. Our method achieves >90% sensitivity for simple 2-section pseudoknots. However, for complex 3-section pseudoknots, only early-forming connections are predictable. This reveals that RNA does not simply fold into the most stable structure. Instead, folding is sequential, with earlier regions establishing interactions that constrain the final structure before synthesis of the later regions.
Suggested Citation
Ryota Masuki & Donn Liew & Ee Hou Yong, 2026.
"Hierarchical analysis of RNA secondary structures with pseudoknots based on sections,"
PLOS Computational Biology, Public Library of Science, vol. 22(1), pages 1-18, January.
Handle:
RePEc:plo:pcbi00:1013904
DOI: 10.1371/journal.pcbi.1013904
Download full text from publisher
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1013904. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.