IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v16y2025i1d10.1038_s41467-025-60872-5.html
   My bibliography  Save this article

RiNALMo: general-purpose RNA language models can generalize well on structure prediction tasks

Author

Listed:
  • Rafael Josip Penić

    (University of Zagreb)

  • Tin Vlašić

    (Agency for Science Technology and Research (A*STAR))

  • Roland G. Huber

    (Agency for Science, Technology and Research (A*STAR))

  • Yue Wan

    (Agency for Science Technology and Research (A*STAR))

  • Mile Šikić

    (University of Zagreb
    Agency for Science Technology and Research (A*STAR))

Abstract

While RNA has recently been recognized as an interesting small-molecule drug target, many challenges remain to be addressed before we take full advantage of it. This emphasizes the necessity to improve our understanding of its structures and functions. Over the years, sequencing technologies have produced an enormous amount of unlabeled RNA data, which hides a huge potential. Motivated by the successes of protein language models, we introduce RiboNucleic Acid Language Model (RiNALMo) to unveil the hidden code of RNA. RiNALMo is the largest RNA language model to date, with 650M parameters pre-trained on 36M non-coding RNA sequences from several databases. It can extract hidden knowledge and capture the underlying structure information implicitly embedded within the RNA sequences. RiNALMo achieves state-of-the-art results on several downstream tasks. Notably, we show that its generalization capabilities overcome the inability of other deep learning methods for secondary structure prediction to generalize on unseen RNA families.

Suggested Citation

  • Rafael Josip Penić & Tin Vlašić & Roland G. Huber & Yue Wan & Mile Šikić, 2025. "RiNALMo: general-purpose RNA language models can generalize well on structure prediction tasks," Nature Communications, Nature, vol. 16(1), pages 1-15, December.
  • Handle: RePEc:nat:natcom:v:16:y:2025:i:1:d:10.1038_s41467-025-60872-5
    DOI: 10.1038/s41467-025-60872-5
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-025-60872-5
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-025-60872-5?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Jaswinder Singh & Jack Hanson & Kuldip Paliwal & Yaoqi Zhou, 2019. "RNA secondary structure prediction using an ensemble of two-dimensional deep neural networks and transfer learning," Nature Communications, Nature, vol. 10(1), pages 1-13, December.
    2. John Jumper & Richard Evans & Alexander Pritzel & Tim Green & Michael Figurnov & Olaf Ronneberger & Kathryn Tunyasuvunakool & Russ Bates & Augustin Žídek & Anna Potapenko & Alex Bridgland & Clemens Me, 2021. "Highly accurate protein structure prediction with AlphaFold," Nature, Nature, vol. 596(7873), pages 583-589, August.
    3. Kengo Sato & Manato Akiyama & Yasubumi Sakakibara, 2021. "RNA secondary structure prediction using deep learning with thermodynamic integration," Nature Communications, Nature, vol. 12(1), pages 1-9, December.
    4. Jicong Cao & Eva Maria Novoa & Zhizhuo Zhang & William C. W. Chen & Dianbo Liu & Gigi C. G. Choi & Alan S. L. Wong & Claudia Wehrspaun & Manolis Kellis & Timothy K. Lu, 2021. "High-throughput 5′ UTR engineering for enhanced protein production in non-viral gene therapies," Nature Communications, Nature, vol. 12(1), pages 1-10, December.
    5. Noelia Ferruz & Steffen Schmidt & Birte Höcker, 2022. "ProtGPT2 is a deep unsupervised language model for protein design," Nature Communications, Nature, vol. 13(1), pages 1-10, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Heqin Zhu & Fenghe Tang & Quan Quan & Ke Chen & Peng Xiong & S. Kevin Zhou, 2025. "Deep generalizable prediction of RNA secondary structure via base pair motif energy," Nature Communications, Nature, vol. 16(1), pages 1-13, December.
    2. Yuki Kagaya & Zicong Zhang & Nabil Ibtehaz & Xiao Wang & Tsukasa Nakamura & Pranav Deep Punuru & Daisuke Kihara, 2025. "NuFold: end-to-end approach for RNA tertiary structure prediction with flexible nucleobase center representation," Nature Communications, Nature, vol. 16(1), pages 1-14, December.
    3. Yang Li & Chengxin Zhang & Chenjie Feng & Robin Pearce & P. Lydia Freddolino & Yang Zhang, 2023. "Integrating end-to-end learning with deep geometrical potentials for ab initio RNA structure prediction," Nature Communications, Nature, vol. 14(1), pages 1-13, December.
    4. Peicong Lin & Yumeng Yan & Huanyu Tao & Sheng-You Huang, 2023. "Deep transfer learning for inter-chain contact predictions of transmembrane protein complexes," Nature Communications, Nature, vol. 14(1), pages 1-16, December.
    5. Veda Sheersh Boorla & Costas D. Maranas, 2025. "CatPred: a comprehensive framework for deep learning in vitro enzyme kinetic parameters," Nature Communications, Nature, vol. 16(1), pages 1-17, December.
    6. Rezwan Siddiquee & Carol H. Pong & Ruth M. Hall & Sandro F. Ataide, 2024. "A programmable seekRNA guides target selection by IS1111 and IS110 type insertion sequences," Nature Communications, Nature, vol. 15(1), pages 1-15, December.
    7. Kevin E. Wu & Kevin K. Yang & Rianne Berg & Sarah Alamdari & James Y. Zou & Alex X. Lu & Ava P. Amini, 2024. "Protein structure generation via folding diffusion," Nature Communications, Nature, vol. 15(1), pages 1-12, December.
    8. Gustavo Arango-Argoty & Elly Kipkogei & Ross Stewart & Gerald J. Sun & Arijit Patra & Ioannis Kagiampakis & Etai Jacob, 2025. "Pretrained transformers applied to clinical studies improve predictions of treatment efficacy and associated biomarkers," Nature Communications, Nature, vol. 16(1), pages 1-18, December.
    9. Wenkai Wang & Chenjie Feng & Renmin Han & Ziyi Wang & Lisha Ye & Zongyang Du & Hong Wei & Fa Zhang & Zhenling Peng & Jianyi Yang, 2023. "trRosettaRNA: automated prediction of RNA 3D structure with transformer network," Nature Communications, Nature, vol. 14(1), pages 1-13, December.
    10. Amir Pandi & David Adam & Amir Zare & Van Tuan Trinh & Stefan L. Schaefer & Marie Burt & Björn Klabunde & Elizaveta Bobkova & Manish Kushwaha & Yeganeh Foroughijabbari & Peter Braun & Christoph Spahn , 2023. "Cell-free biosynthesis combined with deep learning accelerates de novo-development of antimicrobial peptides," Nature Communications, Nature, vol. 14(1), pages 1-14, December.
    11. Aidan T. Riley & James M. Robson & Aiganysh Ulanova & Alexander A. Green, 2025. "Generative and predictive neural networks for the design of functional RNA molecules," Nature Communications, Nature, vol. 16(1), pages 1-17, December.
    12. Wenwu Zeng & Yutao Dou & Liangrui Pan & Liwen Xu & Shaoliang Peng, 2024. "Improving prediction performance of general protein language model by domain-adaptive pretraining on DNA-binding protein," Nature Communications, Nature, vol. 15(1), pages 1-18, December.
    13. Sijie Chen & Tong Lin & Ruchira Basu & Jeremy Ritchey & Shen Wang & Yichuan Luo & Xingcan Li & Dehua Pei & Levent Burak Kara & Xiaolin Cheng, 2024. "Design of target specific peptide inhibitors using generative deep learning and molecular dynamics simulations," Nature Communications, Nature, vol. 15(1), pages 1-20, December.
    14. Sophia Vincoff & Shrey Goel & Kseniia Kholina & Rishab Pulugurta & Pranay Vure & Pranam Chatterjee, 2025. "FusOn-pLM: a fusion oncoprotein-specific language model via adjusted rate masking," Nature Communications, Nature, vol. 16(1), pages 1-11, December.
    15. David Ding & Ada Y. Shaw & Sam Sinai & Nathan Rollins & Noam Prywes & David F. Savage & Michael T. Laub & Debora S. Marks, 2024. "Protein design using structure-based residue preferences," Nature Communications, Nature, vol. 15(1), pages 1-12, December.
    16. Pantelis Livanos & Choy Kriechbaum & Sophia Remers & Arvid Herrmann & Sabine Müller, 2025. "Kinesin-12 POK2 polarization is a prerequisite for a fully functional division site and aids cell plate positioning," Nature Communications, Nature, vol. 16(1), pages 1-17, December.
    17. Surabhi Kokane & Ashutosh Gulati & Pascal F. Meier & Rei Matsuoka & Tanadet Pipatpolkai & Giuseppe Albano & Tin Manh Ho & Lucie Delemotte & Daniel Fuster & David Drew, 2025. "PIP2-mediated oligomerization of the endosomal sodium/proton exchanger NHE9," Nature Communications, Nature, vol. 16(1), pages 1-17, December.
    18. Justin Riper & Arleth O. Martinez-Claros & Lie Wang & Hannah E. Schneiderman & Sweta Maheshwari & Monica C. Pillon, 2025. "CryoEM structure of the SLFN14 endoribonuclease reveals insight into RNA binding and cleavage," Nature Communications, Nature, vol. 16(1), pages 1-15, December.
    19. Pierre Azoulay & Joshua Krieger & Abhishek Nagaraj, 2024. "Old Moats for New Models: Openness, Control, and Competition in Generative Artificial Intelligence," NBER Chapters, in: Entrepreneurship and Innovation Policy and the Economy, volume 4, pages 7-46, National Bureau of Economic Research, Inc.
    20. Anthony C. Bishop & Glorisé Torres-Montalvo & Sravya Kotaru & Kyle Mimun & A. Joshua Wand, 2023. "Robust automated backbone triple resonance NMR assignments of proteins using Bayesian-based simulated annealing," Nature Communications, Nature, vol. 14(1), pages 1-15, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:16:y:2025:i:1:d:10.1038_s41467-025-60872-5. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.