Authors listed:
- Mauricio Perez (Genome)
- Michiko Kimoto (Nanos, #02-08 The Cavendish)
- Priscilla Rajakumar (Genome)
- Chayaporn Suphavilai (Genome)
- Rafael Peres da Silva (Genome)
- Hui Pen Tan (Nanos, #02-08 The Cavendish)
- Nicholas Ting Xun Ong (Genome)
- Hannah Nicholas (Genome)
- Ichiro Hirao (Nanos, #02-08 The Cavendish)
- Wei Leong Chew (Genome; National University of Singapore)
- Niranjan Nagarajan (Genome; National University of Singapore)
Abstract
The discovery of non-canonical bases (NCBs) and the development of synthetic xeno-nucleic acids (XNAs) have spawned interest in many applications in viral genomics, synthetic biology and DNA storage. However, the inability to do high-throughput sequencing of NCBs has been a significant limitation. We demonstrate that XNAs with NCBs can be robustly sequenced on a MinION system (>2.3×10⁶ reads/flowcell) to obtain significantly distinct signals from controls (median fold-change >6×). To enable AI-model training, we synthesized and sequenced a complex pool of 1,024 NCB-containing oligonucleotides with varied 6-mer contexts and high purity (>90%). Bootstrapped models assisted in data preparation, and data augmentation with spliced reads provided high context diversity, enabling learning of generalizable models to decipher NCB-containing sequences with high accuracy (>80%) and specificity (99%). These results highlight the versatility of nanopore sequencing for interrogating unusual nucleic acids, and the potential to transform the study of genetic material beyond those that use canonical bases.
Suggested Citation
Mauricio Perez & Michiko Kimoto & Priscilla Rajakumar & Chayaporn Suphavilai & Rafael Peres da Silva & Hui Pen Tan & Nicholas Ting Xun Ong & Hannah Nicholas & Ichiro Hirao & Wei Leong Chew & Niranjan Nagarajan, 2025.
"Direct high-throughput deconvolution of non-canonical bases via nanopore sequencing and bootstrapped learning,"
Nature Communications, Nature, vol. 16(1), pages 1-12, December.
Handle:
RePEc:nat:natcom:v:16:y:2025:i:1:d:10.1038_s41467-025-62347-z
DOI: 10.1038/s41467-025-62347-z