IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v16y2025i1d10.1038_s41467-025-58576-x.html
   My bibliography  Save this article

Accurate cross-species 5mC detection for Oxford Nanopore sequencing in plants with DeepPlant

Author

Listed:
  • He-Xu Chen

    (Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science
    Sun Yat-Sen University)

  • Zhen-Dong Liu

    (Shanghai Polytechnic University)

  • Xin Bai

    (Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science)

  • Bo Wu

    (Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science)

  • Rong Song

    (Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science)

  • Hui-Cong Yao

    (Sun Yat-Sen University)

  • Ying Chen

    (Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science)

  • Wei Chi

    (Southern Medical University)

  • Qian Hua

    (Beijing University of Chinese Medicine)

  • Liang Cheng

    (Harbin Medical University)

  • Chuan-Le Xiao

    (Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science)

Abstract

Nanopore sequencing enables comprehensive detection of 5-methylcytosine (5mC), particularly in repeat regions. However, CHH methylation detection in plants is limited by the scarcity of high-methylation positive samples, reducing generalization across species. Dorado, the only tool for plant 5mC detection on the R10.4 platform, lacks extensive species testing. Here, we develop DeepPlant, a deep learning model incorporating both Bi-LSTM and Transformer architectures, which significantly improves CHH detection accuracy and performs well for CpG and CHG motifs. We address the scarcity of methylation-positive CHH training samples through screening species with abundant high-methylation CHH sites using bisulfite-sequencing and generate datasets that cover diverse 9-mer motifs for training and testing DeepPlant. Evaluated across nine species, DeepPlant achieves high whole-genome methylation frequency correlations (0.705-0.838) with BS-seq data on CHH, improved by 23.4- 117.6% compared to Dorado. DeepPlant also demonstrates superior single-molecule accuracy and F1 score, offering strong generalization for plant epigenetics research.

Suggested Citation

  • He-Xu Chen & Zhen-Dong Liu & Xin Bai & Bo Wu & Rong Song & Hui-Cong Yao & Ying Chen & Wei Chi & Qian Hua & Liang Cheng & Chuan-Le Xiao, 2025. "Accurate cross-species 5mC detection for Oxford Nanopore sequencing in plants with DeepPlant," Nature Communications, Nature, vol. 16(1), pages 1-12, December.
  • Handle: RePEc:nat:natcom:v:16:y:2025:i:1:d:10.1038_s41467-025-58576-x
    DOI: 10.1038/s41467-025-58576-x
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-025-58576-x
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-025-58576-x?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Zhengming Wang & David C. Baulcombe, 2020. "Transposon age and non-CG methylation," Nature Communications, Nature, vol. 11(1), pages 1-9, December.
    2. Li He & Huan Huang & Mariem Bradai & Cheng Zhao & Yin You & Jun Ma & Lun Zhao & Rosa Lozano-Durán & Jian-Kang Zhu, 2022. "DNA methylation-free Arabidopsis reveals crucial roles of DNA methylation in regulating gene expression and development," Nature Communications, Nature, vol. 13(1), pages 1-16, December.
    3. Peng Ni & Neng Huang & Fan Nie & Jun Zhang & Zhi Zhang & Bo Wu & Lu Bai & Wende Liu & Chuan-Le Xiao & Feng Luo & Jianxin Wang, 2021. "Genome-wide detection of cytosine methylations in plant from Nanopore data using deep learning," Nature Communications, Nature, vol. 12(1), pages 1-11, December.
    4. Mian Umair Ahsan & Anagha Gouru & Joe Chan & Wanding Zhou & Kai Wang, 2024. "A signal processing and deep learning framework for methylation detection using Oxford Nanopore sequencing," Nature Communications, Nature, vol. 15(1), pages 1-21, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Yujuan Liu & Jie Shu & Zhi Zhang & Ning Ding & Jinyuan Liu & Jun Liu & Yuhai Cui & Changhu Wang & Chen Chen, 2024. "A conserved Pol II elongator SPT6L mediates Pol V transcription to regulate RNA-directed DNA methylation in Arabidopsis," Nature Communications, Nature, vol. 15(1), pages 1-13, December.
    2. Tuomas Hämälä & Christopher Moore & Laura Cowan & Matthew Carlile & David Gopaulchan & Marie K. Brandrud & Siri Birkeland & Matthew Loose & Filip Kolář & Marcus A. Koch & Levi Yant, 2024. "Impact of whole-genome duplications on structural variant evolution in Cochlearia," Nature Communications, Nature, vol. 15(1), pages 1-13, December.
    3. Kangxi Du & Jiabing Wu & Jiachen Wang & Wenhao Xie & Liufan Yin & Xiang Li & Chao Li & Aiwu Dong, 2024. "The chromatin remodeling factor OsINO80 promotes H3K27me3 and H3K9me2 deposition and maintains TE silencing in rice," Nature Communications, Nature, vol. 15(1), pages 1-11, December.
    4. Qingyuan Fan & Xuyang Zhao & Junyao Li & Ronghui Liu & Ming Liu & Qishun Feng & Yanping Long & Yang Fu & Jixian Zhai & Qing Pan & Yi Li, 2025. "De novo non-canonical nanopore basecalling enables private communication using heavily-modified DNA data at single-molecule level," Nature Communications, Nature, vol. 16(1), pages 1-11, December.
    5. Xiaohe Lin & Junjie Yin & Yifan Wang & Jing Yao & Qingshun Q. Li & Vit Latzel & Oliver Bossdorf & Yuan-Ye Zhang, 2024. "Environment-induced heritable variations are common in Arabidopsis thaliana," Nature Communications, Nature, vol. 15(1), pages 1-13, December.
    6. Dominik Stanojević & Zhe Li & Sara Bakić & Roger Foo & Mile Šikić, 2024. "Rockfish: A transformer-based model for accurate 5-methylcytosine prediction from nanopore sequencing," Nature Communications, Nature, vol. 15(1), pages 1-19, December.
    7. Li He & Huan Huang & Mariem Bradai & Cheng Zhao & Yin You & Jun Ma & Lun Zhao & Rosa Lozano-Durán & Jian-Kang Zhu, 2022. "DNA methylation-free Arabidopsis reveals crucial roles of DNA methylation in regulating gene expression and development," Nature Communications, Nature, vol. 13(1), pages 1-16, December.
    8. Ida Shinder & Richard Hu & Hyun Joo Ji & Kuan-Hao Chao & Mihaela Pertea, 2023. "EASTR: Identifying and eliminating systematic alignment errors in multi-exon genes," Nature Communications, Nature, vol. 14(1), pages 1-12, December.
    9. Peng Ni & Neng Huang & Fan Nie & Jun Zhang & Zhi Zhang & Bo Wu & Lu Bai & Wende Liu & Chuan-Le Xiao & Feng Luo & Jianxin Wang, 2021. "Genome-wide detection of cytosine methylations in plant from Nanopore data using deep learning," Nature Communications, Nature, vol. 12(1), pages 1-11, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:16:y:2025:i:1:d:10.1038_s41467-025-58576-x. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.