IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v15y2024i1d10.1038_s41467-023-44290-z.html
   My bibliography  Save this article

Effective binning of metagenomic contigs using contrastive multi-view representation learning

Author

Listed:
  • Ziye Wang

    (Fudan University)

  • Ronghui You

    (Fudan University)

  • Haitao Han

    (Fudan University)

  • Wei Liu

    (Fudan University)

  • Fengzhu Sun

    (University of Southern California)

  • Shanfeng Zhu

    (Fudan University
    Shanghai Qi Zhi Institute
    Ministry of Education
    Fudan University)

Abstract

Contig binning plays a crucial role in metagenomic data analysis by grouping contigs from the same or closely related genomes. However, existing binning methods face challenges in practical applications due to the diversity of data types and the difficulties in efficiently integrating heterogeneous information. Here, we introduce COMEBin, a binning method based on contrastive multi-view representation learning. COMEBin utilizes data augmentation to generate multiple fragments (views) of each contig and obtains high-quality embeddings of heterogeneous features (sequence coverage and k-mer distribution) through contrastive learning. Experimental results on multiple simulated and real datasets demonstrate that COMEBin outperforms state-of-the-art binning methods, particularly in recovering near-complete genomes from real environmental samples. COMEBin outperforms other binning methods remarkably when integrated into metagenomic analysis pipelines, including the recovery of potentially pathogenic antibiotic-resistant bacteria (PARB) and moderate or higher quality bins containing potential biosynthetic gene clusters (BGCs).

Suggested Citation

  • Ziye Wang & Ronghui You & Haitao Han & Wei Liu & Fengzhu Sun & Shanfeng Zhu, 2024. "Effective binning of metagenomic contigs using contrastive multi-view representation learning," Nature Communications, Nature, vol. 15(1), pages 1-14, December.
  • Handle: RePEc:nat:natcom:v:15:y:2024:i:1:d:10.1038_s41467-023-44290-z
    DOI: 10.1038/s41467-023-44290-z
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-023-44290-z
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-023-44290-z?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Junjie Qin & Ruiqiang Li & Jeroen Raes & Manimozhiyan Arumugam & Kristoffer Solvsten Burgdorf & Chaysavanh Manichanh & Trine Nielsen & Nicolas Pons & Florence Levenez & Takuji Yamada & Daniel R. Mende, 2010. "A human gut microbial gene catalogue established by metagenomic sequencing," Nature, Nature, vol. 464(7285), pages 59-65, March.
    2. Gregory D. Poore & Evguenia Kopylova & Qiyun Zhu & Carolina Carpenter & Serena Fraraccio & Stephen Wandro & Tomasz Kosciolek & Stefan Janssen & Jessica Metcalf & Se Jin Song & Jad Kanbar & Sandrine Mi, 2020. "Microbiome analyses of blood and tissues suggest cancer diagnostic approach," Nature, Nature, vol. 579(7800), pages 567-574, March.
    3. Shaojun Pan & Chengkai Zhu & Xing-Ming Zhao & Luis Pedro Coelho, 2022. "A deep siamese neural network improves metagenome-assembled genomes in microbiome datasets across different environments," Nature Communications, Nature, vol. 13(1), pages 1-12, December.
    4. Marsha C. Wibowo & Zhen Yang & Maxime Borry & Alexander Hübner & Kun D. Huang & Braden T. Tierney & Samuel Zimmerman & Francisco Barajas-Olmos & Cecilia Contreras-Cubas & Humberto García-Ortiz & Angél, 2021. "Reconstruction of ancient microbial genomes from the human gut," Nature, Nature, vol. 594(7862), pages 234-239, June.
    5. Stephen Nayfach & Zhou Jason Shi & Rekha Seshadri & Katherine S. Pollard & Nikos C. Kyrpides, 2019. "New insights from uncultivated genomes of the global human gut microbiome," Nature, Nature, vol. 568(7753), pages 505-510, April.
    6. Lucas Paoli & Hans-Joachim Ruscheweyh & Clarissa C. Forneris & Florian Hubrich & Satria Kautsar & Agneya Bhushan & Alessandro Lotti & Quentin Clayssen & Guillem Salazar & Alessio Milanese & Charlotte , 2022. "Biosynthetic potential of the global ocean microbiome," Nature, Nature, vol. 607(7917), pages 111-118, July.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Bin Ma & Caiyu Lu & Yiling Wang & Jingwen Yu & Kankan Zhao & Ran Xue & Hao Ren & Xiaofei Lv & Ronghui Pan & Jiabao Zhang & Yongguan Zhu & Jianming Xu, 2023. "A genomic catalogue of soil microbiomes boosts mining of biodiversity and genetic resources," Nature Communications, Nature, vol. 14(1), pages 1-14, December.
    2. Fiona B. Tamburini & Dylan Maghini & Ovokeraye H. Oduaran & Ryan Brewster & Michaella R. Hulley & Venesa Sahibdeen & Shane A. Norris & Stephen Tollman & Kathleen Kahn & Ryan G. Wagner & Alisha N. Wade, 2022. "Short- and long-read metagenomics of urban and rural South African gut microbiomes reveal a transitional composition and undescribed taxa," Nature Communications, Nature, vol. 13(1), pages 1-18, December.
    3. Shuqin Zeng & Dhrati Patangia & Alexandre Almeida & Zhemin Zhou & Dezhi Mu & R. Paul Ross & Catherine Stanton & Shaopu Wang, 2022. "A compendium of 32,277 metagenome-assembled genomes and over 80 million genes from the early-life human gut microbiome," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    4. Mingyue Cheng & Shuai Luo & Peng Zhang & Guangzhou Xiong & Kai Chen & Chuanqi Jiang & Fangdian Yang & Hanhui Huang & Pengshuo Yang & Guanxi Liu & Yuhao Zhang & Sang Ba & Ping Yin & Jie Xiong & Wei Mia, 2024. "A genome and gene catalog of the aquatic microbiomes of the Tibetan Plateau," Nature Communications, Nature, vol. 15(1), pages 1-13, December.
    5. Li Zhang & Karen R. Jonscher & Zuyuan Zhang & Yi Xiong & Ryan S. Mueller & Jacob E. Friedman & Chongle Pan, 2022. "Islet autoantibody seroconversion in type-1 diabetes is associated with metagenome-assembled genomes in infant gut microbiomes," Nature Communications, Nature, vol. 13(1), pages 1-14, December.
    6. Leandro C. Hermida & E. Michael Gertz & Eytan Ruppin, 2022. "Predicting cancer prognosis and drug response from the tumor microbiome," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    7. Ashwag Shami & Rewaa S. Jalal & Ruba A. Ashy & Haneen W. Abuauf & Lina Baz & Mohammed Y. Refai & Aminah A. Barqawi & Hanadi M. Baeissa & Manal A. Tashkandi & Sahar Alshareef & Aala A. Abulfaraj, 2022. "Use of Metagenomic Whole Genome Shotgun Sequencing Data in Taxonomic Assignment of Dipterygium glaucum Rhizosphere and Surrounding Bulk Soil Microbiomes, and Their Response to Watering," Sustainability, MDPI, vol. 14(14), pages 1-21, July.
    8. Yunmin Yang & Binbin Chu & Jiayi Cheng & Jiali Tang & Bin Song & Houyu Wang & Yao He, 2022. "Bacteria eat nanoprobes for aggregation-enhanced imaging and killing diverse microorganisms," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    9. Qi, Lijuan & Wu, Jiansong & Chen, Ye & Wen, Qing & Xu, Haitao & Wang, Yuyang, 2020. "Shape-controllable binderless self-supporting hydrogel anode for microbial fuel cells," Renewable Energy, Elsevier, vol. 156(C), pages 1325-1335.
    10. Candice R. Gurbatri & Georgette A. Radford & Laura Vrbanac & Jongwon Im & Elaine M. Thomas & Courtney Coker & Samuel R. Taylor & YoungUk Jang & Ayelet Sivan & Kyu Rhee & Anas A. Saleh & Tiffany Chien , 2024. "Engineering tumor-colonizing E. coli Nissle 1917 for detection and treatment of colorectal neoplasia," Nature Communications, Nature, vol. 15(1), pages 1-13, December.
    11. Djawad Radjabzadeh & Jos A. Bosch & André G. Uitterlinden & Aeilko H. Zwinderman & M. Arfan Ikram & Joyce B. J. Meurs & Annemarie I. Luik & Max Nieuwdorp & Anja Lok & Cornelia M. Duijn & Robert Kraaij, 2022. "Gut microbiome-wide association study of depressive symptoms," Nature Communications, Nature, vol. 13(1), pages 1-10, December.
    12. Angelina Beavogui & Auriane Lacroix & Nicolas Wiart & Julie Poulain & Tom O. Delmont & Lucas Paoli & Patrick Wincker & Pedro H. Oliveira, 2024. "The defensome of complex bacterial communities," Nature Communications, Nature, vol. 15(1), pages 1-15, December.
    13. Barbara Emmenegger & Julien Massoni & Christine M. Pestalozzi & Miriam Bortfeld-Miller & Benjamin A. Maier & Julia A. Vorholt, 2023. "Identifying microbiota community patterns important for plant protection using synthetic communities and machine learning," Nature Communications, Nature, vol. 14(1), pages 1-15, December.
    14. Lara S. Franco & Danielle F. Shanahan & Richard A. Fuller, 2017. "A Review of the Benefits of Nature Experiences: More Than Meets the Eye," IJERPH, MDPI, vol. 14(8), pages 1-29, August.
    15. Corentin Hochart & Lucas Paoli & Hans-Joachim Ruscheweyh & Guillem Salazar & Emilie Boissin & Sarah Romac & Julie Poulain & Guillaume Bourdin & Guillaume Iwankow & Clémentine Moulin & Maren Ziegler & , 2023. "Ecology of Endozoicomonadaceae in three coral genera across the Pacific Ocean," Nature Communications, Nature, vol. 14(1), pages 1-16, December.
    16. Xiaoji (Christine) Liu & Kevin D. Floate & Monika A. Gorzelak & Devin B. Holman & Scott Hrycauk & Hiroshi Kubota & Newton Lupwayi & Jonathan A. D. Neilson & Rodrigo Ortega Polo & Renée M. Petri & Lan , 2023. "Prairie Agroecosystems: Interconnected Microbiomes of Livestock, Soil and Insects," Agriculture, MDPI, vol. 13(2), pages 1-28, January.
    17. Lharbi Dridi & Fernando Altamura & Emmanuel Gonzalez & Olivia Lui & Ryszard Kubinski & Reilly Pidgeon & Adrian Montagut & Jasmine Chong & Jianguo Xia & Corinne F. Maurice & Bastien Castagner, 2023. "Identifying glycan consumers in human gut microbiota samples using metabolic labeling coupled with fluorescence-activated cell sorting," Nature Communications, Nature, vol. 14(1), pages 1-13, December.
    18. Lingjing Jiang & Niina Haiminen & Anna‐Paola Carrieri & Shi Huang & Yoshiki Vázquez‐Baeza & Laxmi Parida & Ho‐Cheol Kim & Austin D. Swafford & Rob Knight & Loki Natarajan, 2022. "Utilizing stability criteria in choosing feature selection methods yields reproducible results in microbiome data," Biometrics, The International Biometric Society, vol. 78(3), pages 1155-1167, September.
    19. Piotr Rozwalak & Jakub Barylski & Yasas Wijesekara & Bas E. Dutilh & Andrzej Zielezinski, 2024. "Ultraconserved bacteriophage genome sequence identified in 1300-year-old human palaeofaeces," Nature Communications, Nature, vol. 15(1), pages 1-10, December.
    20. Zhenyan Zhang & Qi Zhang & Tingzhang Wang & Nuohan Xu & Tao Lu & Wenjie Hong & Josep Penuelas & Michael Gillings & Meixia Wang & Wenwen Gao & Haifeng Qian, 2022. "Assessment of global health risk of antibiotic resistance genes," Nature Communications, Nature, vol. 13(1), pages 1-11, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:15:y:2024:i:1:d:10.1038_s41467-023-44290-z. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.