IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v16y2025i1d10.1038_s41467-025-58442-w.html
   My bibliography  Save this article

Lineage-specific microbial protein prediction enables large-scale exploration of protein ecology within the human gut

Author

Listed:
  • Matthias A. Schmitz

    (RWTH University Hospital)

  • Nicholas J. Dimonaco

    (Queen’s University Belfast
    Aberystwyth University)

  • Thomas Clavel

    (RWTH University Hospital)

  • Thomas C. A. Hitch

    (RWTH University Hospital)

Abstract

Microbes use a range of genetic codes and gene structures, yet these are often ignored during metagenomic analysis. This causes spurious protein predictions, preventing functional assignment which limits our understanding of ecosystems. To resolve this, we developed a lineage-specific gene prediction approach that uses the correct genetic code based on the taxonomic assignment of genetic fragments, removes incomplete protein predictions, and optimises prediction of small proteins. Applied to 9634 metagenomes and 3594 genomes from the human gut, this approach increased the landscape of captured expressed microbial proteins by 78.9%, including previously hidden functional groups. Optimised small protein prediction captured 3,772,658 small protein clusters, which form an improved microbial protein catalogue of the human gut (MiProGut). To enable the ecological study of a protein’s prevalence and association with host parameters, we developed InvestiGUT, a tool which integrates both the protein sequences and sample metadata. Accurate prediction of proteins is critical to providing a functional understanding of microbiomes, enhancing our ability to study interactions between microbes and hosts.

Suggested Citation

  • Matthias A. Schmitz & Nicholas J. Dimonaco & Thomas Clavel & Thomas C. A. Hitch, 2025. "Lineage-specific microbial protein prediction enables large-scale exploration of protein ecology within the human gut," Nature Communications, Nature, vol. 16(1), pages 1-12, December.
  • Handle: RePEc:nat:natcom:v:16:y:2025:i:1:d:10.1038_s41467-025-58442-w
    DOI: 10.1038/s41467-025-58442-w
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-025-58442-w
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-025-58442-w?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Sigal Leviatan & Saar Shoer & Daphna Rothschild & Maria Gorodetski & Eran Segal, 2022. "An expanded reference map of the human gut microbiome reveals hundreds of previously unknown species," Nature Communications, Nature, vol. 13(1), pages 1-14, December.
    2. Luis Pedro Coelho & Renato Alves & Álvaro Rodríguez Río & Pernille Neve Myers & Carlos P. Cantalapiedra & Joaquín Giner-Lamia & Thomas Sebastian Schmidt & Daniel R. Mende & Askarbek Orakov & Ivica Let, 2022. "Towards the biogeography of prokaryotic genes," Nature, Nature, vol. 601(7892), pages 252-256, January.
    3. Xin V. Li & Irina Leonardi & Gregory G. Putzel & Alexa Semon & William D. Fiers & Takato Kusakabe & Woan-Yu Lin & Iris H. Gao & Itai Doron & Alejandra Gutierrez-Guerrero & Meghan B. DeCelie & Guilherm, 2022. "Author Correction: Immune regulation by fungal strain diversity in inflammatory bowel disease," Nature, Nature, vol. 608(7922), pages 21-21, August.
    4. Lisa Van den Broeck & Dinesh Kiran Bhosale & Kuncheng Song & Cássio Flavio Fonseca de Lima & Michael Ashley & Tingting Zhu & Shanshuo Zhu & Brigitte Van De Cotte & Pia Neyt & Anna C. Ortiz & Tiffany R, 2023. "Functional annotation of proteins for signaling network inference in non-model species," Nature Communications, Nature, vol. 14(1), pages 1-14, December.
    5. Xin V. Li & Irina Leonardi & Gregory G. Putzel & Alexa Semon & William D. Fiers & Takato Kusakabe & Woan-Yu Lin & Iris H. Gao & Itai Doron & Alejandra Gutierrez-Guerrero & Meghan B. DeCelie & Guilherm, 2022. "Immune regulation by fungal strain diversity in inflammatory bowel disease," Nature, Nature, vol. 603(7902), pages 672-678, March.
    6. Jan-Hendrik Hehemann & Gaëlle Correc & Tristan Barbeyron & William Helbert & Mirjam Czjzek & Gurvan Michel, 2010. "Transfer of carbohydrate-active enzymes from marine bacteria to Japanese gut microbiota," Nature, Nature, vol. 464(7290), pages 908-912, April.
    7. Shuqin Zeng & Dhrati Patangia & Alexandre Almeida & Zhemin Zhou & Dezhi Mu & R. Paul Ross & Catherine Stanton & Shaopu Wang, 2022. "A compendium of 32,277 metagenome-assembled genomes and over 80 million genes from the early-life human gut microbiome," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Tingting Zhou & Norma V. Solis & Michaela Marshall & Qing Yao & Rachel Garleb & Mengli Yang & Eric Pearlman & Scott G. Filler & Haoping Liu, 2024. "Hyphal Als proteins act as CR3 ligands to promote immune responses against Candida albicans," Nature Communications, Nature, vol. 15(1), pages 1-13, December.
    2. Tingting Zhou & Norma V. Solis & Michaela Marshall & Qing Yao & Eric Pearlman & Scott G. Filler & Haoping Liu, 2025. "Fungal Als proteins hijack host death effector domains to promote inflammasome signaling," Nature Communications, Nature, vol. 16(1), pages 1-13, December.
    3. Steven J. Biller & M. Gray Ryan & Jasmine Li & Andrew Burger & John M. Eppley & Thomas Hackl & Edward F. DeLong, 2025. "Distinct horizontal gene transfer potential of extracellular vesicles versus viral-like particles in marine habitats," Nature Communications, Nature, vol. 16(1), pages 1-12, December.
    4. Luisa M Arias-Giraldo & Marina Muñoz & Carolina Hernández & Giovanny Herrera & Natalia Velásquez-Ortiz & Omar Cantillo-Barraza & Plutarco Urbano & Juan David Ramírez, 2020. "Species-dependent variation of the gut bacterial communities across Trypanosoma cruzi insect vectors," PLOS ONE, Public Library of Science, vol. 15(11), pages 1-16, November.
    5. Xianzhe Gong & Álvaro Rodríguez Río & Le Xu & Zhiyi Chen & Marguerite V. Langwig & Lei Su & Mingxue Sun & Jaime Huerta-Cepas & Valerie Anda & Brett J. Baker, 2022. "New globally distributed bacterial phyla within the FCB superphylum," Nature Communications, Nature, vol. 13(1), pages 1-12, December.
    6. Laura Baldassarre & Hua Ying & Adam M. Reitzel & Sören Franzenburg & Sebastian Fraune, 2022. "Microbiota mediated plasticity promotes thermal adaptation in the sea anemone Nematostella vectensis," Nature Communications, Nature, vol. 13(1), pages 1-13, December.
    7. Wenhui Li & Xianyue Jiang & Wuke Wang & Liya Hou & Runze Cai & Yongqian Li & Qiuxi Gu & Qinchang Chen & Peixiang Ma & Jin Tang & Menghao Guo & Guohui Chuai & Xingxu Huang & Jun Zhang & Qi Liu, 2024. "Discovering CRISPR-Cas system with self-processing pre-crRNA capability by foundation models," Nature Communications, Nature, vol. 15(1), pages 1-14, December.
    8. Kang Li & Zeng Dan & Luobu Gesang & Hong Wang & Yongjian Zhou & Yanlei Du & Yi Ren & Yixiang Shi & Yuqiang Nie, 2016. "Comparative Analysis of Gut Microbiota of Native Tibetan and Han Populations Living at Different Altitudes," PLOS ONE, Public Library of Science, vol. 11(5), pages 1-16, May.
    9. Wei Chen & Min Qiu & Petra Paizs & Miriam Sadowski & Toma Ramonaite & Lieby Zborovsky & Raquel Mejias-Luque & Klaus-Peter Janßen & James Kinross & Robert D. Goldin & Monica Rebec & Manuel Liebeke & Zo, 2025. "Universal, untargeted detection of bacteria in tissues using metabolomics workflows," Nature Communications, Nature, vol. 16(1), pages 1-12, December.
    10. Yiqian Duan & Célio Dias Santos-Júnior & Thomas Sebastian Schmidt & Anthony Fullam & Breno L. S. Almeida & Chengkai Zhu & Michael Kuhn & Xing-Ming Zhao & Peer Bork & Luis Pedro Coelho, 2024. "A catalog of small proteins from the global microbiome," Nature Communications, Nature, vol. 15(1), pages 1-11, December.
    11. Shaojun Pan & Chengkai Zhu & Xing-Ming Zhao & Luis Pedro Coelho, 2022. "A deep siamese neural network improves metagenome-assembled genomes in microbiome datasets across different environments," Nature Communications, Nature, vol. 13(1), pages 1-12, December.
    12. Xiyang Dong & Yongyi Peng & Muhua Wang & Laura Woods & Wenxue Wu & Yong Wang & Xi Xiao & Jiwei Li & Kuntong Jia & Chris Greening & Zongze Shao & Casey R. J. Hubert, 2023. "Evolutionary ecology of microbial populations inhabiting deep sea sediments associated with cold seeps," Nature Communications, Nature, vol. 14(1), pages 1-13, December.
    13. Zepeng Qu & Hongbin Liu & Ji Yang & Linggang Zheng & Jumin Huang & Ziming Wang & Chun Xie & Wenlong Zuo & Xiong Xia & Lin Sun & Yifa Zhou & Ying Xie & Jingguang Lu & Yizhun Zhu & Lili Yu & Lihua Liu &, 2025. "Selective utilization of medicinal polysaccharides by human gut Bacteroides and Parabacteroides species," Nature Communications, Nature, vol. 16(1), pages 1-17, December.
    14. Patrick J. Dörner & Harithaa Anandakumar & Ivo Röwekamp & Facundo Fiocca Vernengo & Belén Millet Pascual-Leone & Marta Krzanowski & Josua Sellmaier & Ulrike Brüning & Raphaela Fritsche-Guenther & Lenn, 2024. "Clinically used broad-spectrum antibiotics compromise inflammatory monocyte-dependent antibacterial defense in the lung," Nature Communications, Nature, vol. 15(1), pages 1-14, December.
    15. Irena Beidler & Nicola Steinke & Tim Schulze & Chandni Sidhu & Daniel Bartosik & Marie-Katherin Zühlke & Laura Torres Martin & Joris Krull & Theresa Dutschei & Borja Ferrero-Bordera & Julia Rielicke &, 2024. "Alpha-glucans from bacterial necromass indicate an intra-population loop within the marine carbon cycle," Nature Communications, Nature, vol. 15(1), pages 1-15, December.
    16. Ayya Keshet & Eran Segal, 2024. "Identification of gut microbiome features associated with host metabolic health in a large population-based cohort," Nature Communications, Nature, vol. 15(1), pages 1-13, December.
    17. Ning Duan & Emily Hand & Mannuku Pheko & Shikha Sharma & Akintunde Emiola, 2024. "Structure-guided discovery of anti-CRISPR and anti-phage defense proteins," Nature Communications, Nature, vol. 15(1), pages 1-10, December.
    18. Carla Pérez-Cruz & Alicia Moraleda-Montoya & Raquel Liébana & Oihana Terrones & Uxue Arrizabalaga & Mikel García-Alija & Maier Lorizate & Ana Martínez Gascueña & Isabel García-Álvarez & Jon Ander Niet, 2024. "Mechanisms of recalcitrant fucoidan breakdown in marine Planctomycetota," Nature Communications, Nature, vol. 15(1), pages 1-24, December.
    19. Mingyue Cheng & Shuai Luo & Peng Zhang & Guangzhou Xiong & Kai Chen & Chuanqi Jiang & Fangdian Yang & Hanhui Huang & Pengshuo Yang & Guanxi Liu & Yuhao Zhang & Sang Ba & Ping Yin & Jie Xiong & Wei Mia, 2024. "A genome and gene catalog of the aquatic microbiomes of the Tibetan Plateau," Nature Communications, Nature, vol. 15(1), pages 1-13, December.
    20. Shuqin Zeng & Alexandre Almeida & Shiping Li & Junjie Ying & Hua Wang & Yi Qu & R. Paul Ross & Catherine Stanton & Zhemin Zhou & Xiaoyu Niu & Dezhi Mu & Shaopu Wang, 2024. "A metagenomic catalog of the early-life human gut virome," Nature Communications, Nature, vol. 15(1), pages 1-16, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:16:y:2025:i:1:d:10.1038_s41467-025-58442-w. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.