
BMDD: A probabilistic framework for accurate imputation of zero-inflated microbiome sequencing data

Author

Listed:
  • Huijuan Zhou
  • Jun Chen
  • Xianyang Zhang

Abstract

Microbiome sequencing data are inherently sparse and compositional, with excessive zeros arising from biological absence or insufficient sampling. These zeros pose significant challenges for downstream analyses, particularly those that require log-transformation. We introduce BMDD (BiModal Dirichlet Distribution), a novel probabilistic modeling framework for accurate imputation of microbiome sequencing data. Unlike existing imputation approaches that assume unimodal abundance, BMDD captures the bimodal abundance distribution of the taxa via a mixture of Dirichlet priors. It uses variational inference and a scalable expectation-maximization algorithm for efficient imputation. Through simulations and real microbiome datasets, we demonstrate that BMDD outperforms competing methods in reconstructing true abundances and improves the performance of differential abundance analysis. Through multiple posterior samples, BMDD enables robust inference by accounting for uncertainty in zero imputation. Our method offers a principled and computationally efficient solution for analyzing high-dimensional, zero-inflated microbiome sequencing data and is broadly applicable in microbial biomarker discovery and host-microbiome interaction studies.

Author summary: Understanding the microbes living in and on our bodies (the microbiome) relies on analyzing complex sequencing data. However, these data often contain many zeros, either because a microbe is truly absent or because it was missed due to insufficient sampling. These zeros make it hard to analyze microbial patterns accurately and to identify important differences between groups, especially for methods that work on a log scale. To address this, we developed a new method called BMDD that uses a more realistic model to impute the zeros. Unlike existing tools that assume each microbe follows a unimodal abundance distribution, BMDD allows microbes to follow a bimodal distribution, so they can behave differently in different conditions. It provides not just a single guess but a range of possible values to better reflect the uncertainty. Our testing shows that BMDD more accurately recovers the true microbial profiles and improves the ability to detect meaningful differences between groups. This method can help researchers gain clearer insights into how the microbiome affects health and disease.
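
The idea of replacing zeros with multiple posterior draws can be illustrated with a toy sketch. This is not the BMDD algorithm: it only uses the standard fact that a Dirichlet prior combined with multinomial counts gives a Dirichlet(counts + alpha) posterior, and it stands in for a bimodal prior by letting each taxon take its concentration from one of two hand-picked modes. The function names, the concentration values, and the `in_high_mode` flags are all hypothetical; per the abstract, BMDD estimates its parameters with variational inference and an EM algorithm rather than fixing them by hand.

```python
import math
import random


def sample_dirichlet(alpha, rng):
    """Draw one Dirichlet sample by normalizing independent Gamma draws."""
    draws = [rng.gammavariate(a, 1.0) for a in alpha]
    total = sum(draws)
    return [d / total for d in draws]


def impute_posterior(counts, alpha_low, alpha_high, in_high_mode,
                     n_samples=5, seed=0):
    """Return n_samples strictly positive compositions for one sample.

    Assumption (illustrative only): each taxon's prior concentration is
    alpha_high[j] if in_high_mode[j] else alpha_low[j]. By conjugacy the
    posterior over the composition is Dirichlet(counts + alpha).
    """
    rng = random.Random(seed)
    alpha = [hi if flag else lo
             for lo, hi, flag in zip(alpha_low, alpha_high, in_high_mode)]
    posterior = [c + a for c, a in zip(counts, alpha)]
    return [sample_dirichlet(posterior, rng) for _ in range(n_samples)]


# Hypothetical counts for four taxa in one sample; two taxa are zero.
counts = [120, 0, 35, 0]
alpha_low = [0.5, 0.5, 0.5, 0.5]    # "low-abundance" mode (made-up values)
alpha_high = [5.0, 5.0, 5.0, 5.0]   # "high-abundance" mode (made-up values)
in_high_mode = [True, False, True, True]

samples = impute_posterior(counts, alpha_low, alpha_high, in_high_mode)

# Every imputed proportion is positive, so the log transform is defined.
log_samples = [[math.log(p) for p in s] for s in samples]
```

Running a downstream analysis once per draw and combining the results is one way to carry the imputation uncertainty forward, which is the role the abstract assigns to multiple posterior samples.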

Suggested Citation

  • Huijuan Zhou & Jun Chen & Xianyang Zhang, 2025. "BMDD: A probabilistic framework for accurate imputation of zero-inflated microbiome sequencing data," PLOS Computational Biology, Public Library of Science, vol. 21(10), pages 1-21, October.
  • Handle: RePEc:plo:pcbi00:1013124
    DOI: 10.1371/journal.pcbi.1013124

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1013124
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1013124&type=printable
    Download Restriction: no



    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Kerstin Thriene & Karin B. Michels, 2023. "Human Gut Microbiota Plasticity throughout the Life Course," IJERPH, MDPI, vol. 20(2), pages 1-14, January.
    2. Efrat Muller & Itamar Shiryan & Elhanan Borenstein, 2024. "Multi-omic integration of microbiome data for identifying disease-associated modules," Nature Communications, Nature, vol. 15(1), pages 1-13, December.
    3. Wanting Dong & Xinyue Fan & Yaqiong Guo & Siyi Wang & Shulei Jia & Na Lv & Tao Yuan & Yuanlong Pan & Yong Xue & Xi Chen & Qian Xiong & Ruifu Yang & Weigang Zhao & Baoli Zhu, 2024. "An expanded database and analytical toolkit for identifying bacterial virulence factors and their associations with chronic diseases," Nature Communications, Nature, vol. 15(1), pages 1-16, December.
    4. Ashwag Shami & Rewaa S. Jalal & Ruba A. Ashy & Haneen W. Abuauf & Lina Baz & Mohammed Y. Refai & Aminah A. Barqawi & Hanadi M. Baeissa & Manal A. Tashkandi & Sahar Alshareef & Aala A. Abulfaraj, 2022. "Use of Metagenomic Whole Genome Shotgun Sequencing Data in Taxonomic Assignment of Dipterygium glaucum Rhizosphere and Surrounding Bulk Soil Microbiomes, and Their Response to Watering," Sustainability, MDPI, vol. 14(14), pages 1-21, July.
    5. Hung-Yu Chiang & Hsueh-Han Lu & Janaki N. Sudhakar & Yu-Wen Chen & Nien-Shin Shih & Yi-Ting Weng & Jr-Wen Shui, 2022. "IL-22 initiates an IL-18-dependent epithelial response circuit to enforce intestinal host defence," Nature Communications, Nature, vol. 13(1), pages 1-19, December.
    6. Jiaxin Liu & Xue Yan & Qiang Ding & Jiwu Xiang & Zuna Wei & Qian Yang & Kangwei Xie & Bo Cheng & Xiaoying Xie, 2025. "Porous Carbon Derived from Pumpkin Tissue as an Efficient Bioanode Toward Wastewater Treatment in Microbial Fuel Cells," Sustainability, MDPI, vol. 17(11), pages 1-17, May.
    7. Lijuan Kong & Qijin Zhao & Xiaojing Jiang & Jinping Hu & Qian Jiang & Li Sheng & Xiaohong Peng & Shusen Wang & Yibing Chen & Yanjun Wan & Shaocong Hou & Xingfeng Liu & Chunxiao Ma & Yan Li & Li Quan &, 2024. "Trimethylamine N-oxide impairs β-cell function and glucose tolerance," Nature Communications, Nature, vol. 15(1), pages 1-17, December.
    8. Can Cui & Susheela P. Singh & Ana‐Maria Staicu & Brian J. Reich, 2021. "Bayesian variable selection for high‐dimensional rank data," Environmetrics, John Wiley & Sons, Ltd., vol. 32(7), November.
    9. Maja Czerwińska-Rogowska & Karolina Skonieczna-Żydecka & Krzysztof Kaseja & Karolina Jakubczyk & Joanna Palma & Marta Bott-Olejnik & Sławomir Brzozowski & Ewa Stachowska, 2022. "Kitchen Diet vs. Industrial Diets—Impact on Intestinal Barrier Parameters among Stroke Patients," IJERPH, MDPI, vol. 19(10), pages 1-11, May.
    10. Li, Yanhui & Zhao, Luqing & Wang, Jinjuan, 2025. "A debiasing phylogenetic tree-assisted regression model for microbiome data," Computational Statistics & Data Analysis, Elsevier, vol. 205(C).
    11. Johanne K. Hansen & Mads Israelsen & Suguru Nishijima & Sara E. Stinson & Peter Andersen & Stine Johansen & Camilla D. Hansen & Maximilian Joseph Brol & Sabine Klein & Robert Schierwagen & Frank Erhar, 2025. "The postbiotic ReFerm® versus standard nutritional support in advanced alcohol-related liver disease (GALA-POSTBIO): a randomized controlled phase 2 trial," Nature Communications, Nature, vol. 16(1), pages 1-16, December.
    12. Daphna Rothschild & Sigal Leviatan & Ariel Hanemann & Yossi Cohen & Omer Weissbrod & Eran Segal, 2022. "An atlas of robust microbiome associations with phenotypic traits based on large-scale cohorts from two continents," PLOS ONE, Public Library of Science, vol. 17(3), pages 1-20, March.
    13. Runtan Cheng & Lu Wang & Shenglong Le & Yifan Yang & Can Zhao & Xiangqi Zhang & Xin Yang & Ting Xu & Leiting Xu & Petri Wiklund & Jun Ge & Dajiang Lu & Chenhong Zhang & Luonan Chen & Sulin Cheng, 2022. "A randomized controlled trial for response of microbiome network to exercise and diet intervention in patients with nonalcoholic fatty liver disease," Nature Communications, Nature, vol. 13(1), pages 1-13, December.
    14. Doris R. Pierce & Malcolm McDonald & Lea Merone & Luke Becker & Fintan Thompson & Chris Lewis & Rachael Y. M. Ryan & Sze Fui Hii & Patsy A. Zendejas-Heredia & Rebecca J. Traub & Matthew A. Field & Ton, 2023. "Effect of experimental hookworm infection on insulin resistance in people at risk of type 2 diabetes," Nature Communications, Nature, vol. 14(1), pages 1-11, December.
    15. Jim Parker & Claire O’Brien & Jason Hawrelak & Felice L. Gersh, 2022. "Polycystic Ovary Syndrome: An Evolutionary Adaptation to Lifestyle and the Environment," IJERPH, MDPI, vol. 19(3), pages 1-25, January.
    16. Yu-Feng Wei & Ming-Shyan Huang & Cheng-Hsieh Huang & Yao-Tsung Yeh & Chih-Hsin Hung, 2022. "Impact of Gut Dysbiosis on the Risk of Non-Small-Cell Lung Cancer," IJERPH, MDPI, vol. 19(23), pages 1-17, November.
    17. Kumaraswamy Jeyaram & Leo Lahti & Sebastian Tims & Hans G. H. J. Heilig & Antonie H. Gelder & Willem M. Vos & Hauke Smidt & Erwin G. Zoetendal, 2025. "Fermented foods affect the seasonal stability of gut bacteria in an Indian rural population," Nature Communications, Nature, vol. 16(1), pages 1-18, December.
    18. Seung Jin Han & Kyoung Hwa Ha & Ja Young Jeon & Hae Jin Kim & Kwan Woo Lee & Dae Jung Kim, 2015. "Impact of Cadmium Exposure on the Association between Lipopolysaccharide and Metabolic Syndrome," IJERPH, MDPI, vol. 12(9), pages 1-14, September.
    19. Kristin M. Ham & Layne K. Bower & Shanping Li & Hernan Lorenzi & Safiatou Doumbo & Didier Doumtabe & Kassoum Kayentao & Aissata Ongoiba & Boubacar Traore & Peter D. Crompton & Nathan W. Schmidt, 2024. "The gut microbiome is associated with susceptibility to febrile malaria in Malian children," Nature Communications, Nature, vol. 15(1), pages 1-16, December.
    20. Magdalena Jastrzębska & Urszula Wachowska & Marta K. Kostrzewska, 2020. "Pathogenic and Non-Pathogenic Fungal Communities in Wheat Grain as Influenced by Recycled Phosphorus Fertilizers: A Case Study," Agriculture, MDPI, vol. 10(6), pages 1-15, June.
