IDEAS home Printed from https://ideas.repec.org/a/plo/pcbi00/1006096.html

Using pseudoalignment and base quality to accurately quantify microbial community composition

Author

Listed:
  • Mark Reppell
  • John Novembre

Abstract

Pooled DNA from multiple unknown organisms arises in a variety of contexts, for example microbial samples from ecological or human health research. Determining the composition of pooled samples can be difficult, especially at the scale of modern sequencing data and reference databases. Here we propose a novel method for taxonomic profiling in pooled DNA that combines the speed and low-memory requirements of k-mer based pseudoalignment with a likelihood framework that uses base quality information to better resolve multiply mapped reads. We apply the method to the problem of classifying 16S rRNA reads using a reference database of known organisms, a common challenge in microbiome research. Using simulations, we show the method is accurate across a variety of read lengths, with different length reference sequences, at different sample depths, and when samples contain reads originating from organisms absent from the reference. We also assess performance in real 16S data, where we reanalyze previous genetic association data to show our method discovers a larger number of quantitative trait associations than other widely used methods. We implement our method in the software Karp, for k-mer based analysis of read pools, to provide a novel combination of speed and accuracy that is uniquely suited for enhancing discoveries in microbial studies.Author summary: Pooled DNA from multiple unknown organisms arises in a variety of contexts. Determining the composition of pooled samples can be difficult, especially at the scale of modern data. Here we propose the novel method Karp, designed to perform taxonomic profiling in pooled DNA. Karp combines the speed and low-memory requirements of k-mer based pseudoalignment with a likelihood framework that uses base quality information to better resolve multiply mapped reads. We apply Karp to the problem of classifying 16S rRNA reads using a reference database of known organisms. Using simulations, we show Karp is accurate across a variety of read lengths, reference sequence lengths, sample depths, and when samples contain reads originating from organisms absent from the reference. We also assess performance in real 16S data, where we reanalyze previous genetic association data to show that relative to other widely used quantification methods Karp reveals a larger number of microbiome quantitative trait association signals. Modern sequencing technology gives us unprecedented access to microbial communities, but uncovering significant findings requires correctly interpreting pooled microbial DNA. Karp provides a novel combination of speed and accuracy that makes it uniquely suited for enhancing discoveries in microbial studies.

Suggested Citation

  • Mark Reppell & John Novembre, 2018. "Using pseudoalignment and base quality to accurately quantify microbial community composition," PLOS Computational Biology, Public Library of Science, vol. 14(4), pages 1-23, April.
  • Handle: RePEc:plo:pcbi00:1006096
    DOI: 10.1371/journal.pcbi.1006096
    as

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1006096
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1006096&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pcbi.1006096?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Peter J. Turnbaugh & Micah Hamady & Tanya Yatsunenko & Brandi L. Cantarel & Alexis Duncan & Ruth E. Ley & Mitchell L. Sogin & William J. Jones & Bruce A. Roe & Jason P. Affourtit & Michael Egholm & Be, 2009. "A core gut microbiome in obese and lean twins," Nature, Nature, vol. 457(7228), pages 480-484, January.
    2. Oren E Livne & Lide Han & Gorka Alkorta-Aranburu & William Wentworth-Sheilds & Mark Abney & Carole Ober & Dan L Nicolae, 2015. "PRIMAL: Fast and Accurate Pedigree-based Imputation from Sequence Data in a Founder Population," PLOS Computational Biology, Public Library of Science, vol. 11(3), pages 1-14, March.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Rohan Maddamsetti & Irida Shyti & Maggie L. Wilson & Hye-In Son & Yasa Baig & Zhengqing Zhou & Jia Lu & Lingchong You, 2025. "Scaling laws of bacterial and archaeal plasmids," Nature Communications, Nature, vol. 16(1), pages 1-14, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Xingqing Zhao & Jian Huang & Xuyan Zhu & Jinchun Chai & Xiaoli Ji, 2020. "Ecological Effects of Heavy Metal Pollution on Soil Microbial Community Structure and Diversity on Both Sides of a River around a Mining Area," IJERPH, MDPI, vol. 17(16), pages 1-18, August.
    2. repec:plo:pone00:0048998 is not listed on IDEAS
    3. Chihiro Morita & Hirokazu Tsuji & Tomokazu Hata & Motoharu Gondo & Shu Takakura & Keisuke Kawai & Kazufumi Yoshihara & Kiyohito Ogata & Koji Nomoto & Kouji Miyazaki & Nobuyuki Sudo, 2015. "Gut Dysbiosis in Patients with Anorexia Nervosa," PLOS ONE, Public Library of Science, vol. 10(12), pages 1-13, December.
    4. Huang Lin & Merete Eggesbø & Shyamal Das Peddada, 2022. "Linear and nonlinear correlation estimators unveil undescribed taxa interactions in microbiome data," Nature Communications, Nature, vol. 13(1), pages 1-16, December.
    5. Patrick D Schloss, 2009. "A High-Throughput DNA Sequence Aligner for Microbial Ecology Studies," PLOS ONE, Public Library of Science, vol. 4(12), pages 1-9, December.
    6. John Molloy & Katrina Allen & Fiona Collier & Mimi L. K. Tang & Alister C. Ward & Peter Vuillermin, 2013. "The Potential Link between Gut Microbiota and IgE-Mediated Food Allergy in Early Life," IJERPH, MDPI, vol. 10(12), pages 1-22, December.
    7. repec:abf:journl:v:28:y:2020:i:2:p:21520-21524 is not listed on IDEAS
    8. Amy Ko & Rasmus Nielsen, 2017. "Composite likelihood method for inferring local pedigrees," PLOS Genetics, Public Library of Science, vol. 13(8), pages 1-21, August.
    9. repec:plo:pone00:0049785 is not listed on IDEAS
    10. Ahmed A Metwally & Philip S Yu & Derek Reiman & Yang Dai & Patricia W Finn & David L Perkins, 2019. "Utilizing longitudinal microbiome taxonomic profiles to predict food allergy via Long Short-Term Memory networks," PLOS Computational Biology, Public Library of Science, vol. 15(2), pages 1-16, February.
    11. Umra Rasool & Luigi Bertolotti & Peter Thomson, 2024. "Gut Microbiota Modulation in Veterinary Medicine - Faecal Microbiota Transplantation as a New Frontier," Biomedical Journal of Scientific & Technical Research, Biomedical Research Network+, LLC, vol. 57(5), pages 49732-49741, July.
    12. Pirjo Wacklin & Harri Mäkivuokko & Noora Alakulppi & Janne Nikkilä & Heli Tenkanen & Jarkko Räbinä & Jukka Partanen & Kari Aranko & Jaana Mättö, 2011. "Secretor Genotype (FUT2 gene) Is Strongly Associated with the Composition of Bifidobacteria in the Human Intestine," PLOS ONE, Public Library of Science, vol. 6(5), pages 1-10, May.
    13. Yunxi Liu & R. A. Leo Elworth & Michael D. Jochum & Kjersti M. Aagaard & Todd J. Treangen, 2022. "De novo identification of microbial contaminants in low microbial biomass microbiomes with Squeegee," Nature Communications, Nature, vol. 13(1), pages 1-14, December.
    14. C. E. Dubé & M. Ziegler & A. Mercière & E. Boissin & S. Planes & C. A. -F. Bourmaud & C. R. Voolstra, 2021. "Naturally occurring fire coral clones demonstrate a genetic and environmental basis of microbiome composition," Nature Communications, Nature, vol. 12(1), pages 1-12, December.
    15. Koji Hosomi & Mayu Saito & Jonguk Park & Haruka Murakami & Naoko Shibata & Masahiro Ando & Takahiro Nagatake & Kana Konishi & Harumi Ohno & Kumpei Tanisawa & Attayeb Mohsen & Yi-An Chen & Hitoshi Kawa, 2022. "Oral administration of Blautia wexlerae ameliorates obesity and type 2 diabetes via metabolic remodeling of the gut microbiota," Nature Communications, Nature, vol. 13(1), pages 1-17, December.
    16. Bo Yuan & Shulei Wang, 2025. "Microbiome data integration via shared dictionary learning," Nature Communications, Nature, vol. 16(1), pages 1-20, December.
    17. Dongyang Yang & Wei Xu, 2023. "Estimation of Mediation Effect on Zero-Inflated Microbiome Mediators," Mathematics, MDPI, vol. 11(13), pages 1-16, June.
    18. Mariana F. Fernández & Iris Reina-Pérez & Juan Manuel Astorga & Andrea Rodríguez-Carrillo & Julio Plaza-Díaz & Luis Fontana, 2018. "Breast Cancer and Its Relationship with the Microbiota," IJERPH, MDPI, vol. 15(8), pages 1-20, August.
    19. Yao Li & Wei Xu, 2025. "Causal Mediation Tree Model for Feature Identification on High-Dimensional Mediators," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 17(1), pages 151-173, April.
    20. Gertrude Ecklu-Mensah & Candice Choo-Kang & Maria Gjerstad Maseng & Sonya Donato & Pascal Bovet & Bharathi Viswanathan & Kweku Bedu-Addo & Jacob Plange-Rhule & Prince Oti Boateng & Terrence E. Forrest, 2023. "Gut microbiota and fecal short chain fatty acids differ with adiposity and country of origin: the METS-microbiome study," Nature Communications, Nature, vol. 14(1), pages 1-17, December.
    21. Feng Xian & Malena Brenek & Christoph Krisp & Elisabeth Urbauer & Ranjith Kumar Ravi Kumar & Doriane Aguanno & Tharan Srikumar & Qixin Liu & Allison M. Barry & Bin Ma & Jonathan Krieger & Dirk Haller , 2025. "Ultra-sensitive metaproteomics redefines the dark metaproteome, uncovering host-microbiome interactions and drug targets in intestinal diseases," Nature Communications, Nature, vol. 16(1), pages 1-16, December.
    22. Silvia Rodrigues Jardim & Lucila Marieta Perrotta de Souza & Heitor Siffert Pereira de Souza, 2023. "The Rise of Gastrointestinal Cancers as a Global Phenomenon: Unhealthy Behavior or Progress?," IJERPH, MDPI, vol. 20(4), pages 1-23, February.
    23. Li, Jie & Shen, Xuzhu & Li, YaoTang, 2021. "Modeling the temporal dynamics of gut microbiota from a local community perspective," Ecological Modelling, Elsevier, vol. 460(C).

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1006096. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.