Author
Listed:
- Manuel Razo-Mejia
- Madhav Mani
- Dmitri Petrov
Abstract
The tracking of lineage frequencies via DNA barcode sequencing enables the quantification of microbial fitness. However, experimental noise coming from biotic and abiotic sources complicates the computation of a reliable inference. We present a Bayesian pipeline to infer relative microbial fitness from high-throughput lineage tracking assays. Our model accounts for multiple sources of noise and propagates uncertainties throughout all parameters in a systematic way. Furthermore, using modern variational inference methods based on automatic differentiation, we are able to scale the inference to a large number of unique barcodes. We extend this core model to analyze multi-environment assays, replicate experiments, and barcodes linked to genotypes. On simulations, our method recovers known parameters within posterior credible intervals. This work provides a generalizable Bayesian framework to analyze lineage tracking experiments. The accompanying open-source software library enables the adoption of principled statistical methods in experimental evolution.Author summary: In this study, we present a novel Bayesian pipeline for analyzing DNA barcode tracking sequencing data, addressing the challenge of accurately quantifying competitive microbial fitness in the presence of experimental noise. Our method uniquely contributes to understanding microbial evolutionary dynamics by enabling reliable inference of the relative fitness of diverse microbial strains from high-throughput lineage tracking assays. Our approach is distinct in its ability to systematically account for and propagate uncertainties from various noise sources throughout all inferred parameters. Furthermore, the error-propagation quality of our Bayesian method allows us to extend the inference pipeline to common dataset structures, such as jointly analyzing multiple experimental replicates or accounting for multiple unique barcodes mapping the equivalent genotypes. This comprehensive treatment of uncertainties is crucial in experimental settings where noise can significantly influence the results. Furthermore, we have optimized our pipeline for scalability, allowing it to handle large numbers of unique barcodes effectively. This scalability is essential for analyzing complex datasets typical in microbial fitness studies.
Suggested Citation
Manuel Razo-Mejia & Madhav Mani & Dmitri Petrov, 2024.
"Bayesian inference of relative fitness on high-throughput pooled competition assays,"
PLOS Computational Biology, Public Library of Science, vol. 20(3), pages 1-18, March.
Handle:
RePEc:plo:pcbi00:1011937
DOI: 10.1371/journal.pcbi.1011937
Download full text from publisher
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1011937. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.