IDEAS home Printed from https://ideas.repec.org/a/plo/pcbi00/1003297.html
   My bibliography  Save this article

Comparing Algorithms That Reconstruct Cell Lineage Trees Utilizing Information on Microsatellite Mutations

Author

Listed:
  • Noa Chapal-Ilani
  • Yosef E Maruvka
  • Adam Spiro
  • Yitzhak Reizel
  • Rivka Adar
  • Liran I Shlush
  • Ehud Shapiro

Abstract

Organism cells proliferate and die to build, maintain, renew and repair it. The cellular history of an organism up to any point in time can be captured by a cell lineage tree in which vertices represent all organism cells, past and present, and directed edges represent progeny relations among them. The root represents the fertilized egg, and the leaves represent extant and dead cells. Somatic mutations accumulated during cell division endow each organism cell with a genomic signature that is unique with a very high probability. Distances between such genomic signatures can be used to reconstruct an organism's cell lineage tree. Cell populations possess unique features that are absent or rare in organism populations (e.g., the presence of stem cells and a small number of generations since the zygote) and do not undergo sexual reproduction, hence the reconstruction of cell lineage trees calls for careful examination and adaptation of the standard tools of population genetics. Our lab developed a method for reconstructing cell lineage trees by examining only mutations in highly variable microsatellite loci (MS, also called short tandem repeats, STR). In this study we use experimental data on somatic mutations in MS of individual cells in human and mice in order to validate and quantify the utility of known lineage tree reconstruction algorithms in this context. We employed extensive measurements of somatic mutations in individual cells which were isolated from healthy and diseased tissues of mice and humans. The validation was done by analyzing the ability to infer known and clear biological scenarios. In general, we found that if the biological scenario is simple, almost all algorithms tested can infer it. Another somewhat surprising conclusion is that the best algorithm among those tested is Neighbor Joining where the distance measure used is normalized absolute distance. We include our full dataset in Tables S1, S2, S3, S4, S5 to enable further analysis of this data by others.Author Summary: The history of an organism's cells, from a single cell until any particular moment in time, can be captured by a cell lineage tree. Many fundamental open questions in biology and medicine, such as which cells give rise to metastases, whether oocytes and beta cells renew, and what is the role of stem cells in brain development and maintenance, are in fact questions about the structure and dynamics of that tree. Random mutations that occur during cell division endow each organism cell with an almost unique genomic signature. Distances between signatures capture distances in the cell lineage tree, and can be used to reconstruct that tree. On this basis, our lab developed a method for cell lineage reconstruction utilizing a panel of about 120 microsatellites. In this work, we use a large dataset of microsatellite mutations from many cells that we collected in our lab in the last few years, in order to test the performance of different distance measures and tree reconstruction algorithms. We found that the best method is not the one that gives the most accurate estimates of the mean distance, but rather the one with the lowest variance.

Suggested Citation

  • Noa Chapal-Ilani & Yosef E Maruvka & Adam Spiro & Yitzhak Reizel & Rivka Adar & Liran I Shlush & Ehud Shapiro, 2013. "Comparing Algorithms That Reconstruct Cell Lineage Trees Utilizing Information on Microsatellite Mutations," PLOS Computational Biology, Public Library of Science, vol. 9(11), pages 1-17, November.
  • Handle: RePEc:plo:pcbi00:1003297
    DOI: 10.1371/journal.pcbi.1003297
    as

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1003297
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1003297&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pcbi.1003297?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Feng Gao & Elizabeth Bailes & David L. Robertson & Yalu Chen & Cynthia M. Rodenburg & Scott F. Michael & Larry B. Cummins & Larry O. Arthur & Martine Peeters & George M. Shaw & Paul M. Sharp & Beatric, 1999. "Origin of HIV-1 in the chimpanzee Pan troglodytes troglodytes," Nature, Nature, vol. 397(6718), pages 436-441, February.
    2. Ian J. Wilson & Michael E. Weale & David J. Balding, 2003. "Inferences from DNA data: population histories, evolutionary processes and forensic match probabilities," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 166(2), pages 155-188, June.
    3. Nicholas Navin & Jude Kendall & Jennifer Troge & Peter Andrews & Linda Rodgers & Jeanne McIndoo & Kerry Cook & Asya Stepansky & Dan Levy & Diane Esposito & Lakshmi Muthuswamy & Alex Krasnitz & W. Rich, 2011. "Tumour evolution inferred by single-cell sequencing," Nature, Nature, vol. 472(7341), pages 90-94, April.
    4. Dan Frumkin & Adam Wasserstrom & Shai Kaplan & Uriel Feige & Ehud Shapiro, 2005. "Genomic Variability within an Organism Exposes Its Cell Lineage Tree," PLOS Computational Biology, Public Library of Science, vol. 1(5), pages 1-13, October.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Tsukuda Koji & Mano Shuhei & Yamamoto Toshimichi, 2020. "Bayesian approach to discriminant problems for count data with application to multilocus short tandem repeat dataset," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 19(2), pages 1-18, April.
    2. Yeboah, Ian E.A., 2007. "HIV/AIDS and the construction of Sub-Saharan Africa: Heuristic lessons from the social sciences for policy," Social Science & Medicine, Elsevier, vol. 64(5), pages 1128-1150, March.
    3. Damien G Hicks & Terence P Speed & Mohammed Yassin & Sarah M Russell, 2019. "Maps of variability in cell lineage trees," PLOS Computational Biology, Public Library of Science, vol. 15(2), pages 1-32, February.
    4. Jinhyun Kim & Sungsik Kim & Huiran Yeom & Seo Woo Song & Kyoungseob Shin & Sangwook Bae & Han Suk Ryu & Ji Young Kim & Ahyoun Choi & Sumin Lee & Taehoon Ryu & Yeongjae Choi & Hamin Kim & Okju Kim & Yu, 2023. "Barcoded multiple displacement amplification for high coverage sequencing in spatial genomics," Nature Communications, Nature, vol. 14(1), pages 1-18, December.
    5. Noushin Niknafs & Violeta Beleva-Guthrie & Daniel Q Naiman & Rachel Karchin, 2015. "SubClonal Hierarchy Inference from Somatic Mutations: Automatic Reconstruction of Cancer Evolutionary Trees from Multi-region Next Generation Sequencing," PLOS Computational Biology, Public Library of Science, vol. 11(10), pages 1-26, October.
    6. Yidong Zhou & Changjun Wang & Hanjiang Zhu & Yan Lin & Bo Pan & Xiaohui Zhang & Xin Huang & Qianqian Xu & Yali Xu & Qiang Sun, 2016. "Diagnostic Accuracy of PIK3CA Mutation Detection by Circulating Free DNA in Breast Cancer: A Meta-Analysis of Diagnostic Test Accuracy," PLOS ONE, Public Library of Science, vol. 11(6), pages 1-15, June.
    7. Sunde, Uwe & Berlanda, Andrea & Esposito, Elena & Cervellati, Matteo & Rohner, Dominic, 2022. "Medication Against Conflict," CEPR Discussion Papers 17125, C.E.P.R. Discussion Papers.
    8. Claudia Bühnemann & Simon Li & Haiyue Yu & Harriet Branford White & Karl L Schäfer & Antonio Llombart-Bosch & Isidro Machado & Piero Picci & Pancras C W Hogendoorn & Nicholas A Athanasou & J Alison No, 2014. "Quantification of the Heterogeneity of Prognostic Cellular Biomarkers in Ewing Sarcoma Using Automated Image and Random Survival Forest Analysis," PLOS ONE, Public Library of Science, vol. 9(9), pages 1-14, September.
    9. Chunyang Bao & Richard W. Tourdot & Gregory J. Brunette & Chip Stewart & Lili Sun & Hideo Baba & Masayuki Watanabe & Agoston T. Agoston & Kunal Jajoo & Jon M. Davison & Katie S. Nason & Gad Getz & Ken, 2023. "Genomic signatures of past and present chromosomal instability in Barrett’s esophagus and early esophageal adenocarcinoma," Nature Communications, Nature, vol. 14(1), pages 1-22, December.
    10. Sara Ballouz & Risa Karakida Kawaguchi & Maria T. Pena & Stephan Fischer & Megan Crow & Leon French & Frank M. Knight & Linda B. Adams & Jesse Gillis, 2023. "The transcriptional legacy of developmental stochasticity," Nature Communications, Nature, vol. 14(1), pages 1-12, December.
    11. Xian F Mallory & Mohammadamin Edrisi & Nicholas Navin & Luay Nakhleh, 2020. "Assessing the performance of methods for copy number aberration detection from single-cell DNA sequencing data," PLOS Computational Biology, Public Library of Science, vol. 16(7), pages 1-24, July.
    12. Scott Engstrom, Juris Doctor, 2013. "Effectuating Change In The Regulation Of Hiv Vaccines," Journal of Pharmacy & Biological Sciences (COES&RJ-JPBS), , vol. 1(1), pages 1-13, July.
    13. Heled, Joseph, 2012. "Sequence diversity under the multispecies coalescent with Yule process and constant population size," Theoretical Population Biology, Elsevier, vol. 81(2), pages 97-101.
    14. Camila P. E. de Souza & Mirela Andronescu & Tehmina Masud & Farhia Kabeer & Justina Biele & Emma Laks & Daniel Lai & Patricia Ye & Jazmine Brimhall & Beixi Wang & Edmund Su & Tony Hui & Qi Cao & Marcu, 2020. "Epiclomal: Probabilistic clustering of sparse single-cell DNA methylation data," PLOS Computational Biology, Public Library of Science, vol. 16(9), pages 1-24, September.
    15. Joanna Hård & Jeff E. Mold & Jesper Eisfeldt & Christian Tellgren-Roth & Susana Häggqvist & Ignas Bunikis & Orlando Contreras-Lopez & Chen-Shan Chin & Jessica Nordlund & Carl-Johan Rubin & Lars Feuk &, 2023. "Long-read whole-genome analysis of human single cells," Nature Communications, Nature, vol. 14(1), pages 1-12, December.
    16. Oster, Emily, 2012. "HIV and sexual behavior change: Why not Africa?," Journal of Health Economics, Elsevier, vol. 31(1), pages 35-49.
    17. Salim Akhter Chowdhury & Stanley E Shackney & Kerstin Heselmeyer-Haddad & Thomas Ried & Alejandro A Schäffer & Russell Schwartz, 2014. "Algorithms to Model Single Gene, Single Chromosome, and Whole Genome Copy Number Changes Jointly in Tumor Phylogenetics," PLOS Computational Biology, Public Library of Science, vol. 10(7), pages 1-19, July.
    18. Ajay Shrestha & Ausif Mahmood, 2016. "Improving Genetic Algorithm with Fine-Tuned Crossover and Scaled Architecture," Journal of Mathematics, Hindawi, vol. 2016, pages 1-10, April.
    19. Brandon Monier & Adam McDermaid & Cankun Wang & Jing Zhao & Allison Miller & Anne Fennell & Qin Ma, 2019. "IRIS-EDA: An integrated RNA-Seq interpretation system for gene expression data analysis," PLOS Computational Biology, Public Library of Science, vol. 15(2), pages 1-15, February.
    20. Jenkins Paul A., 2012. "Stopping-Time Resampling and Population Genetic Inference under Coalescent Models," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 11(1), pages 1-20, January.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1003297. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.