IDEAS home Printed from https://ideas.repec.org/a/plo/pcbi00/1012094.html
   My bibliography  Save this article

Scuphr: A probabilistic framework for cell lineage tree reconstruction

Author

Listed:
  • Hazal Koptagel
  • Seong-Hwan Jun
  • Joanna Hård
  • Jens Lagergren

Abstract

Cell lineage tree reconstruction methods are developed for various tasks, such as investigating the development, differentiation, and cancer progression. Single-cell sequencing technologies enable more thorough analysis with higher resolution. We present Scuphr, a distance-based cell lineage tree reconstruction method using bulk and single-cell DNA sequencing data from healthy tissues. Common challenges of single-cell DNA sequencing, such as allelic dropouts and amplification errors, are included in Scuphr. Scuphr computes the distance between cell pairs and reconstructs the lineage tree using the neighbor-joining algorithm. With its embarrassingly parallel design, Scuphr can do faster analysis than the state-of-the-art methods while obtaining better accuracy. The method’s robustness is investigated using various synthetic datasets and a biological dataset of 18 cells.Author summary: Cell lineage tree reconstruction carries a significant potential for studies of development and medicine. The lineage tree reconstruction task is especially challenging for cells taken from healthy tissue due to the scarcity of mutations. In addition, the single-cell whole-genome sequencing technology introduces artifacts such as amplification errors, allelic dropouts, and sequencing errors. We propose Scuphr, a probabilistic framework to reconstruct cell lineage trees. We designed Scuphr for single-cell DNA sequencing data; it accounts for technological artifacts in its graphical model and uses germline heterozygous sites to improve its accuracy. Scuphr is embarrassingly parallel; the speed of the computational analysis is inversely proportional to the number of available computational nodes. We demonstrated that Scuphr is fast, robust, and more accurate than the state-of-the-art method with the synthetic data experiments. Moreover, in the biological data experiment, we showed Scuphr successfully identifies different clones and further obtains more support on closely related cells within clones.

Suggested Citation

  • Hazal Koptagel & Seong-Hwan Jun & Joanna Hård & Jens Lagergren, 2024. "Scuphr: A probabilistic framework for cell lineage tree reconstruction," PLOS Computational Biology, Public Library of Science, vol. 20(5), pages 1-25, May.
  • Handle: RePEc:plo:pcbi00:1012094
    DOI: 10.1371/journal.pcbi.1012094
    as

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1012094
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1012094&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pcbi.1012094?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Nicholas Navin & Jude Kendall & Jennifer Troge & Peter Andrews & Linda Rodgers & Jeanne McIndoo & Kerry Cook & Asya Stepansky & Dan Levy & Diane Esposito & Lakshmi Muthuswamy & Alex Krasnitz & W. Rich, 2011. "Tumour evolution inferred by single-cell sequencing," Nature, Nature, vol. 472(7341), pages 90-94, April.
    2. Tim H. H. Coorens & Luiza Moore & Philip S. Robinson & Rashesh Sanghvi & Joseph Christopher & James Hewinson & Moritz J. Przybilla & Andrew R. J. Lawson & Michael Spencer Chapman & Alex Cagan & Thomas, 2021. "Extensive phylogenies of human development inferred from somatic mutations," Nature, Nature, vol. 597(7876), pages 387-392, September.
    3. Henry Lee-Six & Nina Friesgaard Øbro & Mairi S. Shepherd & Sebastian Grossmann & Kevin Dawson & Miriam Belmonte & Robert J. Osborne & Brian J. P. Huntly & Inigo Martincorena & Elizabeth Anderson & Lau, 2018. "Population dynamics of normal human blood inferred from somatic mutations," Nature, Nature, vol. 561(7724), pages 473-478, September.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Márton Demeter & Imre Derényi & Gergely J. Szöllősi, 2022. "Trade-off between reducing mutational accumulation and increasing commitment to differentiation determines tissue organization," Nature Communications, Nature, vol. 13(1), pages 1-10, December.
    2. Adam C. Weiner & Marc J. Williams & Hongyu Shi & Ignacio Vázquez-García & Sohrab Salehi & Nicole Rusk & Samuel Aparicio & Sohrab P. Shah & Andrew McPherson, 2024. "Inferring replication timing and proliferation dynamics from single-cell DNA sequencing data," Nature Communications, Nature, vol. 15(1), pages 1-19, December.
    3. Jinhyun Kim & Sungsik Kim & Huiran Yeom & Seo Woo Song & Kyoungseob Shin & Sangwook Bae & Han Suk Ryu & Ji Young Kim & Ahyoun Choi & Sumin Lee & Taehoon Ryu & Yeongjae Choi & Hamin Kim & Okju Kim & Yu, 2023. "Barcoded multiple displacement amplification for high coverage sequencing in spatial genomics," Nature Communications, Nature, vol. 14(1), pages 1-18, December.
    4. Noushin Niknafs & Violeta Beleva-Guthrie & Daniel Q Naiman & Rachel Karchin, 2015. "SubClonal Hierarchy Inference from Somatic Mutations: Automatic Reconstruction of Cancer Evolutionary Trees from Multi-region Next Generation Sequencing," PLOS Computational Biology, Public Library of Science, vol. 11(10), pages 1-26, October.
    5. Yidong Zhou & Changjun Wang & Hanjiang Zhu & Yan Lin & Bo Pan & Xiaohui Zhang & Xin Huang & Qianqian Xu & Yali Xu & Qiang Sun, 2016. "Diagnostic Accuracy of PIK3CA Mutation Detection by Circulating Free DNA in Breast Cancer: A Meta-Analysis of Diagnostic Test Accuracy," PLOS ONE, Public Library of Science, vol. 11(6), pages 1-15, June.
    6. repec:plo:pcbi00:1003535 is not listed on IDEAS
    7. Claudia Bühnemann & Simon Li & Haiyue Yu & Harriet Branford White & Karl L Schäfer & Antonio Llombart-Bosch & Isidro Machado & Piero Picci & Pancras C W Hogendoorn & Nicholas A Athanasou & J Alison No, 2014. "Quantification of the Heterogeneity of Prognostic Cellular Biomarkers in Ewing Sarcoma Using Automated Image and Random Survival Forest Analysis," PLOS ONE, Public Library of Science, vol. 9(9), pages 1-14, September.
    8. Joseph M. Josephides & Chun-Long Chen, 2025. "Unravelling single-cell DNA replication timing dynamics using machine learning reveals heterogeneity in cancer progression," Nature Communications, Nature, vol. 16(1), pages 1-15, December.
    9. Chunyang Bao & Richard W. Tourdot & Gregory J. Brunette & Chip Stewart & Lili Sun & Hideo Baba & Masayuki Watanabe & Agoston T. Agoston & Kunal Jajoo & Jon M. Davison & Katie S. Nason & Gad Getz & Ken, 2023. "Genomic signatures of past and present chromosomal instability in Barrett’s esophagus and early esophageal adenocarcinoma," Nature Communications, Nature, vol. 14(1), pages 1-22, December.
    10. Kitty Sherwood & Joseph C. Ward & Ignacio Soriano & Lynn Martin & Archie Campbell & Raheleh Rahbari & Ioannis Kafetzopoulos & Duncan Sproul & Andrew Green & Julian R. Sampson & Alan Donaldson & Kai-Re, 2023. "Germline de novo mutations in families with Mendelian cancer syndromes caused by defects in DNA repair," Nature Communications, Nature, vol. 14(1), pages 1-10, December.
    11. Xian F Mallory & Mohammadamin Edrisi & Nicholas Navin & Luay Nakhleh, 2020. "Assessing the performance of methods for copy number aberration detection from single-cell DNA sequencing data," PLOS Computational Biology, Public Library of Science, vol. 16(7), pages 1-24, July.
    12. Camila P. E. de Souza & Mirela Andronescu & Tehmina Masud & Farhia Kabeer & Justina Biele & Emma Laks & Daniel Lai & Patricia Ye & Jazmine Brimhall & Beixi Wang & Edmund Su & Tony Hui & Qi Cao & Marcu, 2020. "Epiclomal: Probabilistic clustering of sparse single-cell DNA methylation data," PLOS Computational Biology, Public Library of Science, vol. 16(9), pages 1-24, September.
    13. Udit Singhal & Srinivas Nallandhighal & Jeffrey J. Tosoian & Kevin Hu & Trinh M. Pham & Judith Stangl-Kremser & Chia-Jen Liu & Razeen Karim & Komal R. Plouffe & Todd M. Morgan & Marcin Cieslik & Rober, 2024. "Integrative multi-region molecular profiling of primary prostate cancer in men with synchronous lymph node metastasis," Nature Communications, Nature, vol. 15(1), pages 1-9, December.
    14. Joanna Hård & Jeff E. Mold & Jesper Eisfeldt & Christian Tellgren-Roth & Susana Häggqvist & Ignas Bunikis & Orlando Contreras-Lopez & Chen-Shan Chin & Jessica Nordlund & Carl-Johan Rubin & Lars Feuk &, 2023. "Long-read whole-genome analysis of human single cells," Nature Communications, Nature, vol. 14(1), pages 1-12, December.
    15. repec:plo:pone00:0054236 is not listed on IDEAS
    16. Heather E. Machado & Nina F. Øbro & Nicholas Williams & Shengjiang Tan & Ahmed Z. Boukerrou & Megan Davies & Miriam Belmonte & Emily Mitchell & E. Joanna Baxter & Nicole Mende & Anna Clay & Philip Anc, 2023. "Convergent somatic evolution commences in utero in a germline ribosomopathy," Nature Communications, Nature, vol. 14(1), pages 1-14, December.
    17. Noa Chapal-Ilani & Yosef E Maruvka & Adam Spiro & Yitzhak Reizel & Rivka Adar & Liran I Shlush & Ehud Shapiro, 2013. "Comparing Algorithms That Reconstruct Cell Lineage Trees Utilizing Information on Microsatellite Mutations," PLOS Computational Biology, Public Library of Science, vol. 9(11), pages 1-17, November.
    18. Salim Akhter Chowdhury & Stanley E Shackney & Kerstin Heselmeyer-Haddad & Thomas Ried & Alejandro A Schäffer & Russell Schwartz, 2014. "Algorithms to Model Single Gene, Single Chromosome, and Whole Genome Copy Number Changes Jointly in Tumor Phylogenetics," PLOS Computational Biology, Public Library of Science, vol. 10(7), pages 1-19, July.
    19. Brandon Monier & Adam McDermaid & Cankun Wang & Jing Zhao & Allison Miller & Anne Fennell & Qin Ma, 2019. "IRIS-EDA: An integrated RNA-Seq interpretation system for gene expression data analysis," PLOS Computational Biology, Public Library of Science, vol. 15(2), pages 1-15, February.
    20. Szu-Hsien Sam Wu & Somi Kim & Heetak Lee & Ji-Hyun Lee & So-Yeon Park & Réka Bakonyi & Isaree Teriyapirom & Natalia Hallay & Sandra Pilat-Carotta & Hans-Christian Theussl & Jihoon Kim & Joo-Hyeon Lee , 2024. "Red2Flpe-SCON: a versatile, multicolor strategy for generating mosaic conditional knockout mice," Nature Communications, Nature, vol. 15(1), pages 1-12, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1012094. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.