IDEAS home Printed from https://ideas.repec.org/a/plo/pcbi00/1009829.html
   My bibliography  Save this article

Inference of trajectory presence by tree dimension and subset specificity by subtree cover

Author

Listed:
  • Lovemore Tenha
  • Mingzhou Song

Abstract

The complexity of biological processes such as cell differentiation is reflected in dynamic transitions between cellular states. Trajectory inference arranges the states into a progression using methodologies propelled by single-cell biology. However, current methods, all returning a best trajectory, do not adequately assess statistical significance of noisy patterns, leading to uncertainty in inferred trajectories. We introduce a tree dimension test for trajectory presence in multivariate data by a dimension measure of Euclidean minimum spanning tree, a test statistic, and a null distribution. Computable in linear time to tree size, the tree dimension measure summarizes the extent of branching more effectively than globally insensitive number of leaves or tree diameter indifferent to secondary branches. The test statistic quantifies trajectory presence and its null distribution is estimated under the null hypothesis of no trajectory in data. On simulated and real single-cell datasets, the test outperformed the intuitive number of leaves and tree diameter statistics. Next, we developed a measure for the tissue specificity of the dynamics of a subset, based on the minimum subtree cover of the subset in a minimum spanning tree. We found that tissue specificity of pathway gene expression dynamics is conserved in human and mouse development: several signal transduction pathways including calcium and Wnt signaling are most tissue specific, while genetic information processing pathways such as ribosome and mismatch repair are least so. Neither the tree dimension test nor the subset specificity measure has any user parameter to tune. Our work opens a window to prioritize cellular dynamics and pathways in development and other multivariate dynamical systems.Author summary: Modern biology now routinely studies transcriptome profiles during development. This practice demands computational methods to quantify dynamical changes in cellular states and their heterogeneity. Many methods process single-cell transcriptome data to reconstruct cellular trajectories, which are orderings of cells as they progress from an early to a late developmental stage. Due to noise in transcriptome data, there is a great need to quantify how likely observed data present a trajectory-like pattern due to chance. To address this need, we developed a tree dimension test to quantify evidence for trajectory presence in multivariate data based on graph-theoretical concepts. By this test, one may reject trajectory presence due to low data quality, or accept a trajectory with high statistical significance. Now one can rank biological pathways by their trajectory quality. We also introduce a subset specificity measure to quantify how cellular or pathway dynamics are tissue specific. We found that pathway tissue specificity is highly conserved between human and mouse. Trajectory presence testing and subset specificity offer a unique informatics tool set to study developmental biology.

Suggested Citation

  • Lovemore Tenha & Mingzhou Song, 2022. "Inference of trajectory presence by tree dimension and subset specificity by subtree cover," PLOS Computational Biology, Public Library of Science, vol. 18(2), pages 1-20, February.
  • Handle: RePEc:plo:pcbi00:1009829
    DOI: 10.1371/journal.pcbi.1009829
    as

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1009829
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1009829&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pcbi.1009829?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1009829. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.