IDEAS home Printed from https://ideas.repec.org/a/plo/pcbi00/1011300.html
   My bibliography  Save this article

LRT: Integrative analysis of scRNA-seq and scTCR-seq data to investigate clonal differentiation heterogeneity

Author

Listed:
  • Juan Xie
  • Hyeongseon Jeon
  • Gang Xin
  • Qin Ma
  • Dongjun Chung

Abstract

Single-cell RNA sequencing (scRNA-seq) data has been widely used for cell trajectory inference, with the assumption that cells with similar expression profiles share the same differentiation state. However, the inferred trajectory may not reveal clonal differentiation heterogeneity among T cell clones. Single-cell T cell receptor sequencing (scTCR-seq) data provides invaluable insights into the clonal relationship among cells, yet it lacks functional characteristics. Therefore, scRNA-seq and scTCR-seq data complement each other in improving trajectory inference, where a reliable computational tool is still missing. We developed LRT, a computational framework for the integrative analysis of scTCR-seq and scRNA-seq data to explore clonal differentiation trajectory heterogeneity. Specifically, LRT uses the transcriptomics information from scRNA-seq data to construct overall cell trajectories and then utilizes both the TCR sequence information and phenotype information to identify clonotype clusters with distinct differentiation biasedness. LRT provides a comprehensive analysis workflow, including preprocessing, cell trajectory inference, clonotype clustering, trajectory biasedness evaluation, and clonotype cluster characterization. We illustrated its utility using scRNA-seq and scTCR-seq data of CD8+ T cells and CD4+ T cells with acute lymphocytic choriomeningitis virus infection. These analyses identified several clonotype clusters with distinct skewed distribution along the differentiation path, which cannot be revealed solely based on scRNA-seq data. Clones from different clonotype clusters exhibited diverse expansion capability, V-J gene usage pattern and CDR3 motifs. The LRT framework was implemented as an R package ‘LRT’, and it is now publicly accessible at https://github.com/JuanXie19/LRT. In addition, it provides two Shiny apps ‘shinyClone’ and ‘shinyClust’ that allow users to interactively explore distributions of clonotypes, conduct repertoire analysis, implement clustering of clonotypes, trajectory biasedness evaluation, and clonotype cluster characterization.Author summary: Understanding the dynamic changes behind biological processes is important for determining molecular mechanisms underlying normal tissue formulation, developmental disorders and pathologies. Usually, a biological process can be characterized by identifying a trajectory, a path that goes through the various cellular states associated with the process. Since cells in different states may express different sets of genes, researchers often infer cell trajectory via capturing transcriptomics changes. Dozens of methods have been developed for cell trajectory inference, and scRNA-seq data is predominantly utilized. However, cells from different clones may exist distinct differentiation patterns, while methods based only on scRNA-seq data cannot capture this heterogeneity. T cells play a key role in the immune system, and their high antigen recognition specificity is largely determined by their TCR sequences. Thanks to the advent of scTCR-seq technology, we can now identify the group of cells coming from the same clone. This paper describes our novel computational framework, namely LRT, and demonstrates that by complementing scRNA-seq data with the clonal information from scTCR-seq data using LRT, we are able to examine the clonal differentiation heterogeneity that cannot be revealed solely based on scRNA-seq data.

Suggested Citation

  • Juan Xie & Hyeongseon Jeon & Gang Xin & Qin Ma & Dongjun Chung, 2023. "LRT: Integrative analysis of scRNA-seq and scTCR-seq data to investigate clonal differentiation heterogeneity," PLOS Computational Biology, Public Library of Science, vol. 19(7), pages 1-17, July.
  • Handle: RePEc:plo:pcbi00:1011300
    DOI: 10.1371/journal.pcbi.1011300
    as

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1011300
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1011300&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pcbi.1011300?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Ian Holmes & Keith Harris & Christopher Quince, 2012. "Dirichlet Multinomial Mixtures: Generative Models for Microbial Metagenomics," PLOS ONE, Public Library of Science, vol. 7(2), pages 1-15, February.
    2. Hamim Zafar & Chieh Lin & Ziv Bar-Joseph, 2020. "Single-cell lineage tracing by integrating CRISPR-Cas9 mutations with transcriptomic data," Nature Communications, Nature, vol. 11(1), pages 1-14, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Achal Dhariwal & Polona Rajar & Gabriela Salvadori & Heidi Aarø Åmdal & Dag Berild & Ola Didrik Saugstad & Drude Fugelseth & Gorm Greisen & Ulf Dahle & Kirsti Haaland & Fernanda Cristina Petersen, 2024. "Prolonged hospitalization signature and early antibiotic effects on the nasopharyngeal resistome in preterm infants," Nature Communications, Nature, vol. 15(1), pages 1-13, December.
    2. Laura Anderlucci & Cinzia Viroli, 2020. "Mixtures of Dirichlet-Multinomial distributions for supervised and unsupervised classification of short text data," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 14(4), pages 759-770, December.
    3. Sanjeena Subedi & Drew Neish & Stephen Bak & Zeny Feng, 2020. "Cluster analysis of microbiome data by using mixtures of Dirichlet–multinomial regression models," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 69(5), pages 1163-1187, November.
    4. Zhao, Xin & Zhang, Jingru & Lin, Wei, 2023. "Clustering multivariate count data via Dirichlet-multinomial network fusion," Computational Statistics & Data Analysis, Elsevier, vol. 179(C).
    5. Yaru Song & Hongyu Zhao & Tao Wang, 2020. "An adaptive independence test for microbiome community data," Biometrics, The International Biometric Society, vol. 76(2), pages 414-426, June.
    6. Pratheepa Jeganathan & Susan P. Holmes, 2021. "A Statistical Perspective on the Challenges in Molecular Microbial Biology," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 26(2), pages 131-160, June.
    7. Shaikh Mateen R. & Beyene Joseph, 2017. "Statistical models and computational algorithms for discovering relationships in microbiome data," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 16(1), pages 1-12, March.
    8. Georgios D. Kitsios & Khaled Sayed & Adam Fitch & Haopu Yang & Noel Britton & Faraaz Shah & William Bain & John W. Evankovich & Shulin Qin & Xiaohong Wang & Kelvin Li & Asha Patel & Yingze Zhang & Jos, 2024. "Longitudinal multicompartment characterization of host-microbiota interactions in patients with acute respiratory failure," Nature Communications, Nature, vol. 15(1), pages 1-18, December.
    9. Hannah M. Schlüter & Caroline Uhler, 2025. "Integrating representation learning, permutation, and optimization to detect lineage-related gene expression patterns," Nature Communications, Nature, vol. 16(1), pages 1-15, December.
    10. Julien Tap & Franck Lejzerowicz & Aurélie Cotillard & Matthieu Pichaud & Daniel McDonald & Se Jin Song & Rob Knight & Patrick Veiga & Muriel Derrien, 2023. "Global branches and local states of the human gut microbiome define associations with environmental and intrinsic factors," Nature Communications, Nature, vol. 14(1), pages 1-11, December.
    11. Wangshu Tu & Sanjeena Subedi, 2023. "Logistic Normal Multinomial Factor Analyzers for Clustering Microbiome Data," Journal of Classification, Springer;The Classification Society, vol. 40(3), pages 638-667, November.
    12. Mishra, Aditya & Müller, Christian L., 2022. "Robust regression with compositional covariates," Computational Statistics & Data Analysis, Elsevier, vol. 165(C).
    13. Gallopin Mélina & Celeux Gilles & Jaffrézic Florence & Rau Andrea, 2015. "A model selection criterion for model-based clustering of annotated gene expression data," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 14(5), pages 413-428, November.
    14. Kihyun Lee & Sebastien Raguideau & Kimmo Sirén & Francesco Asnicar & Fabio Cumbo & Falk Hildebrand & Nicola Segata & Chang-Jun Cha & Christopher Quince, 2023. "Population-level impacts of antibiotic usage on the human gut microbiome," Nature Communications, Nature, vol. 14(1), pages 1-19, December.
    15. Patrick LeBlanc & Li Ma, 2023. "Microbiome subcommunity learning with logistic‐tree normal latent Dirichlet allocation," Biometrics, The International Biometric Society, vol. 79(3), pages 2321-2332, September.
    16. Tu, Wangshu & Browne, Ryan & Subedi, Sanjeena, 2024. "A mixture of logistic skew-normal multinomial models," Computational Statistics & Data Analysis, Elsevier, vol. 196(C).
    17. Binhuan Wang & Lanqiu Yao & Jiyuan Hu & Huilin Li, 2023. "A New Algorithm for Convex Biclustering and Its Extension to the Compositional Data," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 15(1), pages 193-216, April.
    18. Emma M. Koff & Debbie Baarle & Marlies A. Houten & Marta Reyman & Guy A. M. Berbers & Femke Ham & Mei Ling J. N. Chu & Elisabeth A. M. Sanders & Debby Bogaert & Susana Fuentes, 2022. "Mode of delivery modulates the intestinal microbiota and impacts the response to vaccination," Nature Communications, Nature, vol. 13(1), pages 1-12, December.
    19. Zhiwei Qin & John Bowman & Jagtej Bewli, 2018. "A Bayesian framework for large-scale geo-demand estimation in on-line retailing," Annals of Operations Research, Springer, vol. 263(1), pages 231-245, April.
    20. Mahbaneh Eshaghzadeh Torbati & Makedonka Mitreva & Vanathi Gopalakrishnan, 2016. "Application of Taxonomic Modeling to Microbiota Data Mining for Detection of Helminth Infection in Global Populations," Data, MDPI, vol. 1(3), pages 1-14, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1011300. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.