
CDSeq: A novel complete deconvolution method for dissecting heterogeneous samples using gene expression data

Author

Listed:
  • Kai Kang
  • Qian Meng
  • Igor Shats
  • David M Umbach
  • Melissa Li
  • Yuanyuan Li
  • Xiaoling Li
  • Leping Li

Abstract

Quantifying cell-type proportions and their corresponding gene expression profiles in tissue samples would enhance understanding of the contributions of individual cell types to the physiological states of the tissue. Current approaches that address tissue heterogeneity have drawbacks. Experimental techniques, such as fluorescence-activated cell sorting and single-cell RNA sequencing, are expensive. Computational approaches that use expression data from heterogeneous samples are promising, but most current methods estimate either cell-type proportions or cell-type-specific expression profiles and require the other as input. Although such partial deconvolution methods have been successfully applied to tumor samples, the additional input they require may be unavailable. We introduce a novel complete deconvolution method, CDSeq, that uses only RNA-Seq data from bulk tissue samples to simultaneously estimate both cell-type proportions and cell-type-specific expression profiles. Using several synthetic and real experimental datasets with known cell-type composition and cell-type-specific expression profiles, we compared CDSeq’s complete deconvolution performance with seven other established deconvolution methods. Complete deconvolution using CDSeq represents a substantial technical advance over partial deconvolution approaches and will be useful for studying cell mixtures in tissue samples. CDSeq is available in a GitHub repository (MATLAB and Octave code): https://github.com/kkang7/CDSeq.

Author summary

Understanding the cellular composition of bulk tissues is critical for investigating the mechanisms underlying many biological processes. Single-cell sequencing is a promising technique; however, it is expensive, and the analysis of single-cell data is non-trivial. Therefore, tissue samples are still routinely processed in bulk. To estimate cell-type composition from bulk gene expression data, computational deconvolution methods are needed. Many deconvolution methods have been proposed; however, they often estimate only cell-type proportions using a reference cell-type gene expression profile, which in many cases may not be available. We present a novel complete deconvolution method that uses only bulk gene expression data to simultaneously estimate cell-type-specific gene expression profiles and sample-specific cell-type proportions. Using multiple RNA-Seq and microarray datasets in which the cell-type composition was previously known, we showed that our method could accurately determine the cell-type composition. By providing a method that requires a single input to determine both cell-type proportions and cell-type-specific expression profiles, we expect that our method will be beneficial to biologists and will facilitate the research and identification of mechanisms underlying many biological processes.
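
The distinction between partial and complete deconvolution can be made concrete with a toy sketch. It is not the CDSeq algorithm, and it is not taken from the article. It assumes the commonly used linear mixing model, in which a bulk expression value is the proportion-weighted sum of cell-type-specific values, and it shows only the partial case: with both reference profiles given, the proportion of one cell type is recovered by least squares. The function names (`mix`, `estimate_proportion`) and the expression values are hypothetical.

```python
# Toy illustration only: NOT the CDSeq algorithm.
# Assumption: bulk expression is a proportion-weighted sum of cell-type profiles.

def mix(profile_a, profile_b, prop_a):
    """Simulate one bulk sample from two cell-type profiles (hypothetical data)."""
    return [prop_a * a + (1.0 - prop_a) * b for a, b in zip(profile_a, profile_b)]

def estimate_proportion(bulk, profile_a, profile_b):
    """Partial deconvolution: least-squares estimate of the proportion of
    cell type A, given both reference profiles as input."""
    diff = [a - b for a, b in zip(profile_a, profile_b)]
    num = sum((x - b) * d for x, b, d in zip(bulk, profile_b, diff))
    den = sum(d * d for d in diff)
    if den == 0:
        raise ValueError("profiles are identical; proportion is not identifiable")
    # Clip to [0, 1] so the result is a valid proportion.
    return min(1.0, max(0.0, num / den))

# Hypothetical expression values for four genes in two cell types.
profile_a = [10.0, 0.0, 5.0, 2.0]
profile_b = [1.0, 8.0, 5.0, 6.0]

bulk = mix(profile_a, profile_b, 0.3)          # simulate a 30% / 70% mixture
estimated = estimate_proportion(bulk, profile_a, profile_b)
print(round(estimated, 3))
```

The sketch needs `profile_a` and `profile_b` as input, which is exactly the requirement the abstract identifies as a limitation of partial methods. A complete method has to estimate the profiles and the proportions together from the bulk samples alone, which this example does not attempt.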

Suggested Citation

  • Kai Kang & Qian Meng & Igor Shats & David M Umbach & Melissa Li & Yuanyuan Li & Xiaoling Li & Leping Li, 2019. "CDSeq: A novel complete deconvolution method for dissecting heterogeneous samples using gene expression data," PLOS Computational Biology, Public Library of Science, vol. 15(12), pages 1-18, December.
  • Handle: RePEc:plo:pcbi00:1007510
    DOI: 10.1371/journal.pcbi.1007510

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1007510
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1007510&type=printable
    Download Restriction: no


    References listed on IDEAS

    1. Xuran Wang & Jihwan Park & Katalin Susztak & Nancy R. Zhang & Mingyao Li, 2019. "Bulk tissue cell type deconvolution with multi-subject single-cell expression reference," Nature Communications, Nature, vol. 10(1), pages 1-9, December.

    Citations



    Cited by:

    1. Brendan F. Miller & Feiyang Huang & Lyla Atta & Arpan Sahoo & Jean Fan, 2022. "Reference-free cell type deconvolution of multi-cellular pixel-resolution spatially resolved transcriptomics data," Nature Communications, Nature, vol. 13(1), pages 1-13, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Bárbara Andrade Barbosa & Saskia D. Asten & Ji Won Oh & Arantza Farina-Sarasqueta & Joanne Verheij & Frederike Dijk & Hanneke W. M. Laarhoven & Bauke Ylstra & Juan J. Garcia Vallejo & Mark A. Wiel & Y, 2021. "Bayesian log-normal deconvolution for enhanced in silico microdissection of bulk gene expression data," Nature Communications, Nature, vol. 12(1), pages 1-13, December.
    2. Keyong Sun & Runda Xu & Fuhai Ma & Naixue Yang & Yang Li & Xiaofeng Sun & Peng Jin & Wenzhe Kang & Lemei Jia & Jianping Xiong & Haitao Hu & Yantao Tian & Xun Lan, 2022. "scRNA-seq of gastric tumor shows complex intercellular interaction with an alternative T cell exhaustion trajectory," Nature Communications, Nature, vol. 13(1), pages 1-19, December.
    3. Nelson Johansen & Hongru Hu & Gerald Quon, 2023. "Projecting RNA measurements onto single cell atlases to extract cell type-specific expression profiles using scProjection," Nature Communications, Nature, vol. 14(1), pages 1-15, December.
    4. Chang Su & Zichun Xu & Xinning Shan & Biao Cai & Hongyu Zhao & Jingfei Zhang, 2023. "Cell-type-specific co-expression inference from single cell RNA-sequencing data," Nature Communications, Nature, vol. 14(1), pages 1-12, December.
    5. Seoyeon Lee & Mohammad Naimul Islam & Kaveh Boostanpour & Dvir Aran & Guangchun Jin & Stephanie Christenson & Michael A. Matthay & Walter L. Eckalbar & Daryle J. DePianto & Joseph R. Arron & Liam Mage, 2021. "Molecular programs of fibrotic change in aging human lung," Nature Communications, Nature, vol. 12(1), pages 1-10, December.
    6. Xiao Zhou & Zhen Cheng & Mingyu Dong & Qi Liu & Weiyang Yang & Min Liu & Junzhang Tian & Weibin Cheng, 2022. "Tumor fractions deciphered from circulating cell-free DNA methylation for cancer early diagnosis," Nature Communications, Nature, vol. 13(1), pages 1-13, December.
    7. Matteo D’Antonio & Jennifer P. Nguyen & Timothy D. Arthur & Hiroko Matsui & Agnieszka D’Antonio-Chronowska & Kelly A. Frazer, 2023. "Fine mapping spatiotemporal mechanisms of genetic variants underlying cardiac traits and disease," Nature Communications, Nature, vol. 14(1), pages 1-18, December.
    8. Brian D. Lehmann & Antonio Colaprico & Tiago C. Silva & Jianjiao Chen & Hanbing An & Yuguang Ban & Hanchen Huang & Lily Wang & Jamaal L. James & Justin M. Balko & Paula I. Gonzalez-Ericsson & Melinda , 2021. "Multi-omics analysis identifies therapeutic vulnerabilities in triple-negative breast cancer subtypes," Nature Communications, Nature, vol. 12(1), pages 1-18, December.
    9. Beibei Ru & Jinlin Huang & Yu Zhang & Kenneth Aldape & Peng Jiang, 2023. "Estimation of cell lineages in tumors from spatial transcriptomics data," Nature Communications, Nature, vol. 14(1), pages 1-13, December.
    10. Michael S. Balzer & Tomohito Doke & Ya-Wen Yang & Daniel L. Aldridge & Hailong Hu & Hung Mai & Dhanunjay Mukhi & Ziyuan Ma & Rojesh Shrestha & Matthew B. Palmer & Christopher A. Hunter & Katalin Suszt, 2022. "Single-cell analysis highlights differences in druggable pathways underlying adaptive or fibrotic kidney regeneration," Nature Communications, Nature, vol. 13(1), pages 1-18, December.
    11. David R. Ghasemi & Konstantin Okonechnikov & Anne Rademacher & Stephan Tirier & Kendra K. Maass & Hanna Schumacher & Piyush Joshi & Maxwell P. Gold & Julia Sundheimer & Britta Statz & Ahmet S. Rifaiog, 2024. "Compartments in medulloblastoma with extensive nodularity are connected through differentiation along the granular precursor lineage," Nature Communications, Nature, vol. 15(1), pages 1-20, December.
    12. Yanshuo Chen & Yixuan Wang & Yuelong Chen & Yuqi Cheng & Yumeng Wei & Yunxiang Li & Jiuming Wang & Yingying Wei & Ting-Fung Chan & Yu Li, 2022. "Deep autoencoder for interpretable tissue-adaptive deconvolution and cell-type-specific gene analysis," Nature Communications, Nature, vol. 13(1), pages 1-17, December.
    13. Zhenzhen Xun & Xinyu Ding & Yao Zhang & Benyan Zhang & Shujing Lai & Duowu Zou & Junke Zheng & Guoqiang Chen & Bing Su & Leng Han & Youqiong Ye, 2023. "Reconstruction of the tumor spatial microenvironment along the malignant-boundary-nonmalignant axis," Nature Communications, Nature, vol. 14(1), pages 1-16, December.
    14. Bryce Rowland & Ruth Huh & Zoey Hou & Cheynna Crowley & Jia Wen & Yin Shen & Ming Hu & Paola Giusti-Rodríguez & Patrick F Sullivan & Yun Li, 2022. "THUNDER: A reference-free deconvolution method to infer cell type proportions from bulk Hi-C data," PLOS Genetics, Public Library of Science, vol. 18(3), pages 1-18, March.
    15. Stefano Berto & Alex H. Treacher & Emre Caglayan & Danni Luo & Jillian R. Haney & Michael J. Gandal & Daniel H. Geschwind & Albert A. Montillo & Genevieve Konopka, 2022. "Association between resting-state functional brain connectivity and gene expression is altered in autism spectrum disorder," Nature Communications, Nature, vol. 13(1), pages 1-11, December.
    16. Khoa A. Tran & Venkateswar Addala & Rebecca L. Johnston & David Lovell & Andrew Bradley & Lambros T. Koufariotis & Scott Wood & Sunny Z. Wu & Daniel Roden & Ghamdan Al-Eryani & Alexander Swarbrick & E, 2023. "Performance of tumour microenvironment deconvolution methods in breast cancer using single-cell simulated bulk mixtures," Nature Communications, Nature, vol. 14(1), pages 1-17, December.
    17. Zhiyuan Liu & Dafei Wu & Weiwei Zhai & Liang Ma, 2023. "SONAR enables cell type deconvolution with spatially weighted Poisson-Gamma model for spatial transcriptomics," Nature Communications, Nature, vol. 14(1), pages 1-14, December.
    18. Joschka Hey & Michelle Paulsen & Reka Toth & Dieter Weichenhan & Simone Butz & Jolanthe Schatterny & Reinhard Liebers & Pavlo Lutsik & Christoph Plass & Marcus A. Mall, 2021. "Epigenetic reprogramming of airway macrophages promotes polarization and inflammation in muco-obstructive lung disease," Nature Communications, Nature, vol. 12(1), pages 1-18, December.
    19. Xiaoguang Xu & Chachrit Khunsriraksakul & James M. Eales & Sebastien Rubin & David Scannali & Sushant Saluja & David Talavera & Havell Markus & Lida Wang & Maciej Drzal & Akhlaq Maan & Abigail C. Lay , 2024. "Genetic imputation of kidney transcriptome, proteome and multi-omics illuminates new blood pressure and hypertension targets," Nature Communications, Nature, vol. 15(1), pages 1-29, December.
    20. Jie Liao & Jingyang Qian & Yin Fang & Zhuo Chen & Xiang Zhuang & Ningyu Zhang & Xin Shao & Yining Hu & Penghui Yang & Junyun Cheng & Yang Hu & Lingqi Yu & Haihong Yang & Jinlu Zhang & Xiaoyan Lu & Li , 2022. "De novo analysis of bulk RNA-seq data at spatially resolved single-cell resolution," Nature Communications, Nature, vol. 13(1), pages 1-19, December.
