IDEAS home Printed from https://ideas.repec.org/a/plo/pcbi00/1006391.html
   My bibliography  Save this article

A new dynamic correlation algorithm reveals novel functional aspects in single cell and bulk RNA-seq data

Author

Listed:
  • Tianwei Yu

Abstract

Dynamic correlations are pervasive in high-throughput data. Large numbers of gene pairs can change their correlation patterns in response to observed/unobserved changes in physiological states. Finding changes in correlation patterns can reveal important regulatory mechanisms. Currently there is no method that can effectively detect global dynamic correlation patterns in a dataset. Given the challenging nature of the problem, the currently available methods use genes as surrogate measurements of physiological states, which cannot faithfully represent true underlying biological signals. In this study we develop a new method that directly identifies strong latent dynamic correlation signals from the data matrix, named DCA: Dynamic Correlation Analysis. At the center of the method is a new metric for the identification of pairs of variables that are highly likely to be dynamically correlated, without knowing the underlying physiological states that govern the dynamic correlation. We validate the performance of the method with extensive simulations. We applied the method to three real datasets: a single cell RNA-seq dataset, a bulk RNA-seq dataset, and a microarray gene expression dataset. In all three datasets, the method reveals novel latent factors with clear biological meaning, bringing new insights into the data.Author summary: Dynamic correlation is an important area in expression data. However it hasn’t received much attention because of the lack of effective methods that can unravel the complex relationship. Here we describe a new method that represents a substantial improvement over existing approaches. It achieves the goal of efficiently finding patterns of dynamic correlation in RNA-seq data, as well as detecting biological functions associated with the dynamic correlation patterns. Unlike traditional methods that focus on first-order structures, linear or nonlinear, our method finds second-order patterns that bring insights into the regulations of the complex system. Some of the interesting discoveries by the new method, such as immunological functions of some intestinal epithelial cells, are validated by recent biological publications.

Suggested Citation

  • Tianwei Yu, 2018. "A new dynamic correlation algorithm reveals novel functional aspects in single cell and bulk RNA-seq data," PLOS Computational Biology, Public Library of Science, vol. 14(8), pages 1-22, August.
  • Handle: RePEc:plo:pcbi00:1006391
    DOI: 10.1371/journal.pcbi.1006391
    as

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1006391
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1006391&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pcbi.1006391?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Adam L. Haber & Moshe Biton & Noga Rogel & Rebecca H. Herbst & Karthik Shekhar & Christopher Smillie & Grace Burgin & Toni M. Delorey & Michael R. Howitt & Yarden Katz & Itay Tirosh & Semir Beyaz & Da, 2017. "A single-cell survey of the small intestinal epithelium," Nature, Nature, vol. 551(7680), pages 333-339, November.
    2. Dvir Aran & Marina Sirota & Atul J. Butte, 2015. "Systematic pan-cancer analysis of tumour purity," Nature Communications, Nature, vol. 6(1), pages 1-12, December.
    3. Jeremiah R. McDole & Leroy W. Wheeler & Keely G. McDonald & Baomei Wang & Vjollca Konjufca & Kathryn A. Knoop & Rodney D. Newberry & Mark J. Miller, 2012. "Goblet cells deliver luminal antigen to CD103+ dendritic cells in the small intestine," Nature, Nature, vol. 483(7389), pages 345-349, March.
    4. Jun Chen & Jichun Xie & Hongzhe Li, 2011. "A Penalized Likelihood Approach for Bivariate Conditional Normal Models for Dynamic Co-expression Analysis," Biometrics, The International Biometric Society, vol. 67(1), pages 299-308, March.
    5. Yen-Yi Ho & Giovanni Parmigiani & Thomas A. Louis & Leslie M. Cope, 2011. "Modeling Liquid Association," Biometrics, The International Biometric Society, vol. 67(1), pages 133-141, March.
    6. Efron, Bradley, 2004. "Large-Scale Simultaneous Hypothesis Testing: The Choice of a Null Hypothesis," Journal of the American Statistical Association, American Statistical Association, vol. 99, pages 96-104, January.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Zhen Yang & Yen‐Yi Ho, 2022. "Modeling dynamic correlation in zero‐inflated bivariate count data with applications to single‐cell RNA sequencing data," Biometrics, The International Biometric Society, vol. 78(2), pages 766-776, June.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Zichen Ma & Shannon W. Davis & Yen‐Yi Ho, 2023. "Flexible copula model for integrating correlated multi‐omics data from single‐cell experiments," Biometrics, The International Biometric Society, vol. 79(2), pages 1559-1572, June.
    2. Pounds Stanley B. & Gao Cuilan L. & Zhang Hui, 2012. "Empirical Bayesian Selection of Hypothesis Testing Procedures for Analysis of Sequence Count Expression Data," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 11(5), pages 1-32, October.
    3. Shigeyuki Matsui & Hisashi Noma, 2011. "Estimating Effect Sizes of Differentially Expressed Genes for Power and Sample-Size Assessments in Microarray Experiments," Biometrics, The International Biometric Society, vol. 67(4), pages 1225-1235, December.
    4. Won, Joong-Ho & Lim, Johan & Yu, Donghyeon & Kim, Byung Soo & Kim, Kyunga, 2014. "Monotone false discovery rate," Statistics & Probability Letters, Elsevier, vol. 87(C), pages 86-93.
    5. van Wieringen, Wessel N. & Stam, Koen A. & Peeters, Carel F.W. & van de Wiel, Mark A., 2020. "Updating of the Gaussian graphical model through targeted penalized estimation," Journal of Multivariate Analysis, Elsevier, vol. 178(C).
    6. Ian W. McKeague & Min Qian, 2015. "An Adaptive Resampling Test for Detecting the Presence of Significant Predictors," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(512), pages 1422-1433, December.
    7. Angela Schörgendorfer & Adam J. Branscum & Timothy E. Hanson, 2013. "A Bayesian Goodness of Fit Test and Semiparametric Generalization of Logistic Regression with Measurement Data," Biometrics, The International Biometric Society, vol. 69(2), pages 508-519, June.
    8. Miles C. Andrews & Junna Oba & Chang-Jiun Wu & Haifeng Zhu & Tatiana Karpinets & Caitlin A. Creasy & Marie-Andrée Forget & Xiaoxing Yu & Xingzhi Song & Xizeng Mao & A. Gordon Robertson & Gabriele Roma, 2022. "Multi-modal molecular programs regulate melanoma cell state," Nature Communications, Nature, vol. 13(1), pages 1-18, December.
    9. Xiwen Xiong & Chenyan Yang & Wei-Qi He & Jiahui Yu & Yue Xin & Xinge Zhang & Rong Huang & Honghui Ma & Shaofang Xu & Zun Li & Jie Ma & Lin Xu & Qunyi Wang & Kaiqun Ren & Xiaoli S. Wu & Christopher R. , 2022. "Sirtuin 6 maintains epithelial STAT6 activity to support intestinal tuft cell development and type 2 immunity," Nature Communications, Nature, vol. 13(1), pages 1-16, December.
    10. Han, Bing & Dalal, Siddhartha R., 2012. "A Bernstein-type estimator for decreasing density with application to p-value adjustments," Computational Statistics & Data Analysis, Elsevier, vol. 56(2), pages 427-437.
    11. Dalia Valencia & Rosa E. Lillo & Juan Romo, 2019. "A Kendall correlation coefficient between functional data," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 13(4), pages 1083-1103, December.
    12. Kline, Patrick & Walters, Christopher, 2019. "Audits as Evidence: Experiments, Ensembles, and Enforcement," Institute for Research on Labor and Employment, Working Paper Series qt3z72m9kn, Institute of Industrial Relations, UC Berkeley.
    13. He, Yi & Pan, Wei & Lin, Jizhen, 2006. "Cluster analysis using multivariate normal mixture models to detect differential gene expression with microarray data," Computational Statistics & Data Analysis, Elsevier, vol. 51(2), pages 641-658, November.
    14. Cheng, Cheng, 2009. "Internal validation inferences of significant genomic features in genome-wide screening," Computational Statistics & Data Analysis, Elsevier, vol. 53(3), pages 788-800, January.
    15. Sinjini Sikdar & Somnath Datta & Susmita Datta, 2017. "EAMA: Empirically adjusted meta-analysis for large-scale simultaneous hypothesis testing in genomic experiments," PLOS ONE, Public Library of Science, vol. 12(10), pages 1-19, October.
    16. Lucia Taraborrelli & Yasin Şenbabaoğlu & Lifen Wang & Junghyun Lim & Kerrigan Blake & Noelyn Kljavin & Sarah Gierke & Alexis Scherl & James Ziai & Erin McNamara & Mark Owyong & Shilpa Rao & Aslihan Ka, 2023. "Tumor-intrinsic expression of the autophagy gene Atg16l1 suppresses anti-tumor immunity in colorectal cancer," Nature Communications, Nature, vol. 14(1), pages 1-17, December.
    17. Greg Holmes & Ana S. Gonzalez-Reiche & Madrikha Saturne & Susan M. Motch Perrine & Xianxiao Zhou & Ana C. Borges & Bhavana Shewale & Joan T. Richtsmeier & Bin Zhang & Harm Bakel & Ethylin Wang Jabs, 2021. "Single-cell analysis identifies a key role for Hhip in murine coronal suture development," Nature Communications, Nature, vol. 12(1), pages 1-16, December.
    18. Xiang, Qinfang & Edwards, Jode & Gadbury, Gary L., 2006. "Interval estimation in a finite mixture model: Modeling P-values in multiple testing applications," Computational Statistics & Data Analysis, Elsevier, vol. 51(2), pages 570-586, November.
    19. Gordon, Alexander & Chen, Linlin & Glazko, Galina & Yakovlev, Andrei, 2009. "Balancing type one and two errors in multiple testing for differential expression of genes," Computational Statistics & Data Analysis, Elsevier, vol. 53(5), pages 1622-1629, March.
    20. Michael J. P. Crowley & Bhavneet Bhinder & Geoffrey J. Markowitz & Mitchell Martin & Akanksha Verma & Tito A. Sandoval & Chang-Suk Chae & Shira Yomtoubian & Yang Hu & Sahil Chopra & Diamile A. Tavarez, 2023. "Tumor-intrinsic IRE1α signaling controls protective immunity in lung cancer," Nature Communications, Nature, vol. 14(1), pages 1-16, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1006391. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.