IDEAS home Printed from https://ideas.repec.org/a/plo/pcbi00/1011854.html
   My bibliography  Save this article

scaDA: A novel statistical method for differential analysis of single-cell chromatin accessibility sequencing data

Author

Listed:
  • Fengdi Zhao
  • Xin Ma
  • Bing Yao
  • Qing Lu
  • Li Chen

Abstract

Single-cell ATAC-seq sequencing data (scATAC-seq) has been widely used to investigate chromatin accessibility on the single-cell level. One important application of scATAC-seq data analysis is differential chromatin accessibility (DA) analysis. However, the data characteristics of scATAC-seq such as excessive zeros and large variability of chromatin accessibility across cells impose a unique challenge for DA analysis. Existing statistical methods focus on detecting the mean difference of the chromatin accessible regions while overlooking the distribution difference. Motivated by real data exploration that distribution difference exists among cell types, we introduce a novel composite statistical test named “scaDA”, which is based on zero-inflated negative binomial model (ZINB), for performing differential distribution analysis of chromatin accessibility by jointly testing the abundance, prevalence and dispersion simultaneously. Benefiting from both dispersion shrinkage and iterative refinement of mean and prevalence parameter estimates, scaDA demonstrates its superiority to both ZINB-based likelihood ratio tests and published methods by achieving the highest power and best FDR control in a comprehensive simulation study. In addition to demonstrating the highest power in three real sc-multiome data analyses, scaDA successfully identifies differentially accessible regions in microglia from sc-multiome data for an Alzheimer’s disease (AD) study that are most enriched in GO terms related to neurogenesis and the clinical phenotype of AD, and AD-associated GWAS SNPs.Author summary: Understanding the cis-regulatory elements that control the fundamental gene regulatory process is important to basic biology. scATAC-seq data offers an unprecedented opportunity to investigate chromatin accessibility on the single-cell level and explore cell heterogeneity to reveal the dynamic changes of cis-regulatory elements among different cell types. To understand the dynamic change of gene regulation using scATAC-seq data, differential chromatin accessibility (DA) analysis, which is one of the most fundamental analyses for scATAC-seq data, can enable the identification of differentially accessible regions between cell types or between multiple conditions. Subsequently, DA analysis has many applications such as identifying cell type-specific chromatin accessible regions to reveal the cell type-specific gene regulatory program, assessing disease-associated changes in chromatin accessibility to detect potential biomarkers, and linking differentially accessible regions to differentially expressed genes for building a comprehensive gene regulatory map. This paper proposes a novel statistical method named “scaDA” to improve the detection of differentially accessible regions by performing differential distribution analysis. scaDA is believed to benefit the research community of single-cell genomics.

Suggested Citation

  • Fengdi Zhao & Xin Ma & Bing Yao & Qing Lu & Li Chen, 2024. "scaDA: A novel statistical method for differential analysis of single-cell chromatin accessibility sequencing data," PLOS Computational Biology, Public Library of Science, vol. 20(8), pages 1-28, August.
  • Handle: RePEc:plo:pcbi00:1011854
    DOI: 10.1371/journal.pcbi.1011854
    as

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1011854
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1011854&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pcbi.1011854?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Yoshiharu Muto & Parker C. Wilson & Nicolas Ledru & Haojia Wu & Henrik Dimke & Sushrut S. Waikar & Benjamin D. Humphreys, 2021. "Single cell transcriptional and chromatin accessibility profiling redefine cellular heterogeneity in the adult human kidney," Nature Communications, Nature, vol. 12(1), pages 1-17, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Zhenchao Tang & Guanxing Chen & Shouzhi Chen & Jianhua Yao & Linlin You & Calvin Yu-Chian Chen, 2024. "Modal-nexus auto-encoder for multi-modality cellular data integration and imputation," Nature Communications, Nature, vol. 15(1), pages 1-15, December.
    2. Caitriona M. McEvoy & Julia M. Murphy & Lin Zhang & Sergi Clotet-Freixas & Jessica A. Mathews & James An & Mehran Karimzadeh & Delaram Pouyabahar & Shenghui Su & Olga Zaslaver & Hannes Röst & Rangi Ar, 2022. "Single-cell profiling of healthy human kidney reveals features of sex-based transcriptional programs and tissue-specific immunity," Nature Communications, Nature, vol. 13(1), pages 1-18, December.
    3. Nicolas Ledru & Parker C. Wilson & Yoshiharu Muto & Yasuhiro Yoshimura & Haojia Wu & Dian Li & Amish Asthana & Stefan G. Tullius & Sushrut S. Waikar & Giuseppe Orlando & Benjamin D. Humphreys, 2024. "Predicting proximal tubule failed repair drivers through regularized regression analysis of single cell multiomic sequencing," Nature Communications, Nature, vol. 15(1), pages 1-19, December.
    4. Akihiko Fukagawa & Natsuko Hama & Yasushi Totoki & Hiromi Nakamura & Yasuhito Arai & Mihoko Saito-Adachi & Akiko Maeshima & Yoshiyuki Matsui & Shinichi Yachida & Tetsuo Ushiku & Tatsuhiro Shibata, 2023. "Genomic and epigenomic integrative subtypes of renal cell carcinoma in a Japanese cohort," Nature Communications, Nature, vol. 14(1), pages 1-15, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1011854. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.