
Fast and precise single-cell data analysis using a hierarchical autoencoder

Author

Listed:
  • Duc Tran

    (University of Nevada Reno)

  • Hung Nguyen

    (University of Nevada Reno)

  • Bang Tran

    (University of Nevada Reno)

  • Carlo La Vecchia

    (University of Milan)

  • Hung N. Luu

    (Division of Cancer Control and Population Sciences, Hillman Cancer Center, University of Pittsburgh Medical Center; University of Pittsburgh Graduate School of Public Health)

  • Tin Nguyen

    (University of Nevada Reno)

Abstract

A primary challenge in single-cell RNA sequencing (scRNA-seq) studies comes from the massive amount of data and the excess noise level. To address this challenge, we introduce an analysis framework, named single-cell Decomposition using Hierarchical Autoencoder (scDHA), that reliably extracts representative information of each cell. The scDHA pipeline consists of two core modules. The first module is a non-negative kernel autoencoder that removes genes or components that contribute insignificantly to the part-based representation of the data. The second module is a stacked Bayesian autoencoder that projects the data onto a low-dimensional (compressed) space. To reduce the tendency of neural networks to overfit, we repeatedly perturb the compressed space to learn a more generalized representation of the data. In an extensive analysis, we demonstrate that scDHA outperforms state-of-the-art techniques in many research sub-fields of scRNA-seq analysis, including cell segregation through unsupervised learning, visualization of transcriptome landscape, cell classification, and pseudo-time inference.
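
The two-module design described in the abstract can be illustrated with a toy sketch. This is not the scDHA implementation: it replaces both neural networks with small linear autoencoders trained by plain gradient descent, and it replaces the Bayesian (variational) module with a fixed amount of Gaussian noise added to the compressed space at every step. The function name `train_linear_ae`, the synthetic data, the gene-scoring rule (variance of the non-negative encoder weights), and all sizes and learning rates are assumptions made for illustration only.

```python
import numpy as np

def train_linear_ae(X, k, *, nonneg=False, noise_sd=0.0, lr=0.005, epochs=400, seed=0):
    """Toy linear autoencoder: X ~ (X @ W_enc + noise) @ W_dec.

    nonneg=True projects the encoder weights onto [0, inf) after each step.
    noise_sd > 0 perturbs the compressed space at every step.
    """
    rng = np.random.default_rng(seed)
    n, g = X.shape
    W_enc = rng.uniform(0.0, 0.1, size=(g, k))
    W_dec = rng.uniform(0.0, 0.1, size=(k, g))
    for _ in range(epochs):
        # Encode, then perturb the latent codes (no-op when noise_sd == 0).
        Z = X @ W_enc + noise_sd * rng.standard_normal((n, k))
        E = Z @ W_dec - X  # reconstruction error
        # Gradients of the mean squared reconstruction error.
        grad_dec = 2.0 * Z.T @ E / n
        grad_enc = 2.0 * X.T @ (E @ W_dec.T) / n
        W_dec -= lr * grad_dec
        W_enc -= lr * grad_enc
        if nonneg:
            np.maximum(W_enc, 0.0, out=W_enc)
    return W_enc, W_dec

# Synthetic "expression" data (hypothetical): two cell groups, 40 genes,
# of which genes 0-9 and 10-19 are elevated in one group each.
rng = np.random.default_rng(42)
n_cells, n_genes = 80, 40
labels = np.repeat([0, 1], n_cells // 2)
rates = np.full((2, n_genes), 2.0)
rates[0, :10] = 8.0
rates[1, 10:20] = 8.0
X = np.log1p(rng.poisson(rates[labels]))
X = (X - X.min(axis=0)) / (X.max(axis=0) - X.min(axis=0) + 1e-9)

# Module 1 (stand-in): non-negative autoencoder used to score and filter genes.
W1, D1 = train_linear_ae(X, k=5, nonneg=True)
gene_score = W1.var(axis=1)  # assumed scoring rule, for illustration
keep = np.sort(np.argsort(gene_score)[-20:])
X_sel = X[:, keep]

# Module 2 (stand-in): compress the filtered data while perturbing the latent space.
W2, D2 = train_linear_ae(X_sel, k=2, noise_sd=0.1, seed=1)
Z = X_sel @ W2  # noise-free 2-D representation of each cell

print("genes kept:", keep.shape[0], "latent shape:", Z.shape)
```

The resulting matrix `Z` plays the role of the compressed representation that downstream steps such as clustering or visualization would consume. A real implementation would use nonlinear layers and a proper variational objective.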

Suggested Citation

  • Duc Tran & Hung Nguyen & Bang Tran & Carlo La Vecchia & Hung N. Luu & Tin Nguyen, 2021. "Fast and precise single-cell data analysis using a hierarchical autoencoder," Nature Communications, Nature, vol. 12(1), pages 1-10, December.
  • Handle: RePEc:nat:natcom:v:12:y:2021:i:1:d:10.1038_s41467-021-21312-2
    DOI: 10.1038/s41467-021-21312-2

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-021-21312-2
    File Function: Abstract
    Download Restriction: no


    Citations

    Cited by:

    1. Scott R. Tyler & Daniel Lozano-Ojalvo & Ernesto Guccione & Eric E. Schadt, 2024. "Anti-correlated feature selection prevents false discovery of subpopulations in scRNAseq," Nature Communications, Nature, vol. 15(1), pages 1-15, December.
    2. Thomas Hu & Mayar Allam & Shuangyi Cai & Walter Henderson & Brian Yueh & Aybuke Garipcan & Anton V. Ievlev & Maryam Afkarian & Semir Beyaz & Ahmet F. Coskun, 2023. "Single-cell spatial metabolomics with cell-type specific protein profiling for tissue systems biology," Nature Communications, Nature, vol. 14(1), pages 1-20, December.
    3. Yasa Baig & Helena R. Ma & Helen Xu & Lingchong You, 2023. "Autoencoder neural networks enable low dimensional structure analyses of microbial growth dynamics," Nature Communications, Nature, vol. 14(1), pages 1-17, December.
    4. Zhuohan Yu & Yanchi Su & Yifu Lu & Yuning Yang & Fuzhou Wang & Shixiong Zhang & Yi Chang & Ka-Chun Wong & Xiangtao Li, 2023. "Topological identification and interpretation for single-cell gene regulation elucidation across multiple platforms using scMGCA," Nature Communications, Nature, vol. 14(1), pages 1-18, December.
