
Attention-based deep clustering method for scRNA-seq cell type identification

Author

Listed:
  • Shenghao Li
  • Hui Guo
  • Simai Zhang
  • Yizhou Li
  • Menglong Li

Abstract

Single-cell RNA sequencing (scRNA-seq) technology provides higher resolution of cellular differences than bulk RNA sequencing and reveals heterogeneity in biological research. The analysis of scRNA-seq datasets is premised on subpopulation assignment. When an appropriate reference, such as specific marker genes or a single-cell reference atlas, is not available, unsupervised clustering approaches become the predominant option. However, the inherent sparsity and high dimensionality of scRNA-seq datasets pose specific analytical challenges to traditional clustering methods. Therefore, various deep learning-based methods have been proposed to address these challenges. As each method offers only partial improvements, a more comprehensive method is needed. In this article, we propose a novel scRNA-seq data clustering method named AttentionAE-sc (Attention fusion AutoEncoder for single-cell). Two different scRNA-seq clustering strategies are combined through an attention mechanism: zero-inflated negative binomial (ZINB)-based methods, which deal with the impact of dropout events, and graph autoencoder (GAE)-based methods, which rely on information from neighbors to guide dimension reduction. Based on an iterative fusion between denoising and topological embeddings, AttentionAE-sc can easily acquire clustering-friendly cell representations in which similar cells are closer in the hidden embedding. Compared with several state-of-the-art baseline methods, AttentionAE-sc demonstrated excellent clustering performance on 16 real scRNA-seq datasets without the need to specify the number of groups. Additionally, AttentionAE-sc learned improved cell representations and exhibited enhanced stability and robustness. Furthermore, AttentionAE-sc achieved remarkable identification in a breast cancer single-cell atlas dataset and provided valuable insights into the heterogeneity among different cell subtypes.

Author summary

Single-cell RNA sequencing (scRNA-seq) has been widely used in numerous biological studies to reveal heterogeneity at the cellular level. Accurate cell type identification serves as the foundation for scRNA-seq data analysis, and unsupervised cluster analysis is commonly employed when an appropriate reference is not available. However, the inherent sparsity and high dimensionality of scRNA-seq datasets pose specific analytical challenges to traditional clustering methods. To address this, we propose a novel scRNA-seq data clustering method named AttentionAE-sc (Attention fusion AutoEncoder for single-cell). By integrating denoising representation learning and cluster-friendly representation learning through an attention mechanism, AttentionAE-sc demonstrated outstanding performance in the evaluation phase. First, when compared with benchmark methods on the real scRNA-seq datasets, AttentionAE-sc consistently achieved superior external and internal clustering evaluation metrics. Second, AttentionAE-sc exhibited robustness and stability across various experimental conditions. Lastly, AttentionAE-sc not only delivered excellent clustering results but also unveiled potential biological insights on a breast cancer single-cell atlas dataset.
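The abstract refers to zero-inflated negative binomial (ZINB) modelling of dropout events. As a rough illustration only, the sketch below writes out the textbook ZINB probability mass function and the corresponding negative log-likelihood in plain Python. It is an assumption that AttentionAE-sc uses this standard mean/dispersion parameterization; the function names (nb_log_pmf, zinb_pmf, zinb_nll) and the parameter names (mu, theta, pi) are hypothetical and do not come from the article.

```python
import math


def nb_log_pmf(x: int, mu: float, theta: float) -> float:
    """Log-probability of count x under a negative binomial
    with mean mu > 0 and dispersion theta > 0 (textbook form)."""
    return (
        math.lgamma(x + theta) - math.lgamma(theta) - math.lgamma(x + 1)
        + theta * math.log(theta / (theta + mu))
        + x * math.log(mu / (theta + mu))
    )


def zinb_pmf(x: int, mu: float, theta: float, pi: float) -> float:
    """Probability of count x under a ZINB model.

    pi is the extra probability of a structural zero (a "dropout"),
    so zeros can come either from dropout or from the NB itself.
    """
    nb = math.exp(nb_log_pmf(x, mu, theta))
    if x == 0:
        return pi + (1.0 - pi) * nb
    return (1.0 - pi) * nb


def zinb_nll(counts, mu: float, theta: float, pi: float) -> float:
    """Negative log-likelihood of a list of counts that share one
    (mu, theta, pi); a real model would predict these per gene and cell."""
    return -sum(math.log(zinb_pmf(x, mu, theta, pi)) for x in counts)


if __name__ == "__main__":
    # Hypothetical expression counts for one gene across six cells.
    counts = [0, 0, 3, 0, 7, 1]
    print(zinb_nll(counts, mu=3.0, theta=2.0, pi=0.3))
```

In a ZINB-based autoencoder of the kind the abstract describes, the decoder would typically output mu, theta and pi for every gene in every cell, and a loss of this form would be summed over the whole count matrix; the scalar parameters here are a simplification to keep the sketch short.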

Suggested Citation

  • Shenghao Li & Hui Guo & Simai Zhang & Yizhou Li & Menglong Li, 2023. "Attention-based deep clustering method for scRNA-seq cell type identification," PLOS Computational Biology, Public Library of Science, vol. 19(11), pages 1-19, November.
  • Handle: RePEc:plo:pcbi00:1011641
    DOI: 10.1371/journal.pcbi.1011641

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1011641
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1011641&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pcbi.1011641?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item


    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Fei Wang & Peiwen Ding & Xue Liang & Xiangning Ding & Camilla Blunk Brandt & Evelina Sjöstedt & Jiacheng Zhu & Saga Bolund & Lijing Zhang & Laura P. M. H. Rooij & Lihua Luo & Yanan Wei & Wandong Zhao, 2022. "Endothelial cell heterogeneity and microglia regulons revealed by a pig cell landscape at single-cell level," Nature Communications, Nature, vol. 13(1), pages 1-18, December.
    2. Jingyang Qian & Hudong Bao & Xin Shao & Yin Fang & Jie Liao & Zhuo Chen & Chengyu Li & Wenbo Guo & Yining Hu & Anyao Li & Yue Yao & Xiaohui Fan & Yiyu Cheng, 2024. "Simulating multiple variability in spatially resolved transcriptomics with scCube," Nature Communications, Nature, vol. 15(1), pages 1-21, December.
    3. Xiaochen Wang & Maosheng Cheng & Shuang Chen & Caihua Zhang & Rongsong Ling & Shuqing Qiu & Ke Chen & Bin Zhou & Qiuli Li & Wenbin Lei & Demeng Chen, 2025. "Resistance to anti-LAG-3 plus anti-PD-1 therapy in head and neck cancer is mediated by Sox9+ tumor cells interaction with Fpr1+ neutrophils," Nature Communications, Nature, vol. 16(1), pages 1-20, December.
    4. Yanchuan Li & Huamei Li & Cheng Peng & Ge Meng & Yijun Lu & Honglin Liu & Li Cui & Huan Zhou & Zhu Xu & Lingyun Sun & Lihong Liu & Qing Xiong & Beicheng Sun & Shiping Jiao, 2024. "Unraveling the spatial organization and development of human thymocytes through integration of spatial transcriptomics and single-cell multi-omics profiling," Nature Communications, Nature, vol. 15(1), pages 1-25, December.
    5. Tim Flerlage & Jeremy Chase Crawford & E. Kaitlynn Allen & Danielle Severns & Shaoyuan Tan & Sherri Surman & Granger Ridout & Tanya Novak & Adrienne Randolph & Alina N. West & Paul G. Thomas, 2023. "Single cell transcriptomics identifies distinct profiles in pediatric acute respiratory distress syndrome," Nature Communications, Nature, vol. 14(1), pages 1-18, December.
    6. Shirong Cao & Yu Pan & Andrew S. Terker & Juan Pablo Arroyo Ornelas & Yinqiu Wang & Jiaqi Tang & Aolei Niu & Sarah Abu Kar & Mengdi Jiang & Wentian Luo & Xinyu Dong & Xiaofeng Fan & Suwan Wang & Matth, 2023. "Epidermal growth factor receptor activation is essential for kidney fibrosis development," Nature Communications, Nature, vol. 14(1), pages 1-17, December.
    7. Christopher Bono & Yang Liu & Alexander Ferrena & Aneesa Valentine & Deyou Zheng & Bernice E. Morrow, 2023. "Single-cell transcriptomics uncovers a non-autonomous Tbx1-dependent genetic program controlling cardiac neural crest cell development," Nature Communications, Nature, vol. 14(1), pages 1-20, December.
    8. Yue Pang & Yating Qin & Zeyu Du & Qun Liu & Jin Zhang & Kai Han & Jiali Lu & Zengbao Yuan & Jun Li & Shanshan Pan & Xinrui Dong & Mengyang Xu & Dantong Wang & Shuo Li & Zhen Li & Yadong Chen & Zhishen, 2025. "Single-cell transcriptome atlas of lamprey exploring Natterin-induced white adipose tissue browning," Nature Communications, Nature, vol. 16(1), pages 1-14, December.
    9. Jingyang Qian & Xin Shao & Hudong Bao & Yin Fang & Wenbo Guo & Chengyu Li & Anyao Li & Hua Hua & Xiaohui Fan, 2025. "Identification and characterization of cell niches in tissue from spatial omics data at single-cell resolution," Nature Communications, Nature, vol. 16(1), pages 1-21, December.
    10. Lichun Ma & Sophia Heinrich & Limin Wang & Friederike L. Keggenhoff & Subreen Khatib & Marshonna Forgues & Michael Kelly & Stephen M. Hewitt & Areeba Saif & Jonathan M. Hernandez & Donna Mabry & Roman, 2022. "Multiregional single-cell dissection of tumor and immune cells reveals stable lock-and-key features in liver cancer," Nature Communications, Nature, vol. 13(1), pages 1-17, December.
    11. Qingnan Liang & Yuefan Huang & Shan He & Ken Chen, 2023. "Pathway centric analysis for single-cell RNA-seq and spatial transcriptomics data with GSDensity," Nature Communications, Nature, vol. 14(1), pages 1-17, December.
    12. Faith H. Brennan & Yang Li & Cankun Wang & Anjun Ma & Qi Guo & Yi Li & Nicole Pukos & Warren A. Campbell & Kristina G. Witcher & Zhen Guan & Kristina A. Kigerl & Jodie C. E. Hall & Jonathan P. Godbout, 2022. "Microglia coordinate cellular interactions during spinal cord repair in mice," Nature Communications, Nature, vol. 13(1), pages 1-20, December.
    13. Sandra Curras-Alonso & Juliette Soulier & Thomas Defard & Christian Weber & Sophie Heinrich & Hugo Laporte & Sophie Leboucher & Sonia Lameiras & Marie Dutreix & Vincent Favaudon & Florian Massip & Tho, 2023. "An interactive murine single-cell atlas of the lung responses to radiation injury," Nature Communications, Nature, vol. 14(1), pages 1-16, December.
    14. Ilmatar Rooda & Jasmin Hassan & Jie Hao & Magdalena Wagner & Elisabeth Moussaud-Lamodière & Kersti Jääger & Marjut Otala & Katri Knuus & Cecilia Lindskog & Kiriaki Papaikonomou & Sebastian Gidlöf & Ce, 2024. "In-depth analysis of transcriptomes in ovarian cortical follicles from children and adults reveals interfollicular heterogeneity," Nature Communications, Nature, vol. 15(1), pages 1-18, December.
    15. Moujtaba Y. Kasmani & Paytsar Topchyan & Ashley K. Brown & Ryan J. Brown & Xiaopeng Wu & Yao Chen & Achia Khatun & Donia Alson & Yue Wu & Robert Burns & Chien-Wei Lin & Matthew R. Kudek & Jie Sun & We, 2023. "A spatial sequencing atlas of age-induced changes in the lung during influenza infection," Nature Communications, Nature, vol. 14(1), pages 1-19, December.
    16. Wei Yang & Li-Bo Liu & Feng-Liang Liu & Yan-Hua Wu & Zi-Da Zhen & Dong-Ying Fan & Zi-Yang Sheng & Zheng-Ran Song & Jia-Tong Chang & Yong-Tang Zheng & Jing An & Pei-Gang Wang, 2023. "Single-cell RNA sequencing reveals the fragility of male spermatogenic cells to Zika virus-induced complement activation," Nature Communications, Nature, vol. 14(1), pages 1-17, December.
    17. Miles C. Andrews & Junna Oba & Chang-Jiun Wu & Haifeng Zhu & Tatiana Karpinets & Caitlin A. Creasy & Marie-Andrée Forget & Xiaoxing Yu & Xingzhi Song & Xizeng Mao & A. Gordon Robertson & Gabriele Roma, 2022. "Multi-modal molecular programs regulate melanoma cell state," Nature Communications, Nature, vol. 13(1), pages 1-18, December.
    18. Luke Simpson & Andrew Strange & Doris Klisch & Sophie Kraunsoe & Takuya Azami & Daniel Goszczynski & Triet Minh & Benjamin Planells & Nadine Holmes & Fei Sang & Sonal Henson & Matthew Loose & Jennifer, 2024. "A single-cell atlas of pig gastrulation as a resource for comparative embryology," Nature Communications, Nature, vol. 15(1), pages 1-16, December.
    19. Erick Armingol & Hratch M. Baghdassarian & Cameron Martino & Araceli Perez-Lopez & Caitlin Aamodt & Rob Knight & Nathan E. Lewis, 2022. "Context-aware deconvolution of cell–cell communication with Tensor-cell2cell," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    20. Maria E. Monberg & Heather Geiger & Jaewon J. Lee & Roshan Sharma & Alexander Semaan & Vincent Bernard & Justin Wong & Fang Wang & Shaoheng Liang & Daniel B. Swartzlander & Bret M. Stephens & Matthew , 2022. "Occult polyclonality of preclinical pancreatic cancer models drives in vitro evolution," Nature Communications, Nature, vol. 13(1), pages 1-16, December.
