IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v16y2025i1d10.1038_s41467-025-64259-4.html
   My bibliography  Save this article

Data-driven fine-grained region discovery in the mouse brain with transformers

Author

Listed:
  • Alex J. Lee

    (University of California
    UCSF Weill Institute for Neurosciences)

  • Alma Dubuc

    (University of California
    UCSF Weill Institute for Neurosciences)

  • Michael Kunst

    (Allen Institute for Brain Science)

  • Shenqin Yao

    (Allen Institute for Brain Science)

  • Nicholas Lusk

    (Allen Institute for Brain Science)

  • Lydia Ng

    (Allen Institute for Brain Science)

  • Hongkui Zeng

    (Allen Institute for Brain Science)

  • Bosiljka Tasic

    (Allen Institute for Brain Science)

  • Reza Abbasi-Asl

    (University of California
    UCSF Weill Institute for Neurosciences
    University of California)

Abstract

Spatial transcriptomics offers unique opportunities to define the spatial organization of tissues and organs, such as the mouse brain. We address a key bottleneck in the analysis of organ-scale spatial transcriptomic data by establishing a workflow for self-supervised spatial domain detection that is scalable to multimillion-cell datasets. This workflow uses a self-supervised framework for learning latent representations of tissue spatial domains or niches. We use an encoder-decoder architecture, which we named CellTransformer, to hierarchically learn higher-order tissue features from lower-level cellular and molecular statistical patterns. Coupling our representation learning workflow with minibatched GPU-accelerated clustering algorithms allows us to scale to multi-million cell MERFISH datasets where other methods cannot. CellTransformer is effective at integrating cells across tissue sections, identifying domains highly similar to ones in existing ontologies such as Allen Mouse Brain Common Coordinate Framework (CCF) while allowing discovery of hundreds of uncataloged areas with minimal loss of domain spatial coherence. CellTransformer domains recapitulate previous neuroanatomical studies of areas in the subiculum and superior colliculus and characterize putatively uncataloged subregions in subcortical areas, which currently lack subregion annotation. CellTransformer is also capable of domain discovery in whole-brain Slide-seqV2 datasets. Our workflows enable complex multi-animal analyses, achieving nearly perfect consistency of up to 100 spatial domains in a dataset of four individual mice with nine million cells across more than 200 tissue sections. CellTransformer advances the state of the art for spatial transcriptomics by providing a performant solution for the detection of fine-grained tissue domains from spatial transcriptomics data.

Suggested Citation

  • Alex J. Lee & Alma Dubuc & Michael Kunst & Shenqin Yao & Nicholas Lusk & Lydia Ng & Hongkui Zeng & Bosiljka Tasic & Reza Abbasi-Asl, 2025. "Data-driven fine-grained region discovery in the mouse brain with transformers," Nature Communications, Nature, vol. 16(1), pages 1-16, December.
  • Handle: RePEc:nat:natcom:v:16:y:2025:i:1:d:10.1038_s41467-025-64259-4
    DOI: 10.1038/s41467-025-64259-4
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-025-64259-4
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-025-64259-4?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Xinyi Zhang & Xiao Wang & G. V. Shivashankar & Caroline Uhler, 2022. "Graph-based autoencoder integrates spatial transcriptomics with chromatin images and identifies joint biomarkers for Alzheimer’s disease," Nature Communications, Nature, vol. 13(1), pages 1-17, December.
    2. Kangning Dong & Shihua Zhang, 2022. "Deciphering spatial domains from spatially resolved transcriptomics with an adaptive graph attention auto-encoder," Nature Communications, Nature, vol. 13(1), pages 1-12, December.
    3. Hailing Shi & Yichun He & Yiming Zhou & Jiahao Huang & Kamal Maher & Brandon Wang & Zefang Tang & Shuchen Luo & Peng Tan & Morgan Wu & Zuwan Lin & Jingyi Ren & Yaman Thapa & Xin Tang & Ken Y. Chan & B, 2023. "Spatial atlas of the mouse central nervous system at molecular resolution," Nature, Nature, vol. 622(7983), pages 552-561, October.
    4. Trygve E. Bakken & Nikolas L. Jorstad & Qiwen Hu & Blue B. Lake & Wei Tian & Brian E. Kalmbach & Megan Crow & Rebecca D. Hodge & Fenna M. Krienen & Staci A. Sorensen & Jeroen Eggermont & Zizhen Yao & , 2021. "Comparative cellular analysis of motor cortex in human, marmoset and mouse," Nature, Nature, vol. 598(7879), pages 111-119, October.
    5. Nora L. Benavidez & Michael S. Bienkowski & Muye Zhu & Luis H. Garcia & Marina Fayzullina & Lei Gao & Ian Bowman & Lin Gou & Neda Khanjani & Kaelan R. Cotter & Laura Korobkova & Marlene Becerra & Chun, 2021. "Organization of the inputs and outputs of the mouse superior colliculus," Nature Communications, Nature, vol. 12(1), pages 1-20, December.
    6. Seung Wook Oh & Julie A. Harris & Lydia Ng & Brent Winslow & Nicholas Cain & Stefan Mihalas & Quanxin Wang & Chris Lau & Leonard Kuan & Alex M. Henry & Marty T. Mortrud & Benjamin Ouellette & Thuc Ngh, 2014. "A mesoscale connectome of the mouse brain," Nature, Nature, vol. 508(7495), pages 207-214, April.
    7. Jonah Langlieb & Nina S. Sachdev & Karol S. Balderrama & Naeem M. Nadaf & Mukund Raj & Evan Murray & James T. Webber & Charles Vanderburg & Vahid Gazestani & Daniel Tward & Chris Mezias & Xu Li & Kate, 2023. "The molecular cytoarchitecture of the adult mouse brain," Nature, Nature, vol. 624(7991), pages 333-342, December.
    8. Zizhen Yao & Cindy T. J. Velthoven & Michael Kunst & Meng Zhang & Delissa McMillen & Changkyu Lee & Won Jung & Jeff Goldy & Aliya Abdelhak & Matthew Aitken & Katherine Baker & Pamela Baker & Eliza Bar, 2023. "A high-resolution transcriptomic and spatial atlas of cell types in the whole mouse brain," Nature, Nature, vol. 624(7991), pages 317-332, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Lite Yang & Fang Liu & Hannah Hahm & Takao Okuda & Xiaoyue Li & Yufen Zhang & Vani Kalyanaraman & Monique R. Heitmeier & Vijay K. Samineni, 2025. "Projection-TAGs enable multiplex projection tracing and multi-modal profiling of projection neurons," Nature Communications, Nature, vol. 16(1), pages 1-18, December.
    2. Leon Fodoulian & Madlaina Boillat & Marie Moulinier & Alan Carleton & Ivan Rodriguez, 2025. "A spatial single-cell atlas of the claustro-insular region uncovers key regulators of neuronal identity and excitability," Nature Communications, Nature, vol. 16(1), pages 1-23, December.
    3. Chen-Rui Xia & Zhi-Jie Cao & Ge Gao, 2025. "DECIPHER for learning disentangled cellular embeddings in large-scale heterogeneous spatial omics data," Nature Communications, Nature, vol. 16(1), pages 1-13, December.
    4. Hongru Hu & Gerald Quon, 2024. "scPair: Boosting single cell multimodal analysis by leveraging implicit feature selection and single cell atlases," Nature Communications, Nature, vol. 15(1), pages 1-19, December.
    5. Mengting Zhao & Jiandong Zhou & Tao Jiang & Miao Ren & Chuhao Dou & Lingyi Cai & Shengda Bao & Xueyan Jia & Zhaoyang Meng & Feifang Tang & Zhao Feng & Jing Yuan & Xiaoquan Yang & Hui Gong & Qingming L, 2025. "Molecularly defined cellular atlas of the entire mouse brain with isotropic single-cell resolution," Nature Communications, Nature, vol. 16(1), pages 1-20, December.
    6. Jingyang Qian & Xin Shao & Hudong Bao & Yin Fang & Wenbo Guo & Chengyu Li & Anyao Li & Hua Hua & Xiaohui Fan, 2025. "Identification and characterization of cell niches in tissue from spatial omics data at single-cell resolution," Nature Communications, Nature, vol. 16(1), pages 1-21, December.
    7. Yufeng Liu & Shengdian Jiang & Yingxin Li & Sujun Zhao & Zhixi Yun & Zuo-Han Zhao & Lingli Zhang & Gaoyu Wang & Xin Chen & Linus Manubens-Gil & Yuning Hang & Qiaobo Gong & Yuanyuan Li & Penghao Qian &, 2024. "Neuronal diversity and stereotypy at multiple scales through whole brain morphometry," Nature Communications, Nature, vol. 15(1), pages 1-23, December.
    8. Hao Xu & Shuyan Wang & Minghao Fang & Songwen Luo & Chunpeng Chen & Siyuan Wan & Rirui Wang & Meifang Tang & Tian Xue & Bin Li & Jun Lin & Kun Qu, 2023. "SPACEL: deep learning-based characterization of spatial transcriptome architectures," Nature Communications, Nature, vol. 14(1), pages 1-18, December.
    9. Ankit Agrawal & Stefan Thomann & Sukanya Basu & Dominic Grün, 2024. "NiCo identifies extrinsic drivers of cell state modulation by niche covariation analysis," Nature Communications, Nature, vol. 15(1), pages 1-22, December.
    10. Yuqiu Zhou & Wei He & Weizhen Hou & Ying Zhu, 2024. "Pianno: a probabilistic framework automating semantic annotation for spatial transcriptomics," Nature Communications, Nature, vol. 15(1), pages 1-15, December.
    11. Leslie J. Sibener & Alice C. Mosberger & Tiffany X. Chen & Vivek R. Athalye & James M. Murray & Rui M. Costa, 2025. "Dissociable roles of distinct thalamic circuits in learning reaches to spatial targets," Nature Communications, Nature, vol. 16(1), pages 1-18, December.
    12. Lina Marcela Carmona & Anders Nelson & Lin T. Tun & An Kim & Rani Shiao & Michael D. Kissner & Vilas Menon & Rui M. Costa, 2025. "Corticothalamic neurons in motor cortex have a permissive role in motor execution," Nature Communications, Nature, vol. 16(1), pages 1-15, December.
    13. Federico Malizia & Santiago Lamata-Otín & Mattia Frasca & Vito Latora & Jesús Gómez-Gardeñes, 2025. "Hyperedge overlap drives explosive transitions in systems with higher-order interactions," Nature Communications, Nature, vol. 16(1), pages 1-10, December.
    14. Yuchen Liang & Guowei Shi & Runlin Cai & Yuchen Yuan & Ziying Xie & Long Yu & Yingjian Huang & Qian Shi & Lizhe Wang & Jun Li & Zhonghui Tang, 2024. "PROST: quantitative identification of spatially variable genes and domain detection in spatial transcriptomics," Nature Communications, Nature, vol. 15(1), pages 1-17, December.
    15. Wenqi Chen & Jiejunyi Liang & Qiyun Wu & Yunyun Han, 2024. "Anterior cingulate cortex provides the neural substrates for feedback-driven iteration of decision and value representation," Nature Communications, Nature, vol. 15(1), pages 1-15, December.
    16. Sydney C Weiser & Brian R Mullen & Desiderio Ascencio & James B Ackman, 2023. "Data-driven segmentation of cortical calcium dynamics," PLOS Computational Biology, Public Library of Science, vol. 19(5), pages 1-36, May.
    17. Wen-Hao Zhang & Si Wu & Krešimir Josić & Brent Doiron, 2023. "Sampling-based Bayesian inference in recurrent circuits of stochastic spiking neurons," Nature Communications, Nature, vol. 14(1), pages 1-19, December.
    18. Dandan Geng & Yaning Li & Bo Yang & Li Zhang & Huating Gu & Tianyun Zhang & Zijie Zhao & Hui Liu & Qingzhuo Cui & Rong Zheng & Peng Cao & Fan Zhang, 2025. "Cholecystokinin neurons in the spinal trigeminal nucleus interpolaris regulate mechanically evoked predatory hunting in male mice," Nature Communications, Nature, vol. 16(1), pages 1-15, December.
    19. Danni Hong & Muya Shu & Jiamao Liu & Lifang Liu & Hao Cheng & Ming Zhu & Yi Du & Bo Xu & Di Hu & Zhiyong Liu & Yannan Zhao & Jianwu Dai & Falong Lu & Jialiang Huang, 2025. "Divergent combinations of enhancers encode spatial gene expression," Nature Communications, Nature, vol. 16(1), pages 1-17, December.
    20. Sungyong Um & Bin Zhang & Sunil Wattal & Youngjin Yoo, 2023. "Software Components and Product Variety in a Platform Ecosystem: A Dynamic Network Analysis of WordPress," Information Systems Research, INFORMS, vol. 34(4), pages 1339-1374, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:16:y:2025:i:1:d:10.1038_s41467-025-64259-4. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.