IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0282171.html
   My bibliography  Save this article

Sparse representation learning derives biological features with explicit gene weights from the Allen Mouse Brain Atlas

Author

Listed:
  • Mohammad Abbasi
  • Connor R Sanderford
  • Narendiran Raghu
  • Mirjeta Pasha
  • Benjamin B Bartelle

Abstract

Unsupervised learning methods are commonly used to detect features within transcriptomic data and ultimately derive meaningful representations of biology. Contributions of individual genes to any feature however becomes convolved with each learning step, requiring follow up analysis and validation to understand what biology might be represented by a cluster on a low dimensional plot. We sought learning methods that could preserve the gene information of detected features, using the spatial transcriptomic data and anatomical labels of the Allen Mouse Brain Atlas as a test dataset with verifiable ground truth. We established metrics for accurate representation of molecular anatomy to find sparse learning approaches were uniquely capable of generating anatomical representations and gene weights in a single learning step. Fit to labeled anatomy was highly correlated with intrinsic properties of the data, offering a means to optimize parameters without established ground truth. Once representations were derived, complementary gene lists could be further compressed to generate a low complexity dataset, or to probe for individual features with >95% accuracy. We demonstrate the utility of sparse learning as a means to derive biologically meaningful representations from transcriptomic data and reduce the complexity of large datasets while preserving intelligible gene information throughout the analysis.

Suggested Citation

  • Mohammad Abbasi & Connor R Sanderford & Narendiran Raghu & Mirjeta Pasha & Benjamin B Bartelle, 2023. "Sparse representation learning derives biological features with explicit gene weights from the Allen Mouse Brain Atlas," PLOS ONE, Public Library of Science, vol. 18(3), pages 1-16, March.
  • Handle: RePEc:plo:pone00:0282171
    DOI: 10.1371/journal.pone.0282171
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0282171
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0282171&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0282171?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Dmitry Kobak & Philipp Berens, 2019. "The art of using t-SNE for single-cell transcriptomics," Nature Communications, Nature, vol. 10(1), pages 1-14, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Rong Ma & Eric D. Sun & James Zou, 2023. "A spectral method for assessing and combining multiple data visualizations," Nature Communications, Nature, vol. 14(1), pages 1-14, December.
    2. Kaiwen Wang & Yuqiu Yang & Fangjiang Wu & Bing Song & Xinlei Wang & Tao Wang, 2023. "Comparative analysis of dimension reduction methods for cytometry by time-of-flight data," Nature Communications, Nature, vol. 14(1), pages 1-18, December.
    3. L. Mathur & B. Szalai & N. H. Du & R. Utharala & M. Ballinger & J. J. M. Landry & M. Ryckelynck & V. Benes & J. Saez-Rodriguez & C. A. Merten, 2022. "Combi-seq for multiplexed transcriptome-based profiling of drug combinations using deterministic barcoding in single-cell droplets," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    4. Andriana Manousidaki & Anna Little & Yuying Xie, 2024. "Clustering and visualization of single-cell RNA-seq data using path metrics," PLOS Computational Biology, Public Library of Science, vol. 20(5), pages 1-19, May.
    5. Zhiyuan Yuan & Yisi Li & Minglei Shi & Fan Yang & Juntao Gao & Jianhua Yao & Michael Q. Zhang, 2022. "SOTIP is a versatile method for microenvironment modeling with spatial omics data," Nature Communications, Nature, vol. 13(1), pages 1-19, December.
    6. Ziyan Huang & Myung Chung & Kentaro Tao & Akiyuki Watarai & Mu-Yun Wang & Hiroh Ito & Teruhiro Okuyama, 2023. "Ventromedial prefrontal neurons represent self-states shaped by vicarious fear in male mice," Nature Communications, Nature, vol. 14(1), pages 1-16, December.
    7. Mikhael D. Manurung & Friederike Sonnet & Marie-Astrid Hoogerwerf & Jacqueline J. Janse & Yvonne Kruize & Laura de Bes-Roeleveld & Marion König & Alex Loukas & Benjamin G. Dewals & Taniawati Supali & , 2024. "Controlled human hookworm infection remodels plasmacytoid dendritic cells and regulatory T cells towards profiles seen in natural infections in endemic areas," Nature Communications, Nature, vol. 15(1), pages 1-14, December.
    8. Caitlin E. Carey & Rebecca Shafee & Robbee Wedow & Amanda Elliott & Duncan S. Palmer & John Compitello & Masahiro Kanai & Liam Abbott & Patrick Schultz & Konrad J. Karczewski & Samuel C. Bryant & Caro, 2024. "Principled distillation of UK Biobank phenotype data reveals underlying structure in human variation," Nature Human Behaviour, Nature, vol. 8(8), pages 1599-1615, August.
    9. Junzhan Jing & Ming Hu & Tenzin Ngodup & Qianqian Ma & Shu-Ning Natalie Lau & M. Cecilia Ljungberg & Matthew J. McGinley & Laurence O. Trussell & Xiaolong Jiang, 2025. "Molecular logic for cellular specializations that initiate the auditory parallel processing pathways," Nature Communications, Nature, vol. 16(1), pages 1-25, December.
    10. Jakob Woerner & Yidi Huang & Stephan Hutter & Carmelo Gurnari & Jesús María Hernández Sánchez & Janet Wang & Yimin Huang & Daniel Schnabel & Michael Aaby & Wanying Xu & Vedant Thorat & Dongxu Jiang & , 2022. "Circulating microbial content in myeloid malignancy patients is associated with disease subtypes and patient outcomes," Nature Communications, Nature, vol. 13(1), pages 1-13, December.
    11. Benjamin B. Sun & Stephanie J. Loomis & Fabrizio Pizzagalli & Natalia Shatokhina & Jodie N. Painter & Christopher N. Foley & Megan E. Jensen & Donald G. McLaren & Sai Spandana Chintapalli & Alyssa H. , 2022. "Genetic map of regional sulcal morphology in the human brain from UK biobank data," Nature Communications, Nature, vol. 13(1), pages 1-13, December.
    12. Qiu, Hailing & Tseng, Shuan Wei & Zhang, Xuan & Huang, Caiyan & Wu, Kuo-Jui, 2024. "Revealing the compound interrelationships toward sustainable transition in semiconductor supply chain: A sensitivity analysis," International Journal of Production Economics, Elsevier, vol. 271(C).
    13. Erik Hermansen & David A. Klindt & Benjamin A. Dunn, 2024. "Uncovering 2-D toroidal representations in grid cell ensemble activity during 1-D behavior," Nature Communications, Nature, vol. 15(1), pages 1-11, December.
    14. Guocheng Fang & Zhen Qiao & Luqi Huang & Hui Zhu & Jun Xie & Tian Zhou & Zhongshu Xiong & I-Hsin Su & Dayong Jin & Yu-Cheng Chen, 2024. "Single-cell laser emitting cytometry for label-free nucleolus fingerprinting," Nature Communications, Nature, vol. 15(1), pages 1-13, December.
    15. Marissa A. Weis & Stelios Papadopoulos & Laura Hansel & Timo Lüddecke & Brendan Celii & Paul G. Fahey & Eric Y. Wang & J. Alexander Bae & Agnes L. Bodor & Derrick Brittain & JoAnn Buchanan & Daniel J., 2025. "An unsupervised map of excitatory neuron dendritic morphology in the mouse visual cortex," Nature Communications, Nature, vol. 16(1), pages 1-15, December.
    16. Lucy Xia & Christy Lee & Jingyi Jessica Li, 2024. "Statistical method scDEED for detecting dubious 2D single-cell embeddings and optimizing t-SNE and UMAP hyperparameters," Nature Communications, Nature, vol. 15(1), pages 1-21, December.
    17. Lijun Cheng & Pratik Karkhanis & Birkan Gokbag & Yueze Liu & Lang Li, 2022. "DGCyTOF: Deep learning with graphic cluster visualization to predict cell types of single cell mass cytometry data," PLOS Computational Biology, Public Library of Science, vol. 18(4), pages 1-22, April.
    18. Parashar Dhapola & Johan Rodhe & Rasmus Olofzon & Thomas Bonald & Eva Erlandsson & Shamit Soneji & Göran Karlsson, 2022. "Scarf enables a highly memory-efficient analysis of large-scale single-cell genomics data," Nature Communications, Nature, vol. 13(1), pages 1-14, December.
    19. Wen Jiang & Eric J. Jaehnig & Yuxing Liao & Zhiao Shi & Tomer M. Yaron-Barir & Jared L. Johnson & Lewis C. Cantley & Bing Zhang, 2025. "Deciphering the dark cancer phosphoproteome using machine-learned co-regulation of phosphosites," Nature Communications, Nature, vol. 16(1), pages 1-17, December.
    20. Eva C. Freckmann & Emma Sandilands & Erin Cumming & Matthew Neilson & Alvaro Román-Fernández & Konstantina Nikolatou & Marisa Nacke & Tamsin R. M. Lannagan & Ann Hedley & David Strachan & Mark Salji &, 2022. "Traject3d allows label-free identification of distinct co-occurring phenotypes within 3D culture by live imaging," Nature Communications, Nature, vol. 13(1), pages 1-21, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0282171. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.