IDEAS home Printed from https://ideas.repec.org/a/plo/pcbi00/1013759.html
   My bibliography  Save this article

Randomized Spatial PCA (RASP): A computationally efficient method for dimensionality reduction of high-resolution spatial transcriptomics data

Author

Listed:
  • Ian K Gingerich
  • Brittany A Goods
  • H Robert Frost

Abstract

Spatial transcriptomics (ST) provides critical insights into the spatial organization of gene expression, enabling researchers to unravel the intricate relationship between cellular environments and biological function. Identifying spatial domains within tissues is key to understanding tissue architecture and mechanisms underlying development and disease progression. Here, we present Randomized Spatial PCA (RASP), a novel spatially-aware dimensionality reduction method for ST data. RASP is designed to be orders-of-magnitude faster than existing techniques, scale to datasets with 100,000+ locations, support flexible integration of non-transcriptomic covariates, and reconstruct de-noised, spatially-smoothed gene expression values. RASP itself is not a clustering or domain detection method; cell types and spatial regions are obtained by clustering the RASP PCs, and the effective cluster resolution depends on the K-nearest-neighbor (kNN) graph and a smoothing parameter β. It employs a randomized two-stage PCA framework and configurable spatial smoothing. RASP was compared to BASS, GraphST, SEDR, SpatialPCA, STAGATE, and CellCharter using diverse ST datasets (10x Visium, Stereo-Seq, MERFISH, 10x Xenium) on human and mouse tissues. In these benchmarks, RASP delivers comparable or superior accuracy in tissue-domain detection while achieving substantial improvements in computational speed. Its efficiency not only reduces runtime and resource requirements but also makes it practical to explore a broad range of spatial-smoothing parameters in a high-throughput fashion. By enabling rapid re-analysis under different parameter settings, RASP empowers users to fine-tune the balance between resolution and noise suppression on large, high-resolution subcellular datasets—a critical capability when investigating complex tissue architecture.Author summary: Spatial transcriptomics (ST) technologies enable unprecedented insights into the spatial organization of gene expression within tissues, yet analysis of these increasingly large and complex datasets remains computationally challenging. We present Randomized Spatial PCA (RASP), a novel, scalable, and computationally efficient dimensionality reduction method tailored for spatial transcriptomics data. Unlike existing methods, RASP can rapidly process datasets with hundreds of thousands of spatial locations and integrates non-transcriptomic covariates to improve biological signal recovery. By combining randomized linear algebra with spatial smoothing, RASP produces spatially informed principal components that support downstream clustering and spatial domain identification across diverse ST platforms, including high-throughput sequencing and in situ imaging technologies. Benchmarking on multiple real and simulated datasets demonstrates that RASP achieves comparable or superior accuracy to state-of-the-art methods while drastically reducing computational time and resource requirements. This efficiency empowers researchers to explore biological questions at multiple spatial resolutions and scales, facilitating robust, high-throughput spatial analysis critical for advancing our understanding of complex tissue architectures.

Suggested Citation

  • Ian K Gingerich & Brittany A Goods & H Robert Frost, 2025. "Randomized Spatial PCA (RASP): A computationally efficient method for dimensionality reduction of high-resolution spatial transcriptomics data," PLOS Computational Biology, Public Library of Science, vol. 21(12), pages 1-28, December.
  • Handle: RePEc:plo:pcbi00:1013759
    DOI: 10.1371/journal.pcbi.1013759
    as

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1013759
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1013759&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pcbi.1013759?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1013759. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.