IDEAS home Printed from https://ideas.repec.org/a/plo/pcbi00/1011324.html
   My bibliography  Save this article

Curated single cell multimodal landmark datasets for R/Bioconductor

Author

Listed:
  • Kelly B Eckenrode
  • Dario Righelli
  • Marcel Ramos
  • Ricard Argelaguet
  • Christophe Vanderaa
  • Ludwig Geistlinger
  • Aedin C Culhane
  • Laurent Gatto
  • Vincent Carey
  • Martin Morgan
  • Davide Risso
  • Levi Waldron

Abstract

Background: The majority of high-throughput single-cell molecular profiling methods quantify RNA expression; however, recent multimodal profiling methods add simultaneous measurement of genomic, proteomic, epigenetic, and/or spatial information on the same cells. The development of new statistical and computational methods in Bioconductor for such data will be facilitated by easy availability of landmark datasets using standard data classes. Results: We collected, processed, and packaged publicly available landmark datasets from important single-cell multimodal protocols, including CITE-Seq, ECCITE-Seq, SCoPE2, scNMT, 10X Multiome, seqFISH, and G&T. We integrate data modalities via the MultiAssayExperiment Bioconductor class, document and re-distribute datasets as the SingleCellMultiModal package in Bioconductor’s Cloud-based ExperimentHub. The result is single-command actualization of landmark datasets from seven single-cell multimodal data generation technologies, without need for further data processing or wrangling in order to analyze and develop methods within Bioconductor’s ecosystem of hundreds of packages for single-cell and multimodal data. Conclusions: We provide two examples of integrative analyses that are greatly simplified by SingleCellMultiModal. The package will facilitate development of bioinformatic and statistical methods in Bioconductor to meet the challenges of integrating molecular layers and analyzing phenotypic outputs including cell differentiation, activity, and disease. Author summary: Experimental data packages that provide landmark datasets have historically played an important role in the development of new statistical methods in Bioconductor by lowering the barrier of access to relevant data, providing a common testing ground for software development and benchmarking, and encouraging interoperability around common data structures. In this manuscript, we review major classes of technologies for collecting multimodal data including genomics, transcriptomics, epigenetics, proteomics, and spatial information at the level of single cells. We present the SingleCellMultiModal R/Bioconductor package that provides single-command access to landmark datasets from seven different technologies, storing datasets using HDF5 and sparse arrays for memory efficiency and integrating data modalities via the MultiAssayExperiment class. We demonstrate two integrative analyses that are greatly simplified by SingleCellMultiModal. The package facilitates development and benchmarking of bioinformatic and statistical methods to integrate molecular layers at the level of single cells with phenotypic outputs including cell differentiation, activity, and disease, within Bioconductor’s ecosystem of hundreds of packages for single-cell and multimodal data.

Suggested Citation

  • Kelly B Eckenrode & Dario Righelli & Marcel Ramos & Ricard Argelaguet & Christophe Vanderaa & Ludwig Geistlinger & Aedin C Culhane & Laurent Gatto & Vincent Carey & Martin Morgan & Davide Risso & Levi, 2023. "Curated single cell multimodal landmark datasets for R/Bioconductor," PLOS Computational Biology, Public Library of Science, vol. 19(8), pages 1-24, August.
  • Handle: RePEc:plo:pcbi00:1011324
    DOI: 10.1371/journal.pcbi.1011324
    as

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1011324
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1011324&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pcbi.1011324?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Stephen J. Clark & Ricard Argelaguet & Chantriolnt-Andreas Kapourani & Thomas M. Stubbs & Heather J. Lee & Celia Alda-Catalinas & Felix Krueger & Guido Sanguinetti & Gavin Kelsey & John C. Marioni & O, 2018. "scNMT-seq enables joint profiling of chromatin accessibility DNA methylation and transcription in single cells," Nature Communications, Nature, vol. 9(1), pages 1-9, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Caglar Berkel & Ercan Cacan, 2019. "Single-Cell Epigenomics In Cancer Research," Biomedical Journal of Scientific & Technical Research, Biomedical Research Network+, LLC, vol. 21(3), pages 15966-15973, September.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1011324. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.