IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v16y2025i1d10.1038_s41467-025-60822-1.html
   My bibliography  Save this article

GAUDI: interpretable multi-omics integration with UMAP embeddings and density-based clustering

Author

Listed:
  • Pol Castellano-Escuder

    (Duke University School of Medicine)

  • Derek K. Zachman

    (Duke University School of Medicine
    Duke University School of Medicine)

  • Kevin Han

    (Duke University School of Medicine)

  • Matthey D. Hirschey

    (Duke University School of Medicine
    Duke University School of Medicine
    Duke University School of Medicine
    Duke-National University of Singapore (NUS) Medical School)

Abstract

Integrating high-dimensional cellular multi-omics data is crucial for understanding various layers of biological control. Single ‘omic methods provide important insights, but often fall short in handling the complex relationships between genes, proteins, metabolites and beyond. Here, we present a novel, non-linear, and unsupervised method called GAUDI (Group Aggregation via UMAP Data Integration) that leverages independent UMAP embeddings for the concurrent analysis of multiple data types. GAUDI uncovers non-linear relationships among different omics data better than several state-of-the-art methods. This approach not only clusters samples by their multi-omic profiles but also identifies latent factors across each omics dataset, thereby enabling interpretation of the underlying features contributing to each cluster. Consequently, GAUDI facilitates more intuitive, interpretable visualizations to identify novel insights and potential biomarkers from a wide range of experimental designs.

Suggested Citation

  • Pol Castellano-Escuder & Derek K. Zachman & Kevin Han & Matthey D. Hirschey, 2025. "GAUDI: interpretable multi-omics integration with UMAP embeddings and density-based clustering," Nature Communications, Nature, vol. 16(1), pages 1-11, December.
  • Handle: RePEc:nat:natcom:v:16:y:2025:i:1:d:10.1038_s41467-025-60822-1
    DOI: 10.1038/s41467-025-60822-1
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-025-60822-1
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-025-60822-1?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Longqi Liu & Chuanyu Liu & Andrés Quintero & Liang Wu & Yue Yuan & Mingyue Wang & Mengnan Cheng & Lizhi Leng & Liqin Xu & Guoyi Dong & Rui Li & Yang Liu & Xiaoyu Wei & Jiangshan Xu & Xiaowei Chen & Ha, 2019. "Deconvolution of single-cell multi-omics layers reveals regulatory heterogeneity," Nature Communications, Nature, vol. 10(1), pages 1-10, December.
    2. Olivier Poirion & Xun Zhu & Travers Ching & Lana X. Garmire, 2018. "Using single nucleotide variations in single-cell RNA-seq to identify subpopulations and genotype-phenotype linkage," Nature Communications, Nature, vol. 9(1), pages 1-13, December.
    3. Michel Tenenhaus & Arthur Tenenhaus & Patrick J. F. Groenen, 2017. "Regularized Generalized Canonical Correlation Analysis: A Framework for Sequential Multiblock Component Methods," Psychometrika, Springer;The Psychometric Society, vol. 82(3), pages 737-777, September.
    4. Laura Cantini & Pooya Zakeri & Celine Hernandez & Aurelien Naldi & Denis Thieffry & Elisabeth Remy & Anaïs Baudot, 2021. "Benchmarking joint multi-omics dimensionality reduction approaches for the study of cancer," Nature Communications, Nature, vol. 12(1), pages 1-12, December.
    5. Nimrod Rappoport & Elad Chomsky & Takashi Nagano & Charlie Seibert & Yaniv Lubling & Yael Baran & Aviezer Lifshitz & Wing Leung & Zohar Mukamel & Ron Shamir & Peter Fraser & Amos Tanay, 2023. "Single cell Hi-C identifies plastic chromosome conformations underlying the gastrulation enhancer landscape," Nature Communications, Nature, vol. 14(1), pages 1-17, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Geert-Jan Huizing & Ina Maria Deutschmann & Gabriel Peyré & Laura Cantini, 2023. "Paired single-cell multi-omics data integration with Mowgli," Nature Communications, Nature, vol. 14(1), pages 1-13, December.
    2. Yichuan Cao & Xiamiao Zhao & Songming Tang & Qun Jiang & Sijie Li & Siyu Li & Shengquan Chen, 2024. "scButterfly: a versatile single-cell cross-modality translation method via dual-aligned variational autoencoders," Nature Communications, Nature, vol. 15(1), pages 1-17, December.
    3. Kudraszow, Nadia L. & Vahnovan, Alejandra V. & Ferrario, Julieta & Fasano, M. Victoria, 2025. "Robust generalized canonical correlation analysis based on scatter matrices," Computational Statistics & Data Analysis, Elsevier, vol. 206(C).
    4. Sera Şanlı, 2023. "Untapped potentials on a well‐endowed plate: A sustainable future catalogue for the harmony of renewable technologies with the water‐energy‐climate‐SDGs nexus," Natural Resources Forum, Blackwell Publishing, vol. 47(4), pages 672-698, November.
    5. Akshaya Ramakrishnan & Aikaterini Symeonidi & Patrick Hanel & Katharina T. Schmid & Maria L. Richter & Michael Schubert & Maria Colomé-Tatché, 2023. "epiAneufinder identifies copy number alterations from single-cell ATAC-seq data," Nature Communications, Nature, vol. 14(1), pages 1-10, December.
    6. Anna L. Tyler & J. Matthew Mahoney & Mark P. Keller & Candice N. Baker & Margaret Gaca & Anuj Srivastava & Isabela Gerdes Gyuricza & Madeleine J. Braun & Nadia A. Rosenthal & Alan D. Attie & Gary A. C, 2025. "Transcripts with high distal heritability mediate genetic effects on complex metabolic traits," Nature Communications, Nature, vol. 16(1), pages 1-21, December.
    7. Boyi Guo & Hannah D. Holscher & Loretta S. Auvil & Michael E. Welge & Colleen B. Bushell & Janet A. Novotny & David J. Baer & Nicholas A. Burd & Naiman A. Khan & Ruoqing Zhu, 2023. "Estimating Heterogeneous Treatment Effect on Multivariate Responses Using Random Forests," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 15(3), pages 545-561, December.
    8. Teemu J Rintala & Vittorio Fortino, 2024. "COPS: A novel platform for multi-omic disease subtype discovery via robust multi-objective evaluation of clustering algorithms," PLOS Computational Biology, Public Library of Science, vol. 20(8), pages 1-23, August.
    9. Kai Cao & Qiyu Gong & Yiguang Hong & Lin Wan, 2022. "A unified computational framework for single-cell data integration with optimal transport," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    10. Jules Samaran & Gabriel Peyré & Laura Cantini, 2024. "scConfluence: single-cell diagonal integration with regularized Inverse Optimal Transport on weakly connected features," Nature Communications, Nature, vol. 15(1), pages 1-20, December.
    11. Xiuli Du & Xiaohu Jiang & Jinguan Lin, 2023. "Multinomial Logistic Factor Regression for Multi-source Functional Block-wise Missing Data," Psychometrika, Springer;The Psychometric Society, vol. 88(3), pages 975-1001, September.
    12. Dybro Liengaard, Benjamin, 2024. "Measurement invariance testing in partial least squares structural equation modeling," Journal of Business Research, Elsevier, vol. 177(C).
    13. Heungsun Hwang & Gyeongcheol Cho, 2020. "Global Least Squares Path Modeling: A Full-Information Alternative to Partial Least Squares Path Modeling," Psychometrika, Springer;The Psychometric Society, vol. 85(4), pages 947-972, December.
    14. Adrià Fernández-Torras & Miquel Duran-Frigola & Martino Bertoni & Martina Locatelli & Patrick Aloy, 2022. "Integrating and formatting biomedical data as pre-calculated knowledge graph embeddings in the Bioteque," Nature Communications, Nature, vol. 13(1), pages 1-18, December.
    15. Zaoqu Liu & Yushuai Wu & Hui Xu & Minkai Wang & Siyuan Weng & Dongling Pei & Shuang Chen & WeiWei Wang & Jing Yan & Li Cui & Jingxian Duan & Yuanshen Zhao & Zilong Wang & Zeyu Ma & Ran Li & Wenchao Du, 2025. "Multimodal fusion of radio-pathology and proteogenomics identify integrated glioma subtypes with prognostic and therapeutic opportunities," Nature Communications, Nature, vol. 16(1), pages 1-18, December.
    16. Aaron Wing Cheung Kwok & Chen Qiao & Rongting Huang & Mai-Har Sham & Joshua W. K. Ho & Yuanhua Huang, 2022. "MQuad enables clonal substructure discovery using single cell mitochondrial variants," Nature Communications, Nature, vol. 13(1), pages 1-10, December.
    17. Fangfang Yan & Akiko Suzuki & Chihiro Iwaya & Guangsheng Pei & Xian Chen & Hiroki Yoshioka & Meifang Yu & Lukas M. Simon & Junichi Iwata & Zhongming Zhao, 2024. "Single-cell multiomics decodes regulatory programs for mouse secondary palate development," Nature Communications, Nature, vol. 15(1), pages 1-17, December.
    18. Mohamed Hanafi & Zouhair El Hadri & Abderrahim Sahli & Pasquale Dolce, 2022. "Overcoming convergence problems in PLS path modelling," Computational Statistics, Springer, vol. 37(5), pages 2437-2470, November.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:16:y:2025:i:1:d:10.1038_s41467-025-60822-1. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.