
Efficient and precise single-cell reference atlas mapping with Symphony

Author

Listed:
  • Joyce B. Kang

    (Center for Data Sciences, Brigham and Women’s Hospital
    Brigham and Women’s Hospital and Harvard Medical School
    Harvard Medical School)

  • Aparna Nathan

    (Center for Data Sciences, Brigham and Women’s Hospital
    Brigham and Women’s Hospital and Harvard Medical School
    Harvard Medical School)

  • Kathryn Weinand

    (Center for Data Sciences, Brigham and Women’s Hospital
    Brigham and Women’s Hospital and Harvard Medical School
    Harvard Medical School)

  • Fan Zhang

    (Center for Data Sciences, Brigham and Women’s Hospital
    Brigham and Women’s Hospital and Harvard Medical School
    Harvard Medical School)

  • Nghia Millard

    (Center for Data Sciences, Brigham and Women’s Hospital
    Brigham and Women’s Hospital and Harvard Medical School
    Harvard Medical School)

  • Laurie Rumker

    (Center for Data Sciences, Brigham and Women’s Hospital
    Brigham and Women’s Hospital and Harvard Medical School
    Harvard Medical School)

  • D. Branch Moody

    (Brigham and Women’s Hospital and Harvard Medical School)

  • Ilya Korsunsky

    (Center for Data Sciences, Brigham and Women’s Hospital
    Brigham and Women’s Hospital and Harvard Medical School
    Harvard Medical School)

  • Soumya Raychaudhuri

    (Center for Data Sciences, Brigham and Women’s Hospital
    Brigham and Women’s Hospital and Harvard Medical School
    Harvard Medical School)

Abstract

Recent advances in single-cell technologies and integration algorithms make it possible to construct comprehensive reference atlases encompassing many donors, studies, disease states, and sequencing platforms. Much like mapping sequencing reads to a reference genome, it is essential to be able to map query cells onto complex, multimillion-cell reference atlases to rapidly identify relevant cell states and phenotypes. We present Symphony (https://github.com/immunogenomics/symphony), an algorithm for building large-scale, integrated reference atlases in a convenient, portable format that enables efficient query mapping within seconds. Symphony localizes query cells within a stable low-dimensional reference embedding, facilitating reproducible downstream transfer of reference-defined annotations to the query. We demonstrate the power of Symphony in multiple real-world datasets, including (1) mapping a multi-donor, multi-species query to predict pancreatic cell types, (2) localizing query cells along a developmental trajectory of fetal liver hematopoiesis, and (3) inferring surface protein expression with a multimodal CITE-seq atlas of memory T cells.

Suggested Citation

  • Joyce B. Kang & Aparna Nathan & Kathryn Weinand & Fan Zhang & Nghia Millard & Laurie Rumker & D. Branch Moody & Ilya Korsunsky & Soumya Raychaudhuri, 2021. "Efficient and precise single-cell reference atlas mapping with Symphony," Nature Communications, Nature, vol. 12(1), pages 1-21, December.
  • Handle: RePEc:nat:natcom:v:12:y:2021:i:1:d:10.1038_s41467-021-25957-x
    DOI: 10.1038/s41467-021-25957-x

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-021-25957-x
    File Function: Abstract
    Download Restriction: no


    References listed on IDEAS

    1. Kevin Wei & Ilya Korsunsky & Jennifer L. Marshall & Anqi Gao & Gerald F. M. Watts & Triin Major & Adam P. Croft & Jordan Watts & Philip E. Blazar & Jeffrey K. Lange & Thomas S. Thornhill & Andrew File, 2020. "Notch signalling drives synovial fibroblast identity and arthritis pathology," Nature, Nature, vol. 582(7811), pages 259-264, June.
    2. Junyue Cao & Malte Spielmann & Xiaojie Qiu & Xingfan Huang & Daniel M. Ibrahim & Andrew J. Hill & Fan Zhang & Stefan Mundlos & Lena Christiansen & Frank J. Steemers & Cole Trapnell & Jay Shendure, 2019. "The single-cell transcriptional landscape of mammalian organogenesis," Nature, Nature, vol. 566(7745), pages 496-502, February.
    3. Massimo Andreatta & Jesus Corria-Osorio & Sören Müller & Rafael Cubas & George Coukos & Santiago J. Carmona, 2021. "Interpretation of T cell states from single-cell transcriptomics data using reference atlases," Nature Communications, Nature, vol. 12(1), pages 1-19, December.
    4. Dorin-Mirel Popescu & Rachel A. Botting & Emily Stephenson & Kile Green & Simone Webb & Laura Jardine & Emily F. Calderbank & Krzysztof Polanski & Issac Goh & Mirjana Efremova & Meghan Acres & Daniel , 2019. "Decoding human fetal liver haematopoiesis," Nature, Nature, vol. 574(7778), pages 365-371, October.
    5. Orit Rozenblatt-Rosen & Michael J. T. Stubbington & Aviv Regev & Sarah A. Teichmann, 2017. "The Human Cell Atlas: from vision to reality," Nature, Nature, vol. 550(7677), pages 451-453, October.
    6. Friedman, Jerome H. & Hastie, Trevor & Tibshirani, Rob, 2010. "Regularization Paths for Generalized Linear Models via Coordinate Descent," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 33(i01).
    7. Xiaoping Han & Ziming Zhou & Lijiang Fei & Huiyu Sun & Renying Wang & Yao Chen & Haide Chen & Jingjing Wang & Huanna Tang & Wenhao Ge & Yincong Zhou & Fang Ye & Mengmeng Jiang & Junqing Wu & Yanyu Xia, 2020. "Construction of a human cell landscape at single-cell level," Nature, Nature, vol. 581(7808), pages 303-309, May.
    8. Zhi-Jie Cao & Lin Wei & Shen Lu & De-Chang Yang & Ge Gao, 2020. "Searching large-scale scRNA-seq databases via unbiased cell embedding with Cell BLAST," Nature Communications, Nature, vol. 11(1), pages 1-13, December.

    Citations


    Cited by:

    1. Francisco X. Galdos & Sidra Xu & William R. Goodyer & Lauren Duan & Yuhsin V. Huang & Soah Lee & Han Zhu & Carissa Lee & Nicholas Wei & Daniel Lee & Sean M. Wu, 2022. "devCellPy is a machine learning-enabled pipeline for automated annotation of complex multilayered single-cell transcriptomic data," Nature Communications, Nature, vol. 13(1), pages 1-20, December.
    2. Shravan Leonard-Murali & Chetana Bhaskarla & Ghanshyam S. Yadav & Sudeep K. Maurya & Chenna R. Galiveti & Joshua A. Tobin & Rachel J. Kann & Eishan Ashwat & Patrick S. Murphy & Anish B. Chakka & Visha, 2024. "Uveal melanoma immunogenomics predict immunotherapy resistance and susceptibility," Nature Communications, Nature, vol. 15(1), pages 1-17, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Aiko Sekita & Hiroshi Kawasaki & Ayano Fukushima-Nomura & Kiyoshi Yashiro & Keiji Tanese & Susumu Toshima & Koichi Ashizaki & Tomohiro Miyai & Junshi Yazaki & Atsuo Kobayashi & Shinichi Namba & Tatsuh, 2023. "Multifaceted analysis of cross-tissue transcriptomes reveals phenotype–endotype associations in atopic dermatitis," Nature Communications, Nature, vol. 14(1), pages 1-16, December.
    2. Kim Vanuytsel & Carlos Villacorta-Martin & Jonathan Lindstrom-Vautrin & Zhe Wang & Wilfredo F. Garcia-Beltran & Vladimir Vrbanac & Dylan Parsons & Evan C. Lam & Taylor M. Matte & Todd W. Dowrey & Sara, 2022. "Multi-modal profiling of human fetal liver hematopoietic stem cells reveals the molecular signature of engraftment," Nature Communications, Nature, vol. 13(1), pages 1-13, December.
    3. Yuan Liao & Lifeng Ma & Qile Guo & Weigao E & Xing Fang & Lei Yang & Fanwei Ruan & Jingjing Wang & Peijing Zhang & Zhongyi Sun & Haide Chen & Zhongliang Lin & Xueyi Wang & Xinru Wang & Huiyu Sun & Xiu, 2022. "Cell landscape of larval and adult Xenopus laevis at single-cell resolution," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    4. Suijuan Zhong & Mengdi Wang & Luwei Huang & Youqiao Chen & Yuxin Ge & Jiyao Zhang & Yingchao Shi & Hao Dong & Xin Zhou & Bosong Wang & Tian Lu & Xiaoxi Jing & Yufeng Lu & Junjing Zhang & Xiaoqun Wang , 2023. "Single-cell epigenomics and spatiotemporal transcriptomics reveal human cerebellar development," Nature Communications, Nature, vol. 14(1), pages 1-17, December.
    5. Ajita Shree & Musale Krushna Pavan & Hamim Zafar, 2023. "scDREAMER for atlas-level integration of single-cell datasets using deep generative model paired with adversarial classifier," Nature Communications, Nature, vol. 14(1), pages 1-19, December.
    6. Monika Graf & Marta Interlandi & Natalia Moreno & Dörthe Holdhof & Carolin Göbel & Viktoria Melcher & Julius Mertins & Thomas K. Albert & Dennis Kastrati & Amelie Alfert & Till Holsten & Flavia de Far, 2022. "Single-cell transcriptomics identifies potential cells of origin of MYC rhabdoid tumors," Nature Communications, Nature, vol. 13(1), pages 1-19, December.
    7. Mírian Romitti & Adrien Tourneur & Barbara Faria da Fonseca & Gilles Doumont & Pierre Gillotay & Xiao-Hui Liao & Sema Elif Eski & Gaetan Simaeys & Laura Chomette & Helene Lasolle & Olivier Monestier &, 2022. "Transplantable human thyroid organoids generated from embryonic stem cells to rescue hypothyroidism," Nature Communications, Nature, vol. 13(1), pages 1-16, December.
    8. Lei Xiong & Kang Tian & Yuzhe Li & Weixi Ning & Xin Gao & Qiangfeng Cliff Zhang, 2022. "Online single-cell data integration through projecting heterogeneous datasets into a common cell-embedding space," Nature Communications, Nature, vol. 13(1), pages 1-17, December.
    9. Fang Ye & Guodong Zhang & Weigao E. & Haide Chen & Chengxuan Yu & Lei Yang & Yuting Fu & Jiaqi Li & Sulei Fu & Zhongyi Sun & Lijiang Fei & Qile Guo & Jingjing Wang & Yanyu Xiao & Xinru Wang & Peijing , 2022. "Construction of the axolotl cell landscape using combinatorial hybridization sequencing at single-cell resolution," Nature Communications, Nature, vol. 13(1), pages 1-18, December.
    10. Ziye Xu & Tianyu Zhang & Hongyu Chen & Yuyi Zhu & Yuexiao Lv & Shunji Zhang & Jiaye Chen & Haide Chen & Lili Yang & Weiqin Jiang & Shengyu Ni & Fangru Lu & Zhaolun Wang & Hao Yang & Ling Dong & Feng C, 2023. "High-throughput single nucleus total RNA sequencing of formalin-fixed paraffin-embedded tissues by snRandom-seq," Nature Communications, Nature, vol. 14(1), pages 1-12, December.
    11. Xiaoyu Song & Jiayi Ji & Joseph H. Rothstein & Stacey E. Alexeeff & Lori C. Sakoda & Adriana Sistig & Ninah Achacoso & Eric Jorgenson & Alice S. Whittemore & Robert J. Klein & Laurel A. Habel & Pei Wa, 2023. "MiXcan: a framework for cell-type-aware transcriptome-wide association studies with an application to breast cancer," Nature Communications, Nature, vol. 14(1), pages 1-15, December.
    12. Tutz, Gerhard & Pößnecker, Wolfgang & Uhlmann, Lorenz, 2015. "Variable selection in general multinomial logit models," Computational Statistics & Data Analysis, Elsevier, vol. 82(C), pages 207-222.
    13. Rui Wang & Naihua Xiu & Kim-Chuan Toh, 2021. "Subspace quadratic regularization method for group sparse multinomial logistic regression," Computational Optimization and Applications, Springer, vol. 79(3), pages 531-559, July.
    14. Mkhadri, Abdallah & Ouhourane, Mohamed, 2013. "An extended variable inclusion and shrinkage algorithm for correlated variables," Computational Statistics & Data Analysis, Elsevier, vol. 57(1), pages 631-644.
    15. Chen, Le-Yu & Lee, Sokbae, 2018. "Best subset binary prediction," Journal of Econometrics, Elsevier, vol. 206(1), pages 39-56.
    16. Sung Jae Jun & Sokbae Lee, 2020. "Causal Inference under Outcome-Based Sampling with Monotonicity Assumptions," Papers 2004.08318, arXiv.org, revised Oct 2023.
    17. Christoph Ziegenhain & Rickard Sandberg, 2021. "BAMboozle removes genetic variation from human sequence data for open data sharing," Nature Communications, Nature, vol. 12(1), pages 1-10, December.
    18. Xiangwei Li & Thomas Delerue & Ben Schöttker & Bernd Holleczek & Eva Grill & Annette Peters & Melanie Waldenberger & Barbara Thorand & Hermann Brenner, 2022. "Derivation and validation of an epigenetic frailty risk score in population-based cohorts of older adults," Nature Communications, Nature, vol. 13(1), pages 1-11, December.
    19. Christopher J Greenwood & George J Youssef & Primrose Letcher & Jacqui A Macdonald & Lauryn J Hagg & Ann Sanson & Jenn Mcintosh & Delyse M Hutchinson & John W Toumbourou & Matthew Fuller-Tyszkiewicz &, 2020. "A comparison of penalised regression methods for informing the selection of predictive markers," PLOS ONE, Public Library of Science, vol. 15(11), pages 1-14, November.
    20. Heng Chen & Daniel F. Heitjan, 2022. "Analysis of local sensitivity to nonignorability with missing outcomes and predictors," Biometrics, The International Biometric Society, vol. 78(4), pages 1342-1352, December.
