IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v13y2022i1d10.1038_s41467-022-30118-9.html
   My bibliography  Save this article

SIMILE enables alignment of tandem mass spectra with statistical significance

Author

Listed:
  • Daniel G. C. Treen

    (Environmental Genomics and Systems Biology Division & The Joint Genome Institute Lawrence Berkeley National Laboratory)

  • Mingxun Wang

    (University of California, San Diego)

  • Shipei Xing

    (University of British Columbia)

  • Katherine B. Louie

    (Environmental Genomics and Systems Biology Division & The Joint Genome Institute Lawrence Berkeley National Laboratory)

  • Tao Huan

    (University of British Columbia)

  • Pieter C. Dorrestein

    (University of California, San Diego)

  • Trent R. Northen

    (Environmental Genomics and Systems Biology Division & The Joint Genome Institute Lawrence Berkeley National Laboratory)

  • Benjamin P. Bowen

    (Environmental Genomics and Systems Biology Division & The Joint Genome Institute Lawrence Berkeley National Laboratory)

Abstract

Interrelating small molecules according to their aligned fragmentation spectra is central to tandem mass spectrometry-based untargeted metabolomics. Current alignment algorithms do not provide statistical significance and compounds that have multiple delocalized structural differences and therefore often fail to have their fragment ions aligned. Here we align fragmentation spectra with both statistical significance and allowance for multiple chemical differences using Significant Interrelation of MS/MS Ions via Laplacian Embedding (SIMILE). SIMILE yields spectral alignment inferred structural connections in molecular networks that are not found with cosine-based scoring algorithms. In addition, it is now possible to rank spectral alignments based on p-values in the exploration of structural relationships between compounds and enhance the chemical connectivity that can be obtained with molecular networking.

Suggested Citation

  • Daniel G. C. Treen & Mingxun Wang & Shipei Xing & Katherine B. Louie & Tao Huan & Pieter C. Dorrestein & Trent R. Northen & Benjamin P. Bowen, 2022. "SIMILE enables alignment of tandem mass spectra with statistical significance," Nature Communications, Nature, vol. 13(1), pages 1-10, December.
  • Handle: RePEc:nat:natcom:v:13:y:2022:i:1:d:10.1038_s41467-022-30118-9
    DOI: 10.1038/s41467-022-30118-9
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-022-30118-9
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-022-30118-9?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Florian Huber & Lars Ridder & Stefan Verhoeven & Jurriaan H Spaaks & Faruk Diblen & Simon Rogers & Justin J J van der Hooft, 2021. "Spec2Vec: Improved mass spectral similarity scoring through learning of structural relationships," PLOS Computational Biology, Public Library of Science, vol. 17(2), pages 1-18, February.
    2. Liu Cao & Mustafa Guler & Azat Tagirdzhanov & Yi-Yuan Lee & Alexey Gurevich & Hosein Mohimani, 2021. "MolDiscovery: learning mass spectrometry fragmentation of small molecules," Nature Communications, Nature, vol. 12(1), pages 1-13, December.
    3. Kerstin Scheubert & Franziska Hufsky & Daniel Petras & Mingxun Wang & Louis-Félix Nothias & Kai Dührkop & Nuno Bandeira & Pieter C. Dorrestein & Sebastian Böcker, 2017. "Significance estimation for large scale metabolomics annotations by spectral matching," Nature Communications, Nature, vol. 8(1), pages 1-10, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Wout Bittremieux & Nicole E. Avalon & Sydney P. Thomas & Sarvar A. Kakhkhorov & Alexander A. Aksenov & Paulo Wender P. Gomes & Christine M. Aceves & Andrés Mauricio Caraballo-Rodríguez & Julia M. Gaug, 2023. "Open access repository-scale propagated nearest neighbor suspect spectral library for untargeted metabolomics," Nature Communications, Nature, vol. 14(1), pages 1-15, December.
    2. Nicholas J. Morehouse & Trevor N. Clark & Emily J. McMann & Jeffrey A. Santen & F. P. Jake Haeckl & Christopher A. Gray & Roger G. Linington, 2023. "Annotation of natural product compound families using molecular networking topology and structural similarity fingerprinting," Nature Communications, Nature, vol. 14(1), pages 1-10, December.
    3. Qiong Yang & Hongchao Ji & Zhenbo Xu & Yiming Li & Pingshan Wang & Jinyu Sun & Xiaqiong Fan & Hailiang Zhang & Hongmei Lu & Zhimin Zhang, 2023. "Ultra-fast and accurate electron ionization mass spectrum matching for compound identification with million-scale in-silico library," Nature Communications, Nature, vol. 14(1), pages 1-11, December.
    4. Zhiwei Zhou & Mingdu Luo & Haosong Zhang & Yandong Yin & Yuping Cai & Zheng-Jiang Zhu, 2022. "Metabolite annotation from knowns to unknowns through knowledge-guided multi-layer metabolic networking," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    5. Oliver Alka & Premy Shanthamoorthy & Michael Witting & Karin Kleigrewe & Oliver Kohlbacher & Hannes L. Röst, 2022. "DIAMetAlyzer allows automated false-discovery rate-controlled analysis for data-independent acquisition in metabolomics," Nature Communications, Nature, vol. 13(1), pages 1-9, December.
    6. Niek F. de Jonge & Joris J. R. Louwen & Elena Chekmeneva & Stephane Camuzeaux & Femke J. Vermeir & Robert S. Jansen & Florian Huber & Justin J. J. van der Hooft, 2023. "MS2Query: reliable and scalable MS2 mass spectra-based analogue search," Nature Communications, Nature, vol. 14(1), pages 1-12, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:13:y:2022:i:1:d:10.1038_s41467-022-30118-9. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.