IDEAS home Printed from https://ideas.repec.org/a/gam/jstats/v8y2025i3p57-d1699184.html
   My bibliography  Save this article

Well Begun Is Half Done: The Impact of Pre-Processing in MALDI Mass Spectrometry Imaging Analysis Applied to a Case Study of Thyroid Nodules

Author

Listed:
  • Giulia Capitoli

    (Bicocca Bioinformatics Biostatistics and Bioimaging B4 Center, Department of Medicine and Surgery, University of Milano-Bicocca, 20900 Monza, Italy
    Biostatistics and Clinical Epidemiology, Fondazione IRCCS San Gerardo Dei Tintori, 20900 Monza, Italy
    These authors contributed equally to this work.)

  • Kirsten C. J. van Abeelen

    (Radboud University Medical Center, Department of Internal Medicine, 6525 AJ Nijmegen, The Netherlands
    These authors contributed equally to this work.)

  • Isabella Piga

    (Proteomics and Metabolomics Unit, Department of Medicine and Surgery, University of Milano-Bicocca, 20900 Monza, Italy)

  • Vincenzo L’Imperio

    (Pathology Unit, Fondazione IRCCS San Gerardo dei Tintori, Department of Medicine and Surgery, University of Milano-Bicocca, 20900 Monza, Italy)

  • Marco S. Nobile

    (Department of Environmental Sciences, Informatics and Statistics, Ca’ Foscari University of Venice, 30100 Venice, Italy)

  • Daniela Besozzi

    (Department of Informatics, Systems, and Communication, University of Milano-Bicocca, 20126 Milan, Italy)

  • Stefania Galimberti

    (Bicocca Bioinformatics Biostatistics and Bioimaging B4 Center, Department of Medicine and Surgery, University of Milano-Bicocca, 20900 Monza, Italy
    Biostatistics and Clinical Epidemiology, Fondazione IRCCS San Gerardo Dei Tintori, 20900 Monza, Italy)

Abstract

The discovery of proteomic biomarkers in cancer research can be effectively performed in situ by exploiting Matrix-Assisted Laser Desorption Ionization (MALDI) Mass Spectrometry Imaging (MSI). However, due to experimental limitations, the spectra extracted by MALDI-MSI can be noisy, so pre-processing steps are generally needed to reduce the instrumental and analytical variability. Thus far, the importance and the effect of standard pre-processing methods, as well as their combinations and parameter settings, have not been extensively investigated in proteomics applications. In this work, we present a systematic study of 15 combinations of pre-processing steps—including baseline, smoothing, normalization, and peak alignment—for a real-data classification task on MALDI-MSI data measured from fine-needle aspirates biopsies of thyroid nodules. The influence of each combination was assessed by analyzing the feature extraction, pixel-by-pixel classification probabilities, and LASSO classification performance. Our results highlight the necessity of fine-tuning a pre-processing pipeline, especially for the reliable transfer of molecular diagnostic signatures in clinical practice. We outline some recommendations on the selection of pre-processing steps, together with filter levels and alignment methods, according to the mass-to-charge range and heterogeneity of data.

Suggested Citation

  • Giulia Capitoli & Kirsten C. J. van Abeelen & Isabella Piga & Vincenzo L’Imperio & Marco S. Nobile & Daniela Besozzi & Stefania Galimberti, 2025. "Well Begun Is Half Done: The Impact of Pre-Processing in MALDI Mass Spectrometry Imaging Analysis Applied to a Case Study of Thyroid Nodules," Stats, MDPI, vol. 8(3), pages 1-14, July.
  • Handle: RePEc:gam:jstats:v:8:y:2025:i:3:p:57-:d:1699184
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2571-905X/8/3/57/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2571-905X/8/3/57/
    Download Restriction: no
    ---><---

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jstats:v:8:y:2025:i:3:p:57-:d:1699184. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.