IDEAS home Printed from https://ideas.repec.org/a/gam/jsusta/v17y2025i12p5274-d1673993.html
   My bibliography  Save this article

Comparison of Selected Ensemble Supervised Learning Algorithms Used for Meteorological Normalisation of Particulate Matter (PM 10 )

Author

Listed:
  • Karolina Gora

    (Faculty of Geo-Data Science, Geodesy and Environmental Engineering, AGH University of Krakow, 30-059 Kraków, Poland)

  • Mateusz Rzeszutek

    (Faculty of Geo-Data Science, Geodesy and Environmental Engineering, AGH University of Krakow, 30-059 Kraków, Poland)

Abstract

Air pollution, particularly PM 10 particulate matter, poses significant health risks related to respiratory and cardiovascular diseases as well as cancer. Accurate identification of PM 10 reduction factors is therefore essential for developing effective sustainable development strategies. According to the current state of knowledge, machine learning methods are most frequently employed for this purpose due to their superior performance compared to classical statistical approaches. This study evaluated the performance of three machine learning algorithms—Decision Tree (CART), Random Forest, and Cubist Rule—in predicting PM 10 concentrations and estimating long-term trends following meteorological normalisation. The research focused on Tarnów, Poland (2010–2022), with comprehensive consideration of meteorological variability. The results demonstrated superior accuracy for the Random Forest and Cubist models (R 2 ~0.88–0.89, RMSE ~14 μg/m 3 ) compared to CART (RMSE 19.96 μg/m 3 ). Air temperature and boundary layer height emerged as the most significant predictive variables across all algorithms. The Cubist algorithm proved particularly effective in detecting the impact of policy interventions, making it valuable for air quality trend analysis. While the study confirmed a statistically significant annual decrease in PM 10 concentrations (0.83–1.03 μg/m 3 ), pollution levels still exceeded both the updated EU air quality standards from 2024 (Directive (EU) 2024/2881), which will come into force in 2030, and the more stringent WHO guidelines from 2021.

Suggested Citation

  • Karolina Gora & Mateusz Rzeszutek, 2025. "Comparison of Selected Ensemble Supervised Learning Algorithms Used for Meteorological Normalisation of Particulate Matter (PM 10 )," Sustainability, MDPI, vol. 17(12), pages 1-17, June.
  • Handle: RePEc:gam:jsusta:v:17:y:2025:i:12:p:5274-:d:1673993
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2071-1050/17/12/5274/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2071-1050/17/12/5274/
    Download Restriction: no
    ---><---

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jsusta:v:17:y:2025:i:12:p:5274-:d:1673993. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.