Author
Listed:
- Mina Adel Shokry Fahim
(Department of Geodesy and Cadastre, Environmental Engineering Faculty, Vilnius Gediminas Technical University, LT-10223 Vilnius, Lithuania)
- Jūratė Sužiedelytė Visockienė
(Department of Geodesy and Cadastre, Environmental Engineering Faculty, Vilnius Gediminas Technical University, LT-10223 Vilnius, Lithuania)
Abstract
Air pollution remains a major public-health concern, and exposure to particulate matter (PM), particularly PM 10 (with a diameter ≤ 10 µm), is associated with adverse respiratory and cardiovascular outcomes. Most research relies on a singular model for PM 10 surface estimation. This study is an assessment of a national-scale, daily PM 10 estimation framework for Lithuania (2019–2024), using a hybrid machine-learning method that combines Random Forest (RF) and extreme gradient boosting (XGBoost) algorithms. Hourly PM 10 observations were aggregated from 18 monitoring stations to obtain daily means and temporal means. The predictors integrated meteorological factors, such as temperature, wind, humidity, and precipitation, to determine satellite-based atmospheric composition from Sentinel-5P Tropospheric Monitoring Instruments (TROPOMI). Atmospheric components include nitrogen dioxide (NO 2 ), carbon monoxide (CO), sulfur dioxide (SO 2 ), ozone (O 3 ), formaldehyde (HCHO), and the absorbing aerosol index (AI). Moderate-Resolution Imaging Spectroradiometers (MODIS) were used to record land-surface temperature and static spatial descriptors, such as elevation, land cover, Normalized Difference Vegetation Index (NDVI), population, and road proximity. The dataset was partitioned temporally into training (70%), validation (20%), and testing (10%). The hybrid model achieved an improved accuracy, compared with single-model baselines, reaching a coefficient of determination (R 2 ) of 0.739 in validation and R 2 = 0.75 in the tested dataset. Mean absolute error (MAE) was 3.15 µg/m 3 , and root mean square error (RMSE) was 3.98 µg/m 3 . The results indicate a slight tendency to overestimate PM 10 concentrations at lower concentration levels. Feature-importance analysis revealed that short-term temporal persistence is the key to daily PM 10 prediction, while meteorological variables provide secondary contributions. Temporal evaluation, using consecutive two-year windows, revealed a consistent improvement in predictive performance from 2019–2020 to 2023–2024, while station-level analysis showed moderate-to-strong agreement between the predicted and observed PM 10 concentrations across monitoring stations, with R 2 ranging from 0.455 to 0.760. This provides decision-support capabilities for air-quality management, the evaluation of mitigation measures, and integration of air-pollution considerations into sustainable urban planning strategies assessing public-health protection.
Suggested Citation
Mina Adel Shokry Fahim & Jūratė Sužiedelytė Visockienė, 2026.
"An Assessment of the Multi-Input Spatiotemporal RF–XGBoost Hybrid Framework for PM 10 Estimation in Lithuania,"
Sustainability, MDPI, vol. 18(4), pages 1-20, February.
Handle:
RePEc:gam:jsusta:v:18:y:2026:i:4:p:2022-:d:1866293
Download full text from publisher
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jsusta:v:18:y:2026:i:4:p:2022-:d:1866293. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.