IDEAS home Printed from https://ideas.repec.org/a/spr/nathaz/v121y2025i6d10.1007_s11069-024-07077-z.html
   My bibliography  Save this article

Optimal machine learning techniques for meteorological modeling of $${\textrm{PM}}_{2.5}$$ PM 2.5 concentration in five major polluted cities of South-East Asia

Author

Listed:
  • Sedra Shafi

    (University of Naples Federico II, Complesso Universitario di Monte S. Angelo)

  • Nicola Scafetta

    (University of Naples Federico II, Complesso Universitario di Monte S. Angelo)

Abstract

The rapid decline in air quality across Southeast and Western Pacific Asia is occurring at an accelerated pace due to population growth and industrial development. The region’s Meteorological factors, including the monsoon seasonality, exert a significant influence on air pollution levels, particularly $${\textrm{PM}}_{2.5}$$ PM 2.5 concentrations. In this study, we employ a statistical modeling approach to derive daily $${\textrm{PM}}_{2.5}$$ PM 2.5 levels from meteorological parameters in five major polluted cities: Lahore (Pakistan), Delhi (India), Dhaka (Bangladesh), Hanoi (Vietnam), and Shanghai (China). The incorporated meteorological parameters are wind speed, barometric pressure, temperature, and rainfall, which are known to affect air pollution levels from 2020 to 2022. The statistical modeling was based on the comparative analysis of 35 different machine learning (ML) regression techniques with the purpose of selecting the algorithms most efficient for reconstructing and predicting $${\textrm{PM}}_{2.5}$$ PM 2.5 levels from meteorological variables alone. Specifically, each ML regression model was trained to reconstruct daily $${\textrm{PM}}_{2.5}$$ PM 2.5 levels in 2020–2021, and then used to reconstruct both missing daily $${\textrm{PM}}_{2.5}$$ PM 2.5 levels in 2020–2021 and forecast the whole of 2022 using only the 2022 meteorological records. The results indicated that most of the daily and seasonal variability in daily $${\textrm{PM}}_{2.5}$$ PM 2.5 levels could be reconstructed from meteorological conditions. However, the performance of the various ML models (as assessed by Root Mean Square Error tests) exhibited considerable variability. Among the tested models, the Ensembles Boosted Tree ML method demonstrated optimal efficiency during the training period (the first 2 years, 2020 and 2021) and it also was highly efficient in predicting the third year (2022) using only meteorological data. Additionaly, the Trilayer Neural Network ML method was found the most effective at reconstructing the data after 3 years of training and may therefore be preferred to fill in short periods of missing $${\textrm{PM}}_{2.5}$$ PM 2.5 data. In contrast, our comparative analyses showed that the traditional multi-linear regression models under-performed in both constructing and predicting $${\textrm{PM}}_{2.5}$$ PM 2.5 data. This study demonstrates the necessity and usefulness of assessing multiple ML regression methodologies for selecting which ones better perform for reconstructing the data of interest (in our case $${\textrm{PM}}_{2.5}$$ PM 2.5 records) from their hypothesized constructors (in our case meteorological parameters). In particular, this study has highlighted the utility of using ML regression techniques for forecasting air quality and reconstructing missing pollution data, which is crucial for policy-making across South-East and Western-Pacific Asia regions, where only limited pollution monitoring infrastructure are available.

Suggested Citation

  • Sedra Shafi & Nicola Scafetta, 2025. "Optimal machine learning techniques for meteorological modeling of $${\textrm{PM}}_{2.5}$$ PM 2.5 concentration in five major polluted cities of South-East Asia," Natural Hazards: Journal of the International Society for the Prevention and Mitigation of Natural Hazards, Springer;International Society for the Prevention and Mitigation of Natural Hazards, vol. 121(6), pages 6981-7025, April.
  • Handle: RePEc:spr:nathaz:v:121:y:2025:i:6:d:10.1007_s11069-024-07077-z
    DOI: 10.1007/s11069-024-07077-z
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11069-024-07077-z
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11069-024-07077-z?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Chang Cao & Xuhui Lee & Shoudong Liu & Natalie Schultz & Wei Xiao & Mi Zhang & Lei Zhao, 2016. "Urban heat islands in China enhanced by haze pollution," Nature Communications, Nature, vol. 7(1), pages 1-7, November.
    2. Wenju Cai & Ke Li & Hong Liao & Huijun Wang & Lixin Wu, 2017. "Weather conditions conducive to Beijing severe haze more frequent under climate change," Nature Climate Change, Nature, vol. 7(4), pages 257-262, April.
    3. Dimitrios Katsanos & Adrianos Retalis & Filippos Tymvios & Silas Michaelides, 2016. "Analysis of precipitation extremes based on satellite (CHIRPS) and in situ dataset over Cyprus," Natural Hazards: Journal of the International Society for the Prevention and Mitigation of Natural Hazards, Springer;International Society for the Prevention and Mitigation of Natural Hazards, vol. 83(1), pages 53-63, October.
    4. Jianbao Liu & Kathleen M. Rühland & Jianhui Chen & Yangyang Xu & Shengqian Chen & Qiaomei Chen & Wei Huang & Qinghai Xu & Fahu Chen & John P. Smol, 2017. "Aerosol-weakened summer monsoons decrease lake fertilization on the Chinese Loess Plateau," Nature Climate Change, Nature, vol. 7(3), pages 190-194, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Qianqian Yang & Qiangqiang Yuan & Tongwen Li & Huanfeng Shen & Liangpei Zhang, 2017. "The Relationships between PM 2.5 and Meteorological Factors in China: Seasonal and Regional Variations," IJERPH, MDPI, vol. 14(12), pages 1-19, December.
    2. Renfeng Ma & Congcong Wang & Yixia Jin & Xiaojing Zhou, 2019. "Estimating the Effects of Economic Agglomeration on Haze Pollution in Yangtze River Delta China Using an Econometric Analysis," Sustainability, MDPI, vol. 11(7), pages 1-19, March.
    3. Liu, Cuiping & Zhang, Feng & Miao, Lijuan & Lei, Yadong & Yang, Quan, 2020. "Future haze events in Beijing, China: When climate warms by 1.5 and 2.0°C," EconStor Open Access Articles and Book Chapters, ZBW - Leibniz Information Centre for Economics, vol. 40(8), pages 3689-3700.
    4. Yue Tui & Jiaxin Qiu & Ju Wang & Chunsheng Fang, 2021. "Analysis of Spatio-Temporal Variation Characteristics of Main Air Pollutants in Shijiazhuang City," Sustainability, MDPI, vol. 13(2), pages 1-17, January.
    5. Sinan Demir & İbrahim Dursun, 2024. "Assessment of pre- and post-fire erosion using the RUSLE equation in a watershed affected by the forest fire on Google Earth Engine: the study of Manavgat River Basin," Natural Hazards: Journal of the International Society for the Prevention and Mitigation of Natural Hazards, Springer;International Society for the Prevention and Mitigation of Natural Hazards, vol. 120(3), pages 2499-2527, February.
    6. Muxue Liang & Hong Liao & Yue Huang & Zifang Qiao & Chenchen Tan & Ruoxin Liu, 2021. "A Questionnaire Case Study of Opinions of Chinese Agricultural Workers on the Coordinated Control of Emissions of Ammonia," Sustainability, MDPI, vol. 13(4), pages 1-18, February.
    7. Ze Liang & Yueyao Wang & Jiao Huang & Feili Wei & Shuyao Wu & Jiashu Shen & Fuyue Sun & Shuangcheng Li, 2020. "Seasonal and Diurnal Variations in the Relationships between Urban Form and the Urban Heat Island Effect," Energies, MDPI, vol. 13(22), pages 1-19, November.
    8. Rui Lyu & Wei Gao & Yarong Peng & Yijie Qian & Qianshan He & Tiantao Cheng & Xingna Yu & Gang Zhao, 2022. "Fog–Haze Transition and Drivers in the Coastal Region of the Yangtze River Delta," IJERPH, MDPI, vol. 19(15), pages 1-16, August.
    9. Fangjin Xu & Qingxu Huang & Huanbi Yue & Xingyun Feng & Haoran Xu & Chunyang He & Peng Yin & Brett A. Bryan, 2023. "The challenge of population aging for mitigating deaths from PM2.5 air pollution in China," Nature Communications, Nature, vol. 14(1), pages 1-13, December.
    10. Hao Guo & Anming Bao & Tie Liu & Felix Ndayisaba & Daming He & Alishir Kurban & Philippe De Maeyer, 2017. "Meteorological Drought Analysis in the Lower Mekong Basin Using Satellite-Based Long-Term CHIRPS Product," Sustainability, MDPI, vol. 9(6), pages 1-21, May.
    11. Peng Zhang & Tianzeng Chen & Qingxin Ma & Biwu Chu & Yonghong Wang & Yujing Mu & Yunbo Yu & Hong He, 2022. "Diesel soot photooxidation enhances the heterogeneous formation of H2SO4," Nature Communications, Nature, vol. 13(1), pages 1-9, December.
    12. Huanbi Yue & Chunyang He & Qingxu Huang & Da Zhang & Peijun Shi & Enayat A. Moallemi & Fangjin Xu & Yang Yang & Xin Qi & Qun Ma & Brett A. Bryan, 2024. "Substantially reducing global PM2.5-related deaths under SDG3.9 requires better air pollution control and healthcare," Nature Communications, Nature, vol. 15(1), pages 1-13, December.
    13. Zhiyuan Wang & Xiaoyi Shi & Chunhua Pan & Sisi Wang, 2021. "Spatial and Temporal Characteristics of Environmental Air Quality and Its Relationship with Seasonal Climatic Conditions in Eastern China during 2015–2018," IJERPH, MDPI, vol. 18(9), pages 1-17, April.
    14. Li Yang & Chunyan Qin & Ke Li & Chuxiong Deng & Yaojun Liu, 2023. "Quantifying the Spatiotemporal Heterogeneity of PM 2.5 Pollution and Its Determinants in 273 Cities in China," IJERPH, MDPI, vol. 20(2), pages 1-17, January.
    15. Xiangxue Zhang & Changxiu Cheng, 2022. "Temporal and Spatial Heterogeneity of PM 2.5 Related to Meteorological and Socioeconomic Factors across China during 2000–2018," IJERPH, MDPI, vol. 19(2), pages 1-15, January.
    16. Lu Niu & Ronglin Tang & Yazhen Jiang & Xiaoming Zhou, 2020. "Spatiotemporal Patterns and Drivers of the Surface Urban Heat Island in 36 Major Cities in China: A Comparison of Two Different Methods for Delineating Rural Areas," Sustainability, MDPI, vol. 12(2), pages 1-17, January.
    17. Wang, Jing & Li, Yazhou & Wu, Jianlin & Gu, Jibao & Xu, Shuo, 2020. "Environmental beliefs and public acceptance of nuclear energy in China: A moderated mediation analysis," Energy Policy, Elsevier, vol. 137(C).
    18. Yu Taoa & Faisal Mumtaz & Barjeece Bashir & Hamid Faiz & Mariam Kareem & Adeel Ahmad & Hammad Ul Hassan, 2021. "The Impact Of The Lockdown On Air Quality In Result Of Covid-19 Pandemic Over Hubei Province, China," Environment & Ecosystem Science (EES), Zibeline International Publishing, vol. 5(1), pages 15-22, January.
    19. Lei, Yadong & Zhang, Feng & Miao, Lijuan & Yu, Qiu-Run & Duan, Mingkeng & Fraedrich, Klaus & Yu, Zifeng, 2020. "Potential impacts of future reduced aerosols on internal dynamics characteristics of precipitation based on model simulations over southern China," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 545(C).
    20. Yali Zhong & Shuqing Chen & Haihua Mo & Weiwen Wang & Pengfei Yu & Xuemei Wang & Nima Chuduo & Bian Ba, 2022. "Contribution of urban expansion to surface warming in high-altitude cities of the Tibetan Plateau," Climatic Change, Springer, vol. 175(1), pages 1-22, November.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:nathaz:v:121:y:2025:i:6:d:10.1007_s11069-024-07077-z. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.