IDEAS home Printed from https://ideas.repec.org/a/gam/jsusta/v17y2025i20p9211-d1773500.html
   My bibliography  Save this article

Predicting the Concentration Levels of PM 2.5 and O 3 for Highly Urbanized Areas Based on Machine Learning Models

Author

Listed:
  • Chao Wei

    (China National Environmental Monitoring Center, Beijing 100012, China)

  • Chen Zhao

    (State Key Laboratory of Environmental Criteria and Risk Assessment, Chinese Research Academy of Environmental Sciences, Beijing 100012, China)

  • Yuanan Hu

    (MOE Laboratory of Groundwater Circulation and Evolution, School of Water Resources and Environment, China University of Geosciences (Beijing), Beijing 100083, China)

  • Yutai Tian

    (State Key Laboratory of Environmental Criteria and Risk Assessment, Chinese Research Academy of Environmental Sciences, Beijing 100012, China)

Abstract

The accurate real-time forecasting and impact factor identification of air pollutant levels are critical for effective pollution control and management. In this study, we implemented three machine learning algorithms, namely, Random Forest (RF), eXtreme Gradient Boosting (XGBoost), and Fully Connected Neural Network (FCNN), to predict PM 2.5 and O 3 concentrations in the Beijing–Tianjin–Hebei region from 2019 to 2023. XGBoost outperformed the other algorithms and was further utilized to predict PM 2.5 and O 3 concentrations and identify their controlling factors. The models could efficiently capture the spatial and temporal variations in the pollutants in the study area, and it was found that both anthropogenic sources and weather conditions can have significant impacts on air pollutant levels. PM 10 and CO were significantly correlated to PM 2.5 levels, which could be attributed to their similar emission sources and dispersion characteristics in air. O 3 concentrations were greatly influenced by temperature and NO 2 due to their significant impacts on O 3 generation. This study demonstrates that XGBoost-based models are cost-effective tools for predicting PM 2.5 and O 3 levels and identifying their controlling factors. These findings provide valuable insights for formulating effective air pollution prevention policies.

Suggested Citation

  • Chao Wei & Chen Zhao & Yuanan Hu & Yutai Tian, 2025. "Predicting the Concentration Levels of PM 2.5 and O 3 for Highly Urbanized Areas Based on Machine Learning Models," Sustainability, MDPI, vol. 17(20), pages 1-22, October.
  • Handle: RePEc:gam:jsusta:v:17:y:2025:i:20:p:9211-:d:1773500
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2071-1050/17/20/9211/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2071-1050/17/20/9211/
    Download Restriction: no
    ---><---

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jsusta:v:17:y:2025:i:20:p:9211-:d:1773500. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.