IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0314327.html
   My bibliography  Save this article

Enhancing PM2.5 prediction by mitigating annual data drift using wrapped loss and neural networks

Author

Listed:
  • Md Khalid Hossen
  • Yan-Tsung Peng
  • Meng Chang Chen

Abstract

In many deep learning tasks, it is assumed that the data used in the training process is sampled from the same distribution. However, this may not be accurate for data collected from different contexts or during different periods. For instance, the temperatures in a city can vary from year to year due to various unclear reasons. In this paper, we utilized three distinct statistical techniques to analyze annual data drifting at various stations. These techniques calculate the P values for each station by comparing data from five years (2014-2018) to identify data drifting phenomena. To find out the data drifting scenario those statistical techniques and calculate the P value from those techniques to measure the data drifting in specific locations. From those statistical techniques, the highest drifting stations can be identified from the previous year’s datasets To identify data drifting and highlight areas with significant drift, we utilized meteorological air quality and weather data in this study. We proposed two models that consider the characteristics of data drifting for PM2.5 prediction and compared them with various deep learning models, such as Long Short-Term Memory (LSTM) and its variants, for predictions from the next hour to the 64th hour. Our proposed models significantly outperform traditional neural networks. Additionally, we introduced a wrapped loss function incorporated into a model, resulting in more accurate results compared to those using the original loss function alone and prediction has been evaluated by RMSE, MAE and MAPE metrics. The proposed Front-loaded connection model(FLC) and Back-loaded connection model (BLC) solve the data drifting issue and the wrap loss function also help alleviate the data drifting problem with model training and works for the neural network models to achieve more accurate results. Eventually, the experimental results have shown that the proposed model performance enhanced from 24.1% -16%, 12%-8.3% respectively at 1h-24h, 32h-64h with compared to baselines BILSTM model, by 24.6% -11.8%, 10%-10.2% respectively at 1h-24h, 32h-64h compared to CNN model in hourly PM2.5 predictions.

Suggested Citation

  • Md Khalid Hossen & Yan-Tsung Peng & Meng Chang Chen, 2025. "Enhancing PM2.5 prediction by mitigating annual data drift using wrapped loss and neural networks," PLOS ONE, Public Library of Science, vol. 20(2), pages 1-25, February.
  • Handle: RePEc:plo:pone00:0314327
    DOI: 10.1371/journal.pone.0314327
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0314327
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0314327&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0314327?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Erin E. McDuffie & Randall V. Martin & Joseph V. Spadaro & Richard Burnett & Steven J. Smith & Patrick O’Rourke & Melanie S. Hammer & Aaron Donkelaar & Liam Bindle & Viral Shah & Lyatt Jaeglé & Gan Lu, 2021. "Source sector and fuel contributions to ambient PM2.5 and attributable mortality across multiple spatial scales," Nature Communications, Nature, vol. 12(1), pages 1-12, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Baraldi, Anna Laura & Fosco, Giovanni, 2024. "Clearing the Air: Women in Politics and Air Pollution," MPRA Paper 121377, University Library of Munich, Germany.
    2. Duque, Valentina & Gilraine, Michael, 2022. "Coal use, air pollution, and student performance," Journal of Public Economics, Elsevier, vol. 213(C).
    3. Margherita Gerolimetto & Stefano Magrini, 2025. "From Metropolitan Planning Organization to Transport Management Areas: a change of air?," Economics Bulletin, AccessEcon, vol. 45(1), pages 577-585.
    4. Sun, Shuyu & Tong, Kangkang, 2024. "Rural-urban inequality in energy use sufficiency and efficiency during a rapid urbanization period," Applied Energy, Elsevier, vol. 364(C).
    5. Lutz Sager, 2025. "Estimating the Effect of China’s 2013 Air Pollution Prevention and Control Action Plan," CESifo Working Paper Series 11826, CESifo.
    6. Gilraine, Michael & Zheng, Angela, 2024. "JUE insight: Air pollution and student performance in the U.S," Journal of Urban Economics, Elsevier, vol. 143(C).
    7. Cao, Andi & Guo, Lili & Li, Houjian, 2025. "Tackling clean energy transition challenge in developing rural areas: Does college-educated village official matter?," Renewable and Sustainable Energy Reviews, Elsevier, vol. 208(C).
    8. Fan, Jing-Li & Li, Zezheng & Li, Kai & Zhang, Xian, 2022. "Modelling plant-level abatement costs and effects of incentive policies for coal-fired power generation retrofitted with CCUS," Energy Policy, Elsevier, vol. 165(C).
    9. Klaiber,Christoph Michael & Maruyama Rentschler,Jun Erik & Dorband,Ira Irina, 2023. "Distributional and Health Co-Benefits of Fossil Fuel Subsidy Reforms — Evidence from35 Countries," Policy Research Working Paper Series 10398, The World Bank.
    10. World Bank, 2022. "Azerbaijan : Towards Green Growth - Issues Note," World Bank Publications - Reports 38073, The World Bank Group.
    11. Alessandra Drigo, 2024. "Environmental justice gap in Italy: the role of industrial agglomerations and regional pollution dispersion capacity," Working Papers 2024.12, Fondazione Eni Enrico Mattei.
    12. Robin Fears & Claudia Canales‐Holzeis & Deoraj Caussy & Sherilee L. Harper & Victor Chee Wai Hoe & Jeremy N. McNeil & Johanna Mogwitz & Volker ter Meulen & Andy Haines, 2024. "Climate action for health: Inter‐regional engagement to share knowledge to guide mitigation and adaptation actions," Global Policy, London School of Economics and Political Science, vol. 15(S5), pages 75-96, September.
    13. Drigo, Alessandra, 2024. "Environmental justice gap in Italy: the role of industrial agglomerations and regional pollution dispersion capacity," FEEM Working Papers 343509, Fondazione Eni Enrico Mattei (FEEM).
    14. Zhu, Lin & Liao, Hua & Burke, Paul J., 2023. "Household fuel transitions have substantially contributed to child mortality reductions in China," World Development, Elsevier, vol. 164(C).
    15. World Bank, 2022. "Supporting Serbia’s Transition to Greener and More Resilient Growth," World Bank Publications - Reports 38271, The World Bank Group.
    16. World Bank Group, 2023. "Azerbaijan Country Climate and Development Report," World Bank Publications - Reports 40622, The World Bank Group.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0314327. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.