IDEAS home Printed from https://ideas.repec.org/a/gam/jijerp/v19y2022i19p12180-d925319.html
   My bibliography  Save this article

Predicting Heavy Metal Concentrations in Shallow Aquifer Systems Based on Low-Cost Physiochemical Parameters Using Machine Learning Techniques

Author

Listed:
  • Thi-Minh-Trang Huynh

    (Graduate Institute of Applied Geology, National Central University, Taoyuan 32001, Taiwan)

  • Chuen-Fa Ni

    (Graduate Institute of Applied Geology, National Central University, Taoyuan 32001, Taiwan
    Center for Environmental Studies, National Central University, Taoyuan 32001, Taiwan)

  • Yu-Sheng Su

    (Department of Computer Science and Engineering, National Taiwan Ocean University, Keelung 202301, Taiwan)

  • Vo-Chau-Ngan Nguyen

    (College of Environment and Natural Resources, Can Tho University, Can Tho 94000, Vietnam)

  • I-Hsien Lee

    (Graduate Institute of Applied Geology, National Central University, Taoyuan 32001, Taiwan
    Center for Environmental Studies, National Central University, Taoyuan 32001, Taiwan)

  • Chi-Ping Lin

    (Graduate Institute of Applied Geology, National Central University, Taoyuan 32001, Taiwan
    Center for Environmental Studies, National Central University, Taoyuan 32001, Taiwan)

  • Hoang-Hiep Nguyen

    (Graduate Institute of Applied Geology, National Central University, Taoyuan 32001, Taiwan)

Abstract

Monitoring ex-situ water parameters, namely heavy metals, needs time and laboratory work for water sampling and analytical processes, which can retard the response to ongoing pollution events. Previous studies have successfully applied fast modeling techniques such as artificial intelligence algorithms to predict heavy metals. However, neither low-cost feature predictability nor explainability assessments have been considered in the modeling process. This study proposes a reliable and explainable framework to find an effective model and feature set to predict heavy metals in groundwater. The integrated assessment framework has four steps: model selection uncertainty, feature selection uncertainty, predictive uncertainty, and model interpretability. The results show that Random Forest is the most suitable model, and quick-measure parameters can be used as predictors for arsenic (As), iron (Fe), and manganese (Mn). Although the model performance is auspicious, it likely produces significant uncertainties. The findings also demonstrate that arsenic is related to nutrients and spatial distribution, while Fe and Mn are affected by spatial distribution and salinity. Some limitations and suggestions are also discussed to improve the prediction accuracy and interpretability.

Suggested Citation

  • Thi-Minh-Trang Huynh & Chuen-Fa Ni & Yu-Sheng Su & Vo-Chau-Ngan Nguyen & I-Hsien Lee & Chi-Ping Lin & Hoang-Hiep Nguyen, 2022. "Predicting Heavy Metal Concentrations in Shallow Aquifer Systems Based on Low-Cost Physiochemical Parameters Using Machine Learning Techniques," IJERPH, MDPI, vol. 19(19), pages 1-21, September.
  • Handle: RePEc:gam:jijerp:v:19:y:2022:i:19:p:12180-:d:925319
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/1660-4601/19/19/12180/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1660-4601/19/19/12180/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Eric W Fox & Jay M Ver Hoef & Anthony R Olsen, 2020. "Comparing spatial regression to random forests for large environmental data sets," PLOS ONE, Public Library of Science, vol. 15(3), pages 1-22, March.
    2. Basim Mahbooba & Mohan Timilsina & Radhya Sahal & Martin Serrano & Ahmed Mostafa Khalil, 2021. "Explainable Artificial Intelligence (XAI) to Enhance Trust Management in Intrusion Detection Systems Using Decision Tree Model," Complexity, Hindawi, vol. 2021, pages 1-11, January.
    3. Akram Seifi & Mohammad Ehteram & Vijay P. Singh & Amir Mosavi, 2020. "Modeling and Uncertainty Analysis of Groundwater Level Using Six Evolutionary Optimization Algorithms Hybridized with ANFIS, SVM, and ANN," Sustainability, MDPI, vol. 12(10), pages 1-42, May.
    4. Russell R. Barton & Barry L. Nelson & Wei Xie, 2014. "Quantifying Input Uncertainty via Simulation Confidence Intervals," INFORMS Journal on Computing, INFORMS, vol. 26(1), pages 74-87, February.
    5. Nantian Huang & Guobo Lu & Dianguo Xu, 2016. "A Permutation Importance-Based Feature Selection Method for Short-Term Electricity Load Forecasting Using Random Forest," Energies, MDPI, vol. 9(10), pages 1-24, September.
    6. Willcock, Simon & Martínez-López, Javier & Hooftman, Danny A.P. & Bagstad, Kenneth J. & Balbi, Stefano & Marzo, Alessia & Prato, Carlo & Sciandrello, Saverio & Signorello, Giovanni & Voigt, Brian & , 2018. "Machine learning for ecosystem services," Ecosystem Services, Elsevier, vol. 33(PB), pages 165-174.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Wen Shi & Xi Chen & Jennifer Shang, 2019. "An Efficient Morris Method-Based Framework for Simulation Factor Screening," INFORMS Journal on Computing, INFORMS, vol. 31(4), pages 745-770, October.
    2. Ricardo Vazquez & Hortensia Amaris & Monica Alonso & Gregorio Lopez & Jose Ignacio Moreno & Daniel Olmeda & Javier Coca, 2017. "Assessment of an Adaptive Load Forecasting Methodology in a Smart Grid Demonstration Project," Energies, MDPI, vol. 10(2), pages 1-23, February.
    3. Agudelo, César Augusto Ruiz & Bustos, Sandra Liliana Hurtado & Moreno, Carmen Alicia Parrado, 2020. "Modeling interactions among multiple ecosystem services. A critical review," Ecological Modelling, Elsevier, vol. 429(C).
    4. Kaneko, Nanae & Fujimoto, Yu & Hayashi, Yasuhiro, 2022. "Sensitivity analysis of factors relevant to extreme imbalance between procurement plans and actual demand: Case study of the Japanese electricity market," Applied Energy, Elsevier, vol. 313(C).
    5. Weiwei Fan & L. Jeff Hong & Xiaowei Zhang, 2020. "Distributionally Robust Selection of the Best," Management Science, INFORMS, vol. 66(1), pages 190-208, January.
    6. Héctor Migallón & Akram Belazi & José-Luis Sánchez-Romero & Héctor Rico & Antonio Jimeno-Morenilla, 2020. "Settings-Free Hybrid Metaheuristic General Optimization Methods," Mathematics, MDPI, vol. 8(7), pages 1-25, July.
    7. Lorilla, Roxanne Suzette & Poirazidis, Konstantinos & Detsis, Vassilis & Kalogirou, Stamatis & Chalkias, Christos, 2020. "Socio-ecological determinants of multiple ecosystem services on the Mediterranean landscapes of the Ionian Islands (Greece)," Ecological Modelling, Elsevier, vol. 422(C).
    8. Sangwan Lee, 2022. "Exploring Associations between Multimodality and Built Environment Characteristics in the U.S," Sustainability, MDPI, vol. 14(11), pages 1-16, May.
    9. Shi, Wen & Zhou, Qing & Zhou, Yanju, 2023. "An efficient elementary effect-based method for sensitivity analysis in identifying main and two-factor interaction effects," Reliability Engineering and System Safety, Elsevier, vol. 237(C).
    10. Sangwan Lee, 2022. "Satisfaction with the Pedestrian Environment and Its Relationship to Neighborhood Satisfaction in Seoul, South Korea," Sustainability, MDPI, vol. 14(15), pages 1-15, July.
    11. Katharina Schulze & Žiga Malek & Dmitry Schepaschenko & Myroslava Lesiv & Steffen Fritz & Peter H. Verburg, 2023. "Pantropical distribution of short-rotation woody plantations: spatial probabilities under current and future climate," Mitigation and Adaptation Strategies for Global Change, Springer, vol. 28(5), pages 1-22, June.
    12. Jun Yuan & Haowei Wang & Szu Hui Ng & Victor Nian, 2020. "Ship Emission Mitigation Strategies Choice Under Uncertainty," Energies, MDPI, vol. 13(9), pages 1-20, May.
    13. Vahid Nourani & Nardin Jabbarian Paknezhad & Hitoshi Tanaka, 2021. "Prediction Interval Estimation Methods for Artificial Neural Network (ANN)-Based Modeling of the Hydro-Climatic Processes, a Review," Sustainability, MDPI, vol. 13(4), pages 1-18, February.
    14. Dongkyu Lee & Jinhwa Jeong & Sung Hoon Yoon & Young Tae Chae, 2019. "Improvement of Short-Term BIPV Power Predictions Using Feature Engineering and a Recurrent Neural Network," Energies, MDPI, vol. 12(17), pages 1-17, August.
    15. Signorello, Giovanni & Prato, Carlo & Marzo, Alessia & Ientile, Renzo & Cucuzza, Giuseppe & Sciandrello, Saverio & Martínez-López, Javier & Balbi, Stefano & Villa, Ferdinando, 2018. "Are protected areas covering important biodiversity sites? An assessment of the nature protection network in Sicily (Italy)," Land Use Policy, Elsevier, vol. 78(C), pages 593-602.
    16. Richards, Daniel Rex & Lavorel, Sandra, 2022. "Integrating social media data and machine learning to analyse scenarios of landscape appreciation," Ecosystem Services, Elsevier, vol. 55(C).
    17. repec:eur:ejfejr:25 is not listed on IDEAS
    18. Frank Cremer & Barry Sheehan & Michael Fortmann & Arash N. Kia & Martin Mullins & Finbarr Murphy & Stefan Materne, 2022. "Cyber risk and cybersecurity: a systematic review of data availability," The Geneva Papers on Risk and Insurance - Issues and Practice, Palgrave Macmillan;The Geneva Association, vol. 47(3), pages 698-736, July.
    19. Capriolo, A. & Boschetto, R.G. & Mascolo, R.A. & Balbi, S. & Villa, F., 2020. "Biophysical and economic assessment of four ecosystem services for natural capital accounting in Italy," Ecosystem Services, Elsevier, vol. 46(C).
    20. Manley, Kyle & Nyelele, Charity & Egoh, Benis N., 2022. "A review of machine learning and big data applications in addressing ecosystem service research gaps," Ecosystem Services, Elsevier, vol. 57(C).
    21. Feras Alasali & Husam Foudeh & Esraa Mousa Ali & Khaled Nusair & William Holderbaum, 2021. "Forecasting and Modelling the Uncertainty of Low Voltage Network Demand and the Effect of Renewable Energy Sources," Energies, MDPI, vol. 14(8), pages 1-31, April.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jijerp:v:19:y:2022:i:19:p:12180-:d:925319. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.