Author
Listed:
- Jiao Li
(Technical Centre for Soil, Agriculture and Rural Ecology and Environment, Ministry of Ecology and Environment, Beijing 100012, China)
- Linglong Meng
(Technical Centre for Soil, Agriculture and Rural Ecology and Environment, Ministry of Ecology and Environment, Beijing 100012, China)
- Tianran Li
(Technical Centre for Soil, Agriculture and Rural Ecology and Environment, Ministry of Ecology and Environment, Beijing 100012, China)
- Pengli Xue
(Technical Centre for Soil, Agriculture and Rural Ecology and Environment, Ministry of Ecology and Environment, Beijing 100012, China)
- Hejing Wang
(Technical Centre for Soil, Agriculture and Rural Ecology and Environment, Ministry of Ecology and Environment, Beijing 100012, China)
- Jie Hua
(Technical Centre for Soil, Agriculture and Rural Ecology and Environment, Ministry of Ecology and Environment, Beijing 100012, China)
Abstract
Soil element background concentration is foundational data for environmental quality assessment, contamination diagnosis, and sustainable land management. However, existing investigation-based methods are time-consuming and inefficient. The machine learning (ML) method has demonstrated excellent performance in predicting soil heavy metal concentration. In this study, based on the nine environmental variables of soil formation from 210 soil monitoring points, including elevation, pH, organic matter, soil type, parent material, plant cover, land use type, topography, and soil texture, decision tree (DT), random forest (RF), extreme gradient boosting (XGB), and support vector machine (SVM) models were used to predict the eleven soil element background concentrations. Among them, SVM and RF models could be used for an effective prediction of the background concentration of all soil heavy metals. Compared with the XGBoost and DT, the SVM for all heavy metals except for cadmium (Cd) and manganese (Mn) performs best. Although the key factors affecting background concentrations vary among different soil elements, organic matter, soil type, and altitude, they play a crucial role in the accurate prediction of soil element background concentration. This study provides simple and efficient ML models for predicting soil element background concentration at the large regional scale. The results of this study can be utilized to distinguish natural geochemical processes from human-induced pollution.
Suggested Citation
Jiao Li & Linglong Meng & Tianran Li & Pengli Xue & Hejing Wang & Jie Hua, 2025.
"Application of Machine Learning Approaches to Predict Soil Element Background Concentration at Large Region Scale,"
Sustainability, MDPI, vol. 17(17), pages 1-22, August.
Handle:
RePEc:gam:jsusta:v:17:y:2025:i:17:p:7853-:d:1738780
Download full text from publisher
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jsusta:v:17:y:2025:i:17:p:7853-:d:1738780. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.