IDEAS home Printed from https://ideas.repec.org/a/gam/jlands/v14y2025i7p1348-d1687134.html
   My bibliography  Save this article

An Adaptive Graph Convolutional Network with Spatial Autocorrelation for Enhancing 3D Soil Pollutant Mapping Precision from Sparse Borehole Data

Author

Listed:
  • Huan Tao

    (Key Laboratory of Land Surface Pattern and Simulation, Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences (CAS), Beijing 100101, China)

  • Ziyang Li

    (School of Information Science and Technology, Beijing Forestry University, Beijing 100083, China)

  • Shengdong Nie

    (Civil and Surveying & Mapping Engineering, Jiangxi University of Science and Technology, Ganzhou 341000, China)

  • Hengkai Li

    (Civil and Surveying & Mapping Engineering, Jiangxi University of Science and Technology, Ganzhou 341000, China)

  • Dan Zhao

    (Key Laboratory of Environmental Damage Identification and Restoration, Ministry of Ecology and Environment, Beijing 100041, China
    Center for Environmental Risk and Damage Assessment, Chinese Academy of Environmental Planning, Beijing 100012, China)

Abstract

Sparse borehole sampling at contaminated sites results in sparse and unevenly distributed data on soil pollutants. Traditional interpolation methods may obscure local variations in soil contamination when applied to such sparse data, thus reducing the interpolation accuracy. We propose an adaptive graph convolutional network with spatial autocorrelation (ASI-GCN) model to overcome this challenge. The ASI-GCN model effectively constrains pollutant concentration transfer while capturing subtle spatial variations, improving soil pollution characterization accuracy. We tested our model at a coking plant using 215 soil samples from 15 boreholes, evaluating its robustness with three pollutants of varying volatility: arsenic (As, non-volatile), benzo(a)pyrene (BaP, semi-volatile), and benzene (Ben, volatile). Leave-one-out cross-validation demonstrates that the ASI-GCN_RC_G model (ASI-GCN with residual connections) achieves the highest prediction accuracy. Specifically, the R for As, BaP, and Ben are 0.728, 0.825, and 0.781, respectively, outperforming traditional models by 58.8% (vs. IDW), 45.82% (vs. OK), and 53.78% (vs. IDW). Meanwhile, their RMSE drop by 36.56% (vs. Bayesian_K), 38.02% (vs. Bayesian_K), and 35.96% (vs. IDW), further confirming the model’s superior precision. Beyond accuracy, Monte Carlo uncertainty analysis reveals that most predicted areas exhibit low uncertainty, with only a few high-pollution hotspots exhibiting relatively high uncertainty. Further analysis revealed the significant influence of pollutant volatility on vertical migration patterns. Non-volatile As was primarily distributed in the fill and silty sand layers, and semi-volatile BaP concentrated in the silty sand layer. At the same time, volatile Ben was predominantly found in the clay and fine sand layers. By integrating spatial autocorrelation with deep graph representation, ASI-GCN redefines sparse data 3D mapping, offering a transformative tool for precise environmental governance and human health assessment.

Suggested Citation

  • Huan Tao & Ziyang Li & Shengdong Nie & Hengkai Li & Dan Zhao, 2025. "An Adaptive Graph Convolutional Network with Spatial Autocorrelation for Enhancing 3D Soil Pollutant Mapping Precision from Sparse Borehole Data," Land, MDPI, vol. 14(7), pages 1-15, June.
  • Handle: RePEc:gam:jlands:v:14:y:2025:i:7:p:1348-:d:1687134
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2073-445X/14/7/1348/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2073-445X/14/7/1348/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Di Zhu & Fan Zhang & Shengyin Wang & Yaoli Wang & Ximeng Cheng & Zhou Huang & Yu Liu, 2020. "Understanding Place Characteristics in Geographic Contexts through Graph Convolutional Neural Networks," Annals of the American Association of Geographers, Taylor & Francis Journals, vol. 110(2), pages 408-420, March.
    2. Valerie C. Bradley & Shiro Kuriwaki & Michael Isakov & Dino Sejdinovic & Xiao-Li Meng & Seth Flaxman, 2021. "Unrepresentative big surveys significantly overestimated US vaccine uptake," Nature, Nature, vol. 600(7890), pages 695-700, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Bussemakers, Carlijn & van Dijk, Mart & Dima, Alexandra L. & de Bruin, Marijn, 2023. "How well do surveys on adherence to pandemic policies assess actual behaviour: Measurement properties of the Dutch COVID-19 adherence to prevention advice survey (CAPAS)," Social Science & Medicine, Elsevier, vol. 339(C).
    2. Gazmararian, Alexander F., 2024. "Fossil fuel communities support climate policy coupled with just transition assistance," Energy Policy, Elsevier, vol. 184(C).
    3. Chao Fan & Yang Yang & Ali Mostafavi, 2024. "Neural embeddings of urban big data reveal spatial structures in cities," Palgrave Communications, Palgrave Macmillan, vol. 11(1), pages 1-15, December.
    4. Jonathan Eggleston & Carl Lieberman, 2024. "Nonresponse and Coverage Bias in the Household Pulse Survey: Evidence from Administrative Data," Working Papers 24-60, Center for Economic Studies, U.S. Census Bureau.
    5. Markhof, Yannick & Wollburg, Philip & Zezza, Alberto, 2025. "Beyond the records: Data quality and COVID-19 vaccination progress in low- and middle-income countries," Journal of Development Economics, Elsevier, vol. 174(C).
    6. Ziqi Li & Zhan Peng, 2025. "Can Moran Eigenvectors Improve Machine Learning of Spatial Data? Insights from Synthetic Data Validation," Papers 2504.12450, arXiv.org.
    7. Zhi Li & Jinsong Liu, 2023. "Evolution Process and Characteristics of Multifactor Flows in Rural Areas: A Case Study of Licheng Village in Hebei, China," Sustainability, MDPI, vol. 15(4), pages 1-16, February.
    8. Camilla Salvatore, 2023. "Inference with non-probability samples and survey data integration: a science mapping study," METRON, Springer;Sapienza Università di Roma, vol. 81(1), pages 83-107, April.
    9. Alexandre Blake & Ashley Hazel & John Jakurama & Justy Matundu & Nita Bharti, 2023. "Disparities in mobile phone ownership reflect inequities in access to healthcare," PLOS Digital Health, Public Library of Science, vol. 2(7), pages 1-16, July.
    10. He, Xin & Mao, Xiaojun & Wang, Zhonglei, 2024. "Nonparametric augmented probability weighting with sparsity," Computational Statistics & Data Analysis, Elsevier, vol. 191(C).
    11. Cameron Deal & Shea Greenberg & Gilbert Gonzales, 2024. "Sexual identity, poverty, and utilization of government services," Journal of Population Economics, Springer;European Society for Population Economics, vol. 37(2), pages 1-31, June.
    12. Carpenter, Christopher S. & Dasgupta, Kabir & Merchant, Zofsha & Plum, Alexander, 2025. "Sexual orientation and financial well-being in the United States," Journal of Economic Behavior & Organization, Elsevier, vol. 233(C).
    13. Tesary Lin & Avner Strulov-Shlain, 2023. "Choice Architecture, Privacy Valuations, and Selection Bias in Consumer Data," Papers 2308.13496, arXiv.org.
    14. Avinash Collis & Kiran Garimella & Alex Moehring & M. Amin Rahimian & Stella Babalola & Nina H. Gobat & Dominick Shattuck & Jeni Stolow & Sinan Aral & Dean Eckles, 2022. "Global survey on COVID-19 beliefs, behaviours and norms," Nature Human Behaviour, Nature, vol. 6(9), pages 1310-1317, September.
    15. Ananat, Elizabeth & Glasner, Benjamin & Hamilton, Christal & Parolin, Zachary & Pignatti, Clemente, 2024. "Effects of the expanded Child Tax Credit on employment outcomes," Journal of Public Economics, Elsevier, vol. 238(C).
    16. Stoler, Justin & Klofstad, Casey A. & Enders, Adam M. & Uscinski, Joseph E., 2022. "Sociopolitical and psychological correlates of COVID-19 vaccine hesitancy in the United States during summer 2021," Social Science & Medicine, Elsevier, vol. 306(C).
    17. Nelson, Victoria & Bashyal, Bidhan & Tan, Pang-Ning & Argyris, Young Anna, 2024. "Vaccine rhetoric on social media and COVID-19 vaccine uptake rates: A triangulation using self-reported vaccine acceptance," Social Science & Medicine, Elsevier, vol. 348(C).
    18. Pengyuan Liu & Yan Zhang & Filip Biljecki, 2024. "Explainable spatially explicit geospatial artificial intelligence in urban analytics," Environment and Planning B, , vol. 51(5), pages 1104-1123, June.

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jlands:v:14:y:2025:i:7:p:1348-:d:1687134. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.