IDEAS home Printed from https://ideas.repec.org/a/gam/jsusta/v17y2025i16p7547-d1729308.html
   My bibliography  Save this article

Predicting Soil Fertility in Semi-Arid Agroecosystems Using Interpretable Machine Learning Models: A Sustainable Approach for Data-Sparse Regions

Author

Listed:
  • Nurullah Acir

    (Department of Soil Science and Plant Nutrition, Faculty of Agriculture, Kırşehir Ahi Evran University, Kırşehir 40100, Türkiye)

Abstract

The accurate assessment of soil fertility is critical for guiding nutrient management and promoting sustainable agriculture in semi-arid agroecosystems. In this study, a machine learning-based Soil Fertility Index (SFI) model was developed using regularized regression techniques to evaluate fertility across a dryland maize-growing region in southeastern Türkiye. A total of 64 composite soil samples were collected from the Batman Plain, characterized by alkaline and salinity-prone conditions. Five soil chemical indicators, electrical conductivity (EC), pH, organic matter (OM), zinc (Zn), and iron (Fe), were selected for SFI estimation using a standardized rating approach. The dataset was randomly split into training (80%) and test (20%) subsets to calibrate and validate the models. Ridge, Lasso, and Elastic Net regression models were employed to predict SFI and assess variable importance. Among these, the Lasso model achieved the highest predictive accuracy on test data (R 2 = 0.746, RMSE = 0.060), retaining only EC and Zn as significant predictors. Ridge and Elastic Net captured OM and pH, though their contributions were minimal (|β| < 0.01). Spatial predictions showed moderate alignment with observed SFI values (range: 0.48–0.76), but all models underestimated high-fertility zones (>0.69), likely due to coefficient shrinkage. Despite its simplicity, the Lasso model offered superior interpretability and spatial resolution. The results reveal the potential of interpretable machine learning for supporting sustainable, site-specific fertility assessment and informed nutrient management in data-scarce and environmentally vulnerable regions.

Suggested Citation

  • Nurullah Acir, 2025. "Predicting Soil Fertility in Semi-Arid Agroecosystems Using Interpretable Machine Learning Models: A Sustainable Approach for Data-Sparse Regions," Sustainability, MDPI, vol. 17(16), pages 1-22, August.
  • Handle: RePEc:gam:jsusta:v:17:y:2025:i:16:p:7547-:d:1729308
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2071-1050/17/16/7547/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2071-1050/17/16/7547/
    Download Restriction: no
    ---><---

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jsusta:v:17:y:2025:i:16:p:7547-:d:1729308. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.