IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0331097.html
   My bibliography  Save this article

Experimental characterization and machine learning modeling of leakage-induced soil fluidization in water distribution systems

Author

Listed:
  • Masoud Ghodsian
  • Shima Mohammadbeigi

Abstract

Leakage in water distribution systems poses a global challenge, not only due to resource loss but also through soil erosion and sinkhole formation, which risk infrastructure collapse. This study investigates the mechanisms of soil fluidization, a process in which pressurized pipeline leakage generates turbulent water-soil mixtures, forming expanding fluidized zones. Experimental tests using a custom leakage simulation apparatus, combined with dimensional analysis, were conducted to identify key factors influencing fluidization dynamics. Empirical equations were developed to predict fluidized zone height and area (R² = 0.753–0.915) for both upward and downward leakages. These models were validated against 150 experimental datasets from current and prior studies, covering a wide range of leakage rates and soil types. Three ensemble machine learning models—Random Forest, XGBoost, and a Stacking model integrating support vector regression, multilayer perceptron, and linear regression—were employed to enhance predictive accuracy and stability. The results of evaluation metrics (R2, RMSE and correlation coefficient) showed that although XGBoost outperformed other models regarding accuracy (with R2 = 0.91 in test splits), this model exhibited the lowest stability in predicting dimensionless height and area of fluidized zone (with ΔR² = 0.07–0.08). The Random Forest model had the lowest accuracy (R2 = 0.907–0.912 in train phase) though the most capability in generalization through the minimum differences between train and test splits (with ΔR² = 0.02–0.04). Regarding Stacking model, both accuracy and stability maintained in balanced conditions with moderate performance. The conclusion from findings of all evaluation metrics were the same as deterministic coefficient. The equations derived from dimensional analysis, especially equations for downward leakage direction, also showed comparable performance with the most accurate ensemble model like XGBoost (with R² up to 0.915). Temporal analysis revealed the progression of fluidization through seven distinct stages over approximately 75 seconds. Fluidization initiated at a specific discharge rate, beginning with the formation of a 2 cm cavity and culminating in vortex bifurcation and dimensional stabilization. Critical pore pressure thresholds, observed around 20 seconds, induced suspended particle states and fountain flow, while vertical cavity growth predominated between 25 and 65 seconds. Sensitivity analysis highlighted the densimetric Froude number and soil uniformity as dominant factors, with their omission reducing model R² by up to 84.1% and 71.3%, respectively. In contrast, the particle size-to-leak area ratio exhibited marginal effect (

Suggested Citation

  • Masoud Ghodsian & Shima Mohammadbeigi, 2025. "Experimental characterization and machine learning modeling of leakage-induced soil fluidization in water distribution systems," PLOS ONE, Public Library of Science, vol. 20(9), pages 1-23, September.
  • Handle: RePEc:plo:pone00:0331097
    DOI: 10.1371/journal.pone.0331097
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0331097
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0331097&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0331097?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0331097. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.