
Optimizing LSTM networks and feature selection algorithms using GEE data

Author

Listed:
  • Mohammad Kazemi
  • Reza Naderi Samani
  • Narges Kariminejad

Abstract

Feature selection uncertainty and suboptimal model configuration are critical challenges in flood susceptibility mapping (FSM) for disaster risk reduction. This study introduces a novel integrated framework that couples a consensus feature selection strategy with metaheuristic-optimized deep learning for high-precision FSM in the flood-prone Khuzestan Province, Iran. An initial set of 19 factors was sourced from Google Earth Engine (GEE). The most influential variables were identified using an ensemble of nine feature selection methods: Boruta, Boruta-SHAP, Elastic-Net, Mutual Information, Permutation Importance, Recursive Feature Elimination (RFE), Sequential Forward Selection (SFS), Stability Selection, and Deep Feature Importance. A frequency-based consensus rule was applied to the ensemble, whereby a variable was retained only if it was selected by a majority of the methods. This process established the Normalized Difference Vegetation Index (NDVI) and Daily Minimum Temperature (TMMN) as the most critical predictors. For model development, 1,000 sample points were used, consisting of 500 flood points (value 1) and 500 randomly selected non-flood points (value 0), and the trained model was subsequently generalized to the entire study area. A Long Short-Term Memory (LSTM) network was developed using this optimal feature set and then enhanced through hyperparameter optimization with five metaheuristic algorithms: WOA, GWO, OOA, CSA, and HOA. Model validation showed that optimization improved performance, with the LSTM-WOA model performing best and achieving the highest F1-Score (0.88) and Cohen’s Kappa (0.75). The final FSM identified the northwestern and central regions as having the highest susceptibility. The study's innovation lies in its formalized consensus feature selection and comparative metaheuristic optimization, providing a reliable tool for FSM in arid and semi-arid regions.
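
The frequency-based consensus rule described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function name consensus_features, the placeholder method names, the toy feature lists (including "slope" and "elevation"), and the reading of "majority" as strictly more than half of the methods are all assumptions made for illustration.

```python
from collections import Counter


def consensus_features(selections, min_votes=None):
    """Keep features picked by at least `min_votes` selection methods.

    `selections` maps a method name to the features that method chose.
    If `min_votes` is None, a strict majority of methods is required
    (an assumption; the abstract only says "a majority of methods").
    """
    n_methods = len(selections)
    if min_votes is None:
        min_votes = n_methods // 2 + 1  # strict majority of the methods
    # Count each feature at most once per method.
    votes = Counter(
        feature for chosen in selections.values() for feature in set(chosen)
    )
    return sorted(f for f, count in votes.items() if count >= min_votes)


# Hypothetical toy input: three placeholder methods, not the study's results.
selections = {
    "method_a": ["NDVI", "TMMN", "slope"],
    "method_b": ["NDVI", "TMMN"],
    "method_c": ["NDVI", "elevation"],
}

retained = consensus_features(selections)
print(retained)
```

With nine methods, as in the study, the default threshold in this sketch would be five votes; passing min_votes explicitly allows a stricter or looser rule.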

Suggested Citation

  • Mohammad Kazemi & Reza Naderi Samani & Narges Kariminejad, 2026. "Optimizing LSTM networks and feature selection algorithms using GEE data," PLOS ONE, Public Library of Science, vol. 21(4), pages 1-26, April.
  • Handle: RePEc:plo:pone00:0347858
    DOI: 10.1371/journal.pone.0347858

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0347858
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0347858&type=printable
    Download Restriction: no

