Author
Listed:
- Chengwei Zhao
- Long Li
- Yubo Wang
- Xuqing Li
- Chong Xu
- Yubin Song
- Dongsheng Ren
- Cheng Xiao
Abstract
Landslide segmentation from remote sensing imagery is crucial for rapid disaster assessment and risk mitigation. Owing to the pronounced heterogeneity of landslide scales and the subtle visual contrast between some landslide bodies and their background, this task remains highly challenging. Although Transformers surpass convolutional neural networks in modeling long-range contextual dependencies, channel-level or feature-level fusion strategies provide only intermittent terrain cues, leading models to underutilize digital elevation model (DEM) information and to lack fine-grained adaptability to terrain variability. To address this, We propose a Swin-Transformer–based framework, Dual-Stage DEM-guided Fusion Transformer for landslide segmentation (D2FLS-Net), which embeds terrain features via two modules: (1) The Dual-Stage DEM-Guided Fusion (DSDF) module that injects DEM cues twice, where the early stage emphasizes DEM related discontinuities before feature extraction, and the late stage coordinates high-level RGB and DEM semantics through a cross-attention mechanism. (2) The Terrain-aware Pixel-wise Adaptive Context Enhancement (T-PACE) module that optimizes intermediate features using a DEM-gated, pixel-adaptive hybrid of multi-dilation atrous convolutions, enabling broader context aggregation within homogeneous landslide interiors and more precise discrimination at boundaries. We evaluate D2FLS-Net on the Bijie and Landslide4Sense 2022 datasets. On Bijie, the mean Intersection over Union (mIoU) reaches 88.77%, Recall 95.27%, and Precision 94.60%, exceeding the best competing model SegFormer by 7.96%, 7.90%, and 4.05%, respectively. On Landslide4Sense2022, mIoU 72.86%, Recall 82.55%, and Precision 93.30%, surpassing SegFormer by 7.06%, 6.56%, and 5.02%, respectively. Ablation studies indicate that DSDF primarily reduces missed detections of landslide traces, whereas T-PACE refines pixel level context selection. Injecting DEM at the Swin-1 and Swin-4 stages consistently outperforms other stage combinations. In summary, the model shows good detection performance and is suitable for fusing DEM and remote sensing imagery for landslide recognition.
Suggested Citation
Chengwei Zhao & Long Li & Yubo Wang & Xuqing Li & Chong Xu & Yubin Song & Dongsheng Ren & Cheng Xiao, 2025.
"D2FLS-Net: Dual-Stage DEM-guided Fusion Transformer for landslide segmentation,"
PLOS ONE, Public Library of Science, vol. 20(11), pages 1-19, November.
Handle:
RePEc:plo:pone00:0337412
DOI: 10.1371/journal.pone.0337412
Download full text from publisher
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0337412. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.