Author
Listed:
- Deliang Li
- Tao Liu
- Haokun Wang
- Long Yan
Abstract
Traditional methods for building extraction from remote sensing images rely on feature classification techniques, which often suffer from high usage thresholds, cumbersome data processing, slow recognition speeds, and poor adaptability. With the rapid advancement of artificial intelligence, particularly machine learning and deep learning, significant progress has been achieved in the intelligent extraction of remote sensing images. Building extraction plays a crucial role in geographic information applications, such as urban planning, resource management, and ecological protection. This study proposes an efficient and accurate building extraction method based on the SegFormer model, a state-of-the-art Transformer-based architecture for semantic segmentation. The workflow includes data preparation, model construction, model deployment, and application. The SegFormer model is selected for its hierarchical Transformer encoder and lightweight MLP decoder, which enable high-precision binary classification of buildings in remote sensing images. Additionally, post-processing techniques, such as noise filtering, boundary cleanup, and building regularization, are applied to refine the inference results, significantly improving both the visual presentation and accuracy of the extracted buildings. Experimental validation is conducted using the publicly available WHU building dataset, demonstrating the effectiveness of the proposed method in urban, rural, and mountainous areas. The results show that the SegFormer model achieves high accuracy, with the MiT-B5 backbone network reaching 94.13% Intersection over Union (IoU) after 100 training epochs. The study highlights the robustness and scalability of the method, providing a solid technical foundation for remote sensing image analysis and practical applications in geographic information systems.
Suggested Citation
Deliang Li & Tao Liu & Haokun Wang & Long Yan, 2025.
"Building extraction from remote sensing imagery using SegFormer with post-processing optimization,"
PLOS ONE, Public Library of Science, vol. 20(12), pages 1-17, December.
Handle:
RePEc:plo:pone00:0338104
DOI: 10.1371/journal.pone.0338104
Download full text from publisher
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0338104. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.