Printed from https://ideas.repec.org/a/plo/pone00/0337318.html

Aerial small target detection algorithm based on cross-scale separated attention

Author

Listed:
  • Ju Liang
  • Fan Wang
  • Jia Chen
  • Hai-Yan Huang
  • Zu-Fan Dou

Abstract

In UAV aerial photography, targets are distributed across multiple scales, include a high proportion of small targets, and are subject to complex occlusion and strong background interference. These characteristics place high demands on a detection algorithm's fine-grained feature extraction, cross-scale fusion, and occlusion resistance. The YOLOv11s model has significant limitations in practice: its feature extraction module yields a single semantic representation, its traditional feature pyramid network has limited capability for detecting multi-scale targets, and it lacks an effective feature compensation mechanism when targets are occluded. To address these issues, we propose a UAV aerial small target detection algorithm named UAS-YOLO (Universal Inverted Bottleneck with Adaptive BiFPN and Separated and Enhancement Attention module YOLO), which incorporates three key optimizations. First, an Adaptive Bidirectional Feature Pyramid Network (ABiFPN) is designed as the Neck structure. Through cross-scale connections and dynamic weighted fusion, ABiFPN adjusts weight allocation based on target scale characteristics, focusing on feature integration at the scales relevant to small targets and improving multi-scale feature representation. Second, a Separated and Enhancement Attention Module (SEAM) is introduced to replace the original SPPF module. This module focuses on key target regions, enhances effective feature responses in unoccluded areas, and specifically compensates for information loss in occluded regions, thereby improving the detection stability of occluded small targets. Third, a Universal Inverted Bottleneck (UIB) structure is proposed and fused with the C3K2 module to form the C3K2_UIB module. By leveraging dynamic channel attention and spatial feature recalibration, C3K2_UIB suppresses background noise; although this increases the parameter count by 34%, it improves detection accuracy through efficient feature selection, striking a balance between accuracy and complexity. Experimental results show that on the VisDrone2019 dataset and the TinyPerson dataset from Kaggle, the algorithm's mean Average Precision (mAP) increases by 4.9 and 2.1 percentage points, respectively. It also shows advantages over existing advanced algorithms, effectively addressing the challenge of small target detection in complex UAV scenarios.
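
The abstract does not spell out ABiFPN's fusion rule. As a rough illustration of what "dynamic weighted fusion" can look like in a BiFPN-style neck, the sketch below uses the fast normalized fusion from the original BiFPN design: each input feature gets a scalar weight, the weights are clipped to be non-negative, and they are normalized to sum to roughly one. This is not the authors' ABiFPN. The function name `fast_normalized_fusion`, the flat lists standing in for feature maps, and the `eps` default are assumptions made for illustration only; a real neck would apply this to tensors of matching shape and learn the weights during training.

```python
from typing import List, Sequence


def fast_normalized_fusion(
    features: Sequence[Sequence[float]],
    raw_weights: Sequence[float],
    eps: float = 1e-4,
) -> List[float]:
    """Fuse same-length feature vectors using normalized, non-negative weights.

    Hypothetical helper: plain Python lists stand in for feature maps that
    have already been resized to a common resolution.
    """
    if len(features) != len(raw_weights):
        raise ValueError("need exactly one weight per input feature")
    if not features:
        raise ValueError("need at least one input feature")

    length = len(features[0])
    if any(len(f) != length for f in features):
        raise ValueError("all features must have the same length")

    # ReLU keeps every weight non-negative, so normalization is well defined.
    weights = [max(0.0, w) for w in raw_weights]
    # eps guards against division by zero when every weight is clipped to 0.
    total = sum(weights) + eps

    fused = [0.0] * length
    for w, feat in zip(weights, features):
        share = w / total
        for i, value in enumerate(feat):
            fused[i] += share * value
    return fused


if __name__ == "__main__":
    shallow = [1.0, 2.0]  # stand-in for a high-resolution (small-target) level
    deep = [3.0, 4.0]     # stand-in for a low-resolution, more semantic level
    # A larger weight on the shallow level pulls the output toward it.
    print(fast_normalized_fusion([shallow, deep], raw_weights=[3.0, 1.0]))
```

Giving the shallow level a larger weight pulls the fused output toward the high-resolution features, which is one plausible reading of the abstract's emphasis on scales related to small targets.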

Suggested Citation

  • Ju Liang & Fan Wang & Jia Chen & Hai-Yan Huang & Zu-Fan Dou, 2025. "Aerial small target detection algorithm based on cross-scale separated attention," PLOS ONE, Public Library of Science, vol. 20(11), pages 1-26, November.
  • Handle: RePEc:plo:pone00:0337318
    DOI: 10.1371/journal.pone.0337318

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0337318
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0337318&type=printable
    Download Restriction: no



    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Xuhui Liu & Chi Feng & Shuran Zi & Zhengkun Qin & Qinghe Guan, 2025. "M-ReDet: A mamba-based method for remote sensing ship object detection and fine-grained recognition," PLOS ONE, Public Library of Science, vol. 20(8), pages 1-22, August.
