Author
Listed:
- Anchuan Wang
(School of Software, Xinjiang University, Urumqi 830091, China)
- Ling Qin
(School of Automotive Engineering, Wuhan University of Technology, Wuhan 430070, China)
- Qing Huang
(College of Continuing Education, Wuhan Institute of Technology, Wuhan 430074, China)
- Qun Zou
(China Ship Development and Design Center, Wuhan 430060, China)
Abstract
Reliable perception of small floating objects is critical for the management of aquatic environments and supports key applications, including the autonomous navigation of Unmanned Surface Vehicles (USVs), waterborne debris monitoring, and Search and Rescue (SAR) operations. However, accurate detection in dynamic water-surface environments remains a significant challenge, as targets are frequently obscured by high-frequency wave clutter, and feature distributions are destabilized by covariate shifts caused by illumination. To address these limitations, this study proposes YOLO11-MCN, a real-time detection framework that integrates two architectural components specifically designed for water-surface monitoring. The Multi-Scale Contextual Attention (MSCA) module distinguishes target signatures from background noise by aggregating contextual information across heterogeneous receptive fields, thereby suppressing false positives generated by waves. The Channel Normalization Attention Mechanism (CNAM) addresses illumination instability through feature statistic calibration based on Group Normalization, effectively mitigating covariate shifts induced by extreme lighting variations. Furthermore, these components are complemented by a high-resolution P2 detection head, which recovers the geometric details of small-scale targets typically lost during downsampling. Extensive experiments conducted on a dataset of 5812 images demonstrate that YOLO11-MCN achieves an mAP@0.5 of 92.7%, outperforming the YOLO11n baseline by 5.9 percentage points. Robustness evaluations confirm that MSCA and CNAM significantly reduce missed detections under severe wave clutter and backlighting conditions. With a recall of 90.5%, an inference speed of 94 FPS on desktop hardware, and a compact footprint of 3.89M parameters and 14.8 GFLOPs, the proposed framework offers a robust and efficient solution for intelligent water-surface surveillance systems within the single-class detection paradigm evaluated in this study, with strong potential for edge-device deployment following platform-specific optimization.
Suggested Citation
Anchuan Wang & Ling Qin & Qing Huang & Qun Zou, 2026.
"Real-Time Small Floating Object Detection from Dynamic Water Surfaces Using YOLO11-MCN for Sustainable Aquatic Monitoring,"
Sustainability, MDPI, vol. 18(10), pages 1-33, May.
Handle:
RePEc:gam:jsusta:v:18:y:2026:i:10:p:5083-:d:1945663
Download full text from publisher
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jsusta:v:18:y:2026:i:10:p:5083-:d:1945663. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.