Author
Listed:
- Xinyu Zhang
- Sawut Mamat
- Xiaohuang Liu
- Jiufen Liu
- Run Liu
- Guangjie Wu
- Ping Zhu
- Hongyu Li
- Min Ma
- Xiaotong Liu
Abstract
Accurate fruit detection is a key component of precision agriculture applications such as crop yield estimation, orchard management, and intelligent harvesting. In scenarios where immature fruits exhibit visual similarity to the background or where significant varietal differences exist, traditional models often lack sufficient generalization ability, resulting in reduced detection accuracy and unstable predictions. To address this problem, this paper proposes a fruit detection model, MSRRT-DETR, which achieves a balance of high accuracy, real-time performance, and strong generalization capability. To improve detection accuracy and robustness in complex orchard environments, MSRRT-DETR introduces three major enhancements to the RT-DETR framework: a Multi-Scale Convolutional Attention Module (MSBlock) to enhance feature representation at different scales; a Spatial and Channel Synergistic Attention Module (SCSA) to improve object focus and discriminative capability; and a Re-parameterized Feature Pyramid Network (RepGFPN) to achieve efficient multi-scale feature fusion. Experimental results show that MSRRT-DETR achieves a mAP50 of 87.3% on the self-constructed TSApple dataset, outperforming mainstream lightweight models YOLOv8, YOLO11, and YOLO12 by 2.0–7.9 percentage points, exceeding two-stage detectors including Faster R-CNN, Mask R-CNN, and Cascade R-CNN by 5.1–8.6 percentage points, and surpassing the RT-DETR series by 1.1–2.6 percentage points. With an inference speed of 30.2 FPS, comparable to the YOLO series, MSRRT-DETR achieves an excellent balance between accuracy and real-time performance. In addition, MSRRT-DETR demonstrates outstanding cross-domain generalization capability on four public datasets including MinneApple, validating its stable applicability across diverse scenarios and fruit varieties. MSRRT-DETR combines high recognition accuracy, fast inference, and strong cross-domain generalization, fully meeting the requirements of fruit detection in complex agricultural scenarios. The model provides robust technical support for intelligent monitoring and automated orchard management in precision agriculture, and holds significant practical value and broad potential for application.
Suggested Citation
Xinyu Zhang & Sawut Mamat & Xiaohuang Liu & Jiufen Liu & Run Liu & Guangjie Wu & Ping Zhu & Hongyu Li & Min Ma & Xiaotong Liu, 2026.
"MSRRT-DETR: A high-precision apple detection method with strong cross-domain generalization capability in complex orchard scenes,"
PLOS ONE, Public Library of Science, vol. 21(3), pages 1-22, March.
Handle:
RePEc:plo:pone00:0342854
DOI: 10.1371/journal.pone.0342854
Download full text from publisher
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0342854. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.