Author
Listed:
- Shihao Li
- Qiao Meng
- Xin Liu
- Zhijie Wang
- Siyuan Kong
- Bingyu Li
Abstract
In mixed traffic flow scenarios, multiple types of traffic participants share the same roadway. Large disparities in target scale, complex background interference, dense occlusion, and high class heterogeneity make object detection difficult. Existing CNN-based detectors are constrained by the fixed receptive fields of convolution operations, and they commonly suffer from imbalance between positive and negative samples and from inadequate representation of small objects, which further limits their performance in mixed traffic detection. To address these issues, we propose MTF-NET, a detection network with full-field perception. First, a combination of CNN and MetaFormer serves as the backbone for feature extraction to strengthen contextual modeling. Second, to mitigate the dual-dimensional information loss and small-target representation bottlenecks inherent in pyramid structures, we introduce a Hierarchical Implicit-Explicit Pyramid structure together with a Multi-Kernel Dilation Fusion Network that counteracts the information degradation caused by pooling operations. Finally, the Dynamic Dual Detection Heads use a dual-branch design that supports end-to-end deployment and alleviates the limitations imposed by non-maximum suppression (NMS), and a hybrid strategy combining Exponential Adaptive Loss with Focaler-DIoU addresses the imbalance between positive and negative samples across multiple classes. Experimental results show that MTF-NET improves mAP50 by 5.1% on the VisDrone2019 dataset, surpassing current state-of-the-art methods, and by 4.2% and 13.4% on the UA-DETRAC-G2 and HazyDet datasets, respectively. These findings validate the robustness and generalization capability of the network, which offers an effective solution for object detection in complex mixed traffic flow scenarios.
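The abstract names Focaler-DIoU as the box-regression part of the loss but does not give its formula. As a rough illustration only, the sketch below combines the standard DIoU loss (1 - IoU plus the squared center distance over the squared diagonal of the enclosing box) with the linear IoU remapping from the published Focaler-IoU formulation. The function names, the (x1, y1, x2, y2) box layout, and the thresholds d and u are assumptions made for this example; the paper's actual implementation, and its Exponential Adaptive Loss, may differ and are not reproduced here.

```python
def iou_and_diou_loss(box_a, box_b):
    """Return (IoU, DIoU loss) for two boxes given as (x1, y1, x2, y2)."""
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b

    # Intersection and union areas.
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h
    union = (ax2 - ax1) * (ay2 - ay1) + (bx2 - bx1) * (by2 - by1) - inter
    iou = inter / union if union > 0 else 0.0

    # Squared distance between the two box centers.
    rho2 = ((ax1 + ax2) / 2 - (bx1 + bx2) / 2) ** 2 \
         + ((ay1 + ay2) / 2 - (by1 + by2) / 2) ** 2

    # Squared diagonal of the smallest box enclosing both.
    cw = max(ax2, bx2) - min(ax1, bx1)
    ch = max(ay2, by2) - min(ay1, by1)
    c2 = cw ** 2 + ch ** 2

    penalty = rho2 / c2 if c2 > 0 else 0.0
    return iou, 1.0 - iou + penalty


def focaler_iou(iou, d=0.0, u=0.95):
    """Linearly remap IoU from [d, u] to [0, 1], clamping outside the interval.

    d and u are illustrative thresholds (assumed), with d < u.
    """
    return min(1.0, max(0.0, (iou - d) / (u - d)))


def focaler_diou_loss(box_a, box_b, d=0.0, u=0.95):
    """DIoU loss with the IoU term replaced by its Focaler remapping."""
    iou, diou = iou_and_diou_loss(box_a, box_b)
    return diou + iou - focaler_iou(iou, d, u)
```

With these assumed defaults, two identical boxes give a loss of zero, while disjoint boxes are penalized by both the missing overlap and the normalized distance between their centers.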
Suggested Citation
Shihao Li & Qiao Meng & Xin Liu & Zhijie Wang & Siyuan Kong & Bingyu Li, 2026.
"MTF-NET: A mixed traffic flow multi-target detection network based on full-field perception and adaptive optimization,"
PLOS ONE, Public Library of Science, vol. 21(3), pages 1-24, March.
Handle:
RePEc:plo:pone00:0344151
DOI: 10.1371/journal.pone.0344151