# MTF-NET: A mixed traffic flow multi-target detection network based on full-field perception and adaptive optimization

**Authors:** Shihao Li, Qiao Meng, Xin Liu, Zhijie Wang, Siyuan Kong, Bingyu Li

PMC · DOI: 10.1371/journal.pone.0344151 · PLOS One · 2026-03-16

## TL;DR

MTF-NET is a new object detection network designed to improve performance in complex mixed traffic scenarios by combining advanced feature extraction and adaptive optimization techniques.

## Contribution

The paper introduces MTF-NET, a novel detection network with full-field perception and adaptive optimization for mixed traffic flow.

## Key findings

- MTF-NET achieves a 5.1% improvement in mAP50 on the VisDrone2019 dataset.
- The network shows 4.2% and 13.4% improvements on the UA-DETRAC-G2 and HazyDet datasets, respectively.
- The proposed methods enhance robustness and generalization in mixed traffic detection.

## Abstract

In mixed traffic flow scenarios, multiple types of traffic participants coexist on the same roadway, posing severe challenges for object detection algorithms due to significant disparities in target scales, complex background interference, dense occlusions, and the high heterogeneity of classes. Existing CNN-based detectors are constrained by the fixed receptive fields inherent in convolution operations and are generally plagued by imbalances between positive and negative samples as well as inadequate representations of small objects, further limiting their performance in mixed traffic detection tasks. To address these issues, we propose the MTF-NET detection network, which is endowed with full-field perceptual capabilities. First, a combination of CNN and MetaFormer is employed as the backbone for feature extraction to enhance contextual modeling. Second, to mitigate the inherent dual-dimensional information loss and small-target representation bottlenecks associated with pyramid structures, we introduce a Hierarchical Implicit-Explicit Pyramid structure alongside a Multi-Kernel Dilation Fusion Network designed to counteract the information degradation brought about by pooling operations. Finally, the Dynamic Dual Detection Heads utilize a dual-branch design that facilitates end-to-end deployment while alleviating the limitations imposed by non-maximum suppression (NMS), and a hybrid strategy integrating Exponential Adaptive Loss with Focaler-DIoU is developed to address the imbalance between positive and negative samples across multiple classes. Experimental results demonstrate that MTF-NET achieves a 5.1% improvement in mAP50 on the VisDrone2019 dataset, surpassing current state-of-the-art methods, and further yields enhancements of 4.2% and 13.4% on the UA-DETRAC-G2 and HazyDet datasets, respectively. These findings effectively validate the robustness and generalization capabilities of our network, providing a potent solution for object detection in complex mixed traffic flow scenarios.

## Full-text entities

- **Diseases:** HFFU (MESH:D000069337), NMS (MESH:C580335), HIEP (MESH:C538104)
- **Chemicals:** Focaler (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]
- **Cell lines:** YOLOv11n — Homo sapiens (Human), Induced pluripotent stem cell (CVCL_VM32)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12991249/full.md

## Figures

10 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12991249/full.md

## References

46 references — full list in the complete paper: https://tomesphere.com/paper/PMC12991249/full.md

---
Source: https://tomesphere.com/paper/PMC12991249