MHAF-YOLO: Multi-Branch Heterogeneous Auxiliary Fusion YOLO for accurate object detection
Zhiqiang Yang, Qiu Guan, Zhongwen Yu, Xinli Xu, Haixia, Long, Sheng Lian, Haigen Hu, Ying Tang

TL;DR
MHAF-YOLO introduces a multi-branch fusion framework with advanced feature integration modules and kernel selection mechanisms, significantly improving object detection accuracy across scale variations.
Contribution
This work presents a novel multi-branch fusion architecture with specialized modules and kernel selection strategies, enhancing feature fusion and detection performance in YOLO-based models.
Findings
Improved detection accuracy on benchmark datasets.
Enhanced feature fusion with multi-scale and multi-level modules.
Better small object detection through optimized receptive fields.
Abstract
Due to the effective multi-scale feature fusion capabilities of the Path Aggregation FPN (PAFPN), it has become a widely adopted component in YOLO-based detectors. However, PAFPN struggles to integrate high-level semantic cues with low-level spatial details, limiting its performance in real-world applications, especially with significant scale variations. In this paper, we propose MHAF-YOLO, a novel detection framework featuring a versatile neck design called the Multi-Branch Auxiliary FPN (MAFPN), which consists of two key modules: the Superficial Assisted Fusion (SAF) and Advanced Assisted Fusion (AAF). The SAF bridges the backbone and the neck by fusing shallow features, effectively transferring crucial low-level spatial information with high fidelity. Meanwhile, the AAF integrates multi-scale feature information at deeper neck layers, delivering richer gradient information to the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Brain Tumor Detection and Classification
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Bottom-up Path Augmentation · PAFPN · Convolution · 1x1 Convolution · Feature Pyramid Network
