TL;DR
SFFNet introduces a dual-domain edge enhancement and multi-scale feature fusion approach to improve UAV image object detection, effectively handling complex backgrounds and scale variations.
Contribution
The paper proposes a novel synergistic feature fusion network with dual-domain edge enhancement and adaptive detectors for UAV image object detection.
Findings
Achieves 36.8 AP on VisDrone dataset
Achieves 20.6 AP on UAVDT dataset
Lightweight models balance accuracy and efficiency
Abstract
Object detection in unmanned aerial vehicle (UAV) images remains a highly challenging task, primarily caused by the complexity of background noise and the imbalance of target scales. Traditional methods easily struggle to effectively separate objects from intricate backgrounds and fail to fully leverage the rich multi-scale information contained within images. To address these issues, we have developed a synergistic feature fusion network (SFFNet) with dual-domain edge enhancement specifically tailored for object detection in UAV images. Firstly, the multi-scale dynamic dual-domain coupling (MDDC) module is designed. This component introduces a dual-driven edge extraction architecture that operates in both the frequency and spatial domains, enabling effective decoupling of multi-scale object edges from background noise. Secondly, to further enhance the representation capability of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
