Cross-modal Offset-guided Dynamic Alignment and Fusion for Weakly Aligned UAV Object Detection
Liu Zongzhen, Luo Hui, Wang Zhixing, Wei Yuxing, Zuo Haorui, Zhang Jianlin

TL;DR
This paper introduces CoDAF, a unified framework for weakly aligned UAV object detection that jointly addresses spatial misalignment and modality conflict through novel alignment and fusion modules, significantly improving detection accuracy.
Contribution
The paper proposes a novel unified framework with offset-guided semantic alignment and dynamic attention-guided fusion modules for robust multimodal UAV object detection under weak alignment conditions.
Findings
Achieves 78.6% mAP on DroneVehicle dataset.
Effectively aligns features despite spatial misalignment.
Improves detection robustness in multimodal UAV scenarios.
Abstract
Unmanned aerial vehicle (UAV) object detection plays a vital role in applications such as environmental monitoring and urban security. To improve robustness, recent studies have explored multimodal detection by fusing visible (RGB) and infrared (IR) imagery. However, due to UAV platform motion and asynchronous imaging, spatial misalignment frequently occurs between modalities, leading to weak alignment. This introduces two major challenges: semantic inconsistency at corresponding spatial locations and modality conflict during feature fusion. Existing methods often address these issues in isolation, limiting their effectiveness. In this paper, we propose Cross-modal Offset-guided Dynamic Alignment and Fusion (CoDAF), a unified framework that jointly tackles both challenges in weakly aligned UAV-based object detection. CoDAF comprises two novel modules: the Offset-guided Semantic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · UAV Applications and Optimization · Video Surveillance and Tracking Methods
