Coarse-to-Fine Hierarchical Alignment for UAV-based Human Detection using Diffusion Models
Wenda Li, Meng Wu, Liangzhao Chen, Sungmin Eum, Heesung Kwon, Qing Qu

TL;DR
This paper presents a diffusion-based hierarchical framework to adapt synthetic UAV images for human detection, significantly reducing the domain gap and improving detection accuracy in real-world scenarios.
Contribution
The proposed CFHA framework introduces a three-stage diffusion process to effectively align synthetic data with real-world UAV images, enhancing detection performance.
Findings
Achieves up to +14.1 mAP50 improvement on Semantic-Drone benchmark.
Effectively narrows domain gap between synthetic and real images.
Demonstrates the importance of hierarchical alignment through ablation studies.
Abstract
Training object detectors demands extensive, task-specific annotations, yet this requirement becomes impractical in UAV-based human detection due to constantly shifting target distributions and the scarcity of labeled images. As a remedy, synthetic simulators are adopted to generate annotated data, with a low annotation cost. However, the domain gap between synthetic and real images hinders the model from being effectively applied to the target domain. Accordingly, we introduce Coarse-to-Fine Hierarchical Alignment (CFHA), a three-stage diffusion-based framework designed to transform synthetic data for UAV-based human detection, narrowing the domain gap while preserving the original synthetic labels. CFHA explicitly decouples global style and local content domain discrepancies and bridges those gaps using three modules: (1) Global Style Transfer -- a diffusion model aligns color,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Video Surveillance and Tracking Methods · UAV Applications and Optimization
