Boosting Domain Generalized and Adaptive Detection with Diffusion Models: Fitness, Generalization, and Transferability
Boyong He, Yuxiang Ji, Zhuoyue Tan, Liaoni Wu

TL;DR
This paper introduces a diffusion model-based framework for domain-generalized and adaptive detection, significantly reducing inference time and improving robustness across diverse domains by leveraging intermediate features and auxiliary branches.
Contribution
It proposes a novel method that extracts intermediate features from diffusion models, constructs an object-centered auxiliary branch, and aligns detectors for improved cross-domain detection performance.
Findings
Reduces inference time by 75%.
Achieves state-of-the-art results on multiple benchmarks.
Maintains advantages in large domain shifts and low-data scenarios.
Abstract
Detectors often suffer from performance drop due to domain gap between training and testing data. Recent methods explore diffusion models applied to domain generalization (DG) and adaptation (DA) tasks, but still struggle with large inference costs and have not yet fully leveraged the capabilities of diffusion models. We propose to tackle these problems by extracting intermediate features from a single-step diffusion process, improving feature collection and fusion to reduce inference time by 75% while enhancing performance on source domains (i.e., Fitness). Then, we construct an object-centered auxiliary branch by applying box-masked images with class prompts to extract robust and domain-invariant features that focus on object. We also apply consistency loss to align the auxiliary and ordinary branch, balancing fitness and generalization while preventing overfitting and improving…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Face recognition and analysis · Advanced Neural Network Applications
MethodsDiffusion · Focus
