Align and Distill: Unifying and Improving Domain Adaptive Object Detection
Justin Kay, Timm Haucke, Suzanne Stathatos, Siqi Deng, Erik Young, Pietro Perona, Sara Beery, Grant Van Horn

TL;DR
This paper introduces a unified benchmarking framework, a new dataset, and a state-of-the-art method for domain adaptive object detection, addressing previous evaluation issues and enabling more reliable progress in the field.
Contribution
The paper presents ALDI, a comprehensive benchmarking and implementation framework, a new diverse dataset CFC-DAOD, and ALDI++, a method achieving new state-of-the-art results in DAOD.
Findings
ALDI++ outperforms previous methods by large margins on multiple benchmarks.
The framework enables fair, transparent, and reproducible comparisons of DAOD methods.
The new dataset provides diverse real-world data for robust evaluation.
Abstract
Object detectors often perform poorly on data that differs from their training set. Domain adaptive object detection (DAOD) methods have recently demonstrated strong results on addressing this challenge. Unfortunately, we identify systemic benchmarking pitfalls that call past results into question and hamper further progress: (a) Overestimation of performance due to underpowered baselines, (b) Inconsistent implementation practices preventing transparent comparisons of methods, and (c) Lack of generality due to outdated backbones and lack of diversity in benchmarks. We address these problems by introducing: (1) A unified benchmarking and implementation framework, Align and Distill (ALDI), enabling comparison of DAOD methods and supporting future development, (2) A fair and modern training and evaluation protocol for DAOD that addresses benchmarking pitfalls, (3) A new DAOD benchmark…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning
MethodsAlign and Distill ++
