Align and Distill: Unifying and Improving Domain Adaptive Object Detection

Justin Kay; Timm Haucke; Suzanne Stathatos; Siqi Deng; Erik Young; Pietro Perona; Sara Beery; Grant Van Horn

arXiv:2403.12029 · cs.CV · June 25, 2025 · 3 cites

TL;DR

This paper introduces a unified benchmarking framework, a new dataset, and a state-of-the-art method for domain adaptive object detection (DAOD), addressing pitfalls in earlier evaluations and enabling more reliable progress in the field.

Contribution

The paper presents ALDI, a unified benchmarking and implementation framework; CFC-DAOD, a new and diverse dataset; and ALDI++, a method that achieves new state-of-the-art results in domain adaptive object detection (DAOD).

Findings

01. ALDI++ outperforms previous methods by large margins on multiple benchmarks.

02. The framework enables fair, transparent, and reproducible comparisons of DAOD methods.

03. The new dataset provides diverse real-world data for robust evaluation.

Abstract

Object detectors often perform poorly on data that differs from their training set. Domain adaptive object detection (DAOD) methods have recently demonstrated strong results on addressing this challenge. Unfortunately, we identify systemic benchmarking pitfalls that call past results into question and hamper further progress: (a) Overestimation of performance due to underpowered baselines, (b) Inconsistent implementation practices preventing transparent comparisons of methods, and (c) Lack of generality due to outdated backbones and lack of diversity in benchmarks. We address these problems by introducing: (1) A unified benchmarking and implementation framework, Align and Distill (ALDI), enabling comparison of DAOD methods and supporting future development, (2) A fair and modern training and evaluation protocol for DAOD that addresses benchmarking pitfalls, (3) A new DAOD benchmark…

Taxonomy

Topics: Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning

Methods: Align and Distill++ (ALDI++)