Detect, Augment, Compose, and Adapt: Four Steps for Unsupervised Domain Adaptation in Object Detection
Mohamed L. Mekhalfi, Davide Boscaini, Fabio Poiesi

TL;DR
This paper introduces a four-step unsupervised domain adaptation method for object detection that uses self-supervision, pseudo-labeling, augmentation, and composition to improve detection performance across different domains.
Contribution
The paper presents a novel four-step framework for unsupervised domain adaptation in object detection, leveraging self-supervision and image composition techniques.
Findings
Achieves state-of-the-art results on cross-camera, weather, and synthetic-to-real scenarios.
Improves mAP by over 2% compared to previous methods.
Demonstrates robustness across diverse domain shifts.
Abstract
Unsupervised domain adaptation (UDA) plays a crucial role in object detection when adapting a source-trained detector to a target domain without annotated data. In this paper, we propose a novel and effective four-step UDA approach that leverages self-supervision and trains source and target data concurrently. We harness self-supervised learning to mitigate the lack of ground truth in the target domain. Our method consists of the following steps: (1) identify the region with the highest-confidence set of detections in each target image, which serve as our pseudo-labels; (2) crop the identified region and generate a collection of its augmented versions; (3) combine these latter into a composite image; (4) adapt the network to the target domain using the composed image. Through extensive experiments under cross-camera, cross-weather, and synthetic-to-real scenarios, our approach achieves…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Multimodal Machine Learning Applications
