Robust Object Detection via Instance-Level Temporal Cycle Confusion
Xin Wang, Thomas E. Huang, Benlin Liu, Fisher Yu, Xiaolong Wang,, Joseph E. Gonzalez, Trevor Darrell

TL;DR
This paper introduces a novel self-supervised task called instance-level temporal cycle confusion (CycConf) to enhance object detector robustness against domain shifts, achieving state-of-the-art results on several benchmarks.
Contribution
The paper proposes CycConf, a new self-supervised task that improves out-of-domain object detection by encouraging invariant feature learning across video frames.
Findings
Consistent out-of-domain performance improvements with CycConf.
State-of-the-art results on unsupervised domain adaptation benchmarks.
Effective robustness enhancement on large-scale video datasets.
Abstract
Building reliable object detectors that are robust to domain shifts, such as various changes in context, viewpoint, and object appearances, is critical for real-world applications. In this work, we study the effectiveness of auxiliary self-supervised tasks to improve the out-of-distribution generalization of object detectors. Inspired by the principle of maximum entropy, we introduce a novel self-supervised task, instance-level temporal cycle confusion (CycConf), which operates on the region features of the object detectors. For each object, the task is to find the most different object proposals in the adjacent frame in a video and then cycle back to itself for self-supervision. CycConf encourages the object detector to explore invariant structures across instances under various motions, which leads to improved model robustness in unseen domains at test time. We observe consistent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Video Surveillance and Tracking Methods
