Towards Single-Source Domain Generalized Object Detection via Causal Visual Prompts
Chen Li, Huiying Xu, Changxin Gao, Zeyu Wang, Yun Liu, Xinzhong Zhu

TL;DR
This paper introduces Cauvis, a novel method using causal visual prompts and dual-branch adapters to improve single-source domain generalized object detection, significantly reducing reliance on spurious features and enhancing robustness.
Contribution
The paper proposes Cauvis, a new approach combining visual prompts and causal disentanglement for better domain generalization in object detection from a single source.
Findings
Achieves 15.9-31.4% performance gains over existing methods.
Enhances robustness in complex interference environments.
Effectively disentangles causal and spurious features.
Abstract
Single-source Domain Generalized Object Detection (SDGOD), as a cutting-edge research topic in computer vision, aims to enhance model generalization capability in unseen target domains through single-source domain training. Current mainstream approaches attempt to mitigate domain discrepancies via data augmentation techniques. However, due to domain shift and limited domain-specific knowledge, models tend to fall into the pitfall of spurious correlations. This manifests as the model's over-reliance on simplistic classification features (e.g., color) rather than essential domain-invariant representations like object contours. To address this critical challenge, we propose the Cauvis (Causal Visual Prompts) method. First, we introduce a Cross-Attention Prompts module that mitigates bias from spurious features by integrating visual prompts with cross-attention. To address the inadequate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Face recognition and analysis
