Towards Single-Source Domain Generalized Object Detection via Causal Visual Prompts

Chen Li; Huiying Xu; Changxin Gao; Zeyu Wang; Yun Liu; Xinzhong Zhu

arXiv:2510.19487·cs.CV·October 23, 2025

Towards Single-Source Domain Generalized Object Detection via Causal Visual Prompts

Chen Li, Huiying Xu, Changxin Gao, Zeyu Wang, Yun Liu, Xinzhong Zhu

PDF

Open Access

TL;DR

This paper introduces Cauvis, a novel method using causal visual prompts and dual-branch adapters to improve single-source domain generalized object detection, significantly reducing reliance on spurious features and enhancing robustness.

Contribution

The paper proposes Cauvis, a new approach combining visual prompts and causal disentanglement for better domain generalization in object detection from a single source.

Findings

01

Achieves 15.9-31.4% performance gains over existing methods.

02

Enhances robustness in complex interference environments.

03

Effectively disentangles causal and spurious features.

Abstract

Single-source Domain Generalized Object Detection (SDGOD), as a cutting-edge research topic in computer vision, aims to enhance model generalization capability in unseen target domains through single-source domain training. Current mainstream approaches attempt to mitigate domain discrepancies via data augmentation techniques. However, due to domain shift and limited domain-specific knowledge, models tend to fall into the pitfall of spurious correlations. This manifests as the model's over-reliance on simplistic classification features (e.g., color) rather than essential domain-invariant representations like object contours. To address this critical challenge, we propose the Cauvis (Causal Visual Prompts) method. First, we introduce a Cross-Attention Prompts module that mitigates bias from spurious features by integrating visual prompts with cross-attention. To address the inadequate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Face recognition and analysis