TL;DR
Bridge introduces a basis-driven causal inference framework for object detection that reduces confounders' effects, improving domain generalization across diverse datasets and models.
Contribution
It proposes a novel causal inference method using low-rank bases for front-door adjustment, enhancing domain generalization in object detection.
Findings
Outperforms previous state-of-the-art methods on multiple datasets.
Effective across both discriminative and generative Vision Foundation Models.
Demonstrates robustness in diverse real-world domain shifts.
Abstract
Detectors often suffer from degraded performance, primarily due to the distributional gap between the source and target domains. This issue is especially evident in single-source domains with limited data, as models tend to rely on confounders (e.g., illumination, co-occurrence, and style) from the source domain, leading to spurious correlations that hinder generalization. To this end, this paper proposes a novel Basis-driven framework for domain generalization, namely \textbf{\textit{Bridge}}, that incorporates causal inference into object detection. By learning the low-rank bases for front-door adjustment, \textbf{\textit{Bridge}} blocks confounders' effects to mitigate spurious correlations, while simultaneously refining representations by filtering redundant and task-irrelevant components. \textbf{\textit{Bridge}} can be seamlessly integrated with both discriminative (e.g.,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
