The Finer the Better: Towards Granular-aware Open-set Domain Generalization
Yunyun Wang, Zheng Duan, Xinyue Liao, Ke-Jia Chen, Songcan Chen

TL;DR
This paper introduces SeeCLIP, a semantic-enhanced framework for open-set domain generalization that improves fine-grained discrimination and unknown detection by leveraging semantic tokens, contrastive learning, and pseudo-unknown synthesis.
Contribution
The paper proposes a novel semantic-aware prompt enhancement, duplex contrastive learning, and semantic-guided diffusion to improve open-set domain generalization with fine-grained semantic understanding.
Findings
Achieves 3% higher accuracy over state-of-the-art methods.
Attains 5% higher H-score in benchmarks.
Demonstrates effectiveness across five diverse datasets.
Abstract
Open-Set Domain Generalization (OSDG) tackles the realistic scenario where deployed models encounter both domain shifts and novel object categories. Despite impressive progress with vision-language models like CLIP, existing methods still fall into the dilemma between structural risk of known-classes and open-space risk from unknown-classes, and easily suffers from over-confidence, especially when distinguishing ``hard unknowns" that share fine-grained visual similarities with known classes. To this end, we propose a Semantic-enhanced CLIP (SeeCLIP) framework that explicitly addresses this dilemma through fine-grained semantic enhancement. In SeeCLIP, we propose a semantic-aware prompt enhancement module to decompose images into discriminative semantic tokens, enabling nuanced vision-language alignment beyond coarse category labels. To position unknown prompts effectively, we introduce…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Face recognition and analysis
