PracticalDG: Perturbation Distillation on Vision-Language Models for Hybrid Domain Generalization
Zining Chen, Weiqiu Wang, Zhicheng Zhao, Fei Su, Aidong Men, Hongying, Meng

TL;DR
PracticalDG introduces a perturbation distillation approach to transfer knowledge from vision-language models to lightweight vision models, enhancing hybrid domain generalization robustness, especially under data scarcity and diverse domain splits.
Contribution
The paper proposes SCI-PD, a novel perturbation distillation method from vision-language models to lightweight models, and introduces a new benchmark and metric for robust hybrid domain generalization evaluation.
Findings
SCI-PD outperforms state-of-the-art methods on multiple datasets.
The approach improves robustness under data scarcity.
New benchmark and metric reveal existing methods' performance decay.
Abstract
Domain Generalization (DG) aims to resolve distribution shifts between source and target domains, and current DG methods are default to the setting that data from source and target domains share identical categories. Nevertheless, there exists unseen classes from target domains in practical scenarios. To address this issue, Open Set Domain Generalization (OSDG) has emerged and several methods have been exclusively proposed. However, most existing methods adopt complex architectures with slight improvement compared with DG methods. Recently, vision-language models (VLMs) have been introduced in DG following the fine-tuning paradigm, but consume huge training overhead with large vision models. Therefore, in this paper, we innovate to transfer knowledge from VLMs to lightweight vision models and improve the robustness by introducing Perturbation Distillation (PD) from three perspectives,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Cancer-related molecular mechanisms research
MethodsSparse Evolutionary Training
