PracticalDG: Perturbation Distillation on Vision-Language Models for   Hybrid Domain Generalization

Zining Chen; Weiqiu Wang; Zhicheng Zhao; Fei Su; Aidong Men; Hongying; Meng

arXiv:2404.09011·cs.CV·April 16, 2024·1 cites

PracticalDG: Perturbation Distillation on Vision-Language Models for Hybrid Domain Generalization

Zining Chen, Weiqiu Wang, Zhicheng Zhao, Fei Su, Aidong Men, Hongying, Meng

PDF

Open Access 1 Repo

TL;DR

PracticalDG introduces a perturbation distillation approach to transfer knowledge from vision-language models to lightweight vision models, enhancing hybrid domain generalization robustness, especially under data scarcity and diverse domain splits.

Contribution

The paper proposes SCI-PD, a novel perturbation distillation method from vision-language models to lightweight models, and introduces a new benchmark and metric for robust hybrid domain generalization evaluation.

Findings

01

SCI-PD outperforms state-of-the-art methods on multiple datasets.

02

The approach improves robustness under data scarcity.

03

New benchmark and metric reveal existing methods' performance decay.

Abstract

Domain Generalization (DG) aims to resolve distribution shifts between source and target domains, and current DG methods are default to the setting that data from source and target domains share identical categories. Nevertheless, there exists unseen classes from target domains in practical scenarios. To address this issue, Open Set Domain Generalization (OSDG) has emerged and several methods have been exclusively proposed. However, most existing methods adopt complex architectures with slight improvement compared with DG methods. Recently, vision-language models (VLMs) have been introduced in DG following the fine-tuning paradigm, but consume huge training overhead with large vision models. Therefore, in this paper, we innovate to transfer knowledge from VLMs to lightweight vision models and improve the robustness by introducing Perturbation Distillation (PD) from three perspectives,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

znchen666/hdg
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Cancer-related molecular mechanisms research

MethodsSparse Evolutionary Training