ZeroDiff++: Substantial Unseen Visual-semantic Correlation in Zero-shot Learning
Zihan Ye, Shreyank N Gowda, Kaile Du, Weijian Luo, Ling Shao

TL;DR
ZeroDiff++ introduces a diffusion-based generative framework for zero-shot learning that strengthens visual-semantic correlations and mitigates spurious ones, improving recognition of unseen classes even when training data are limited.
Contribution
The paper proposes ZeroDiff++, a novel diffusion-based approach with test-time adaptation and generation techniques to improve zero-shot learning performance.
Findings
Significant performance improvements on three ZSL benchmarks.
Effective reduction of spurious visual-semantic correlations.
Robustness to scarce training data.
Abstract
Zero-shot Learning (ZSL) enables classifiers to recognize classes unseen during training, commonly via generative two-stage methods: (1) learn visual-semantic correlations from seen classes; (2) synthesize unseen-class features from semantics to train classifiers. In this paper, we identify spurious visual-semantic correlations in existing generative ZSL, worsened by scarce seen-class samples, and introduce two metrics to quantify spuriousness for seen and unseen classes. Furthermore, we point out a more critical bottleneck: existing non-adaptive, fully noised generators produce features disconnected from real test samples, which also leads to spurious correlations. To enhance the visual-semantic correlations on both seen and unseen classes, we propose ZeroDiff++, a diffusion-based generative framework. In training, ZeroDiff++ uses (i) diffusion augmentation to produce diverse noised…
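The generic two-stage generative recipe named in the abstract can be illustrated with a toy sketch. This is not ZeroDiff++ and uses no diffusion model: a ridge regression stands in for the generator, a nearest-centroid rule stands in for the final classifier, and all data are synthetic. Every name here (make_toy_data, fit_generator, synthesize, run_demo) is hypothetical and chosen only for illustration.

```python
import numpy as np


def make_toy_data(rng, n_classes=10, sem_dim=4, feat_dim=8, per_class=40):
    # Assumption: visual features are a linear function of class semantics plus noise.
    w_true = rng.normal(size=(sem_dim, feat_dim))
    semantics = rng.normal(size=(n_classes, sem_dim))
    labels = np.repeat(np.arange(n_classes), per_class)
    feats = semantics[labels] @ w_true
    feats = feats + 0.1 * rng.normal(size=feats.shape)
    return semantics, feats, labels


def fit_generator(semantics, feats, labels, ridge=1e-3):
    # Stage 1: learn a semantics-to-feature map from seen-class samples only.
    a = semantics[labels]
    gram = a.T @ a + ridge * np.eye(a.shape[1])
    return np.linalg.solve(gram, a.T @ feats)


def synthesize(w, semantics, class_ids, per_class, rng, noise=0.1):
    # Stage 2a: generate features for unseen classes from their semantics.
    labels = np.repeat(class_ids, per_class)
    feats = semantics[labels] @ w
    feats = feats + noise * rng.normal(size=feats.shape)
    return feats, labels


def nearest_centroid_predict(train_x, train_y, test_x):
    # Stage 2b: a simple classifier trained on the synthesized features.
    classes = np.unique(train_y)
    centroids = np.stack([train_x[train_y == c].mean(axis=0) for c in classes])
    dists = ((test_x[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=-1)
    return classes[dists.argmin(axis=1)]


def run_demo(seed=0):
    rng = np.random.default_rng(seed)
    semantics, feats, labels = make_toy_data(rng)
    unseen = np.array([8, 9])  # classes held out from stage 1
    unseen_mask = np.isin(labels, unseen)
    w = fit_generator(semantics, feats[~unseen_mask], labels[~unseen_mask])
    syn_x, syn_y = synthesize(w, semantics, unseen, per_class=50, rng=rng)
    pred = nearest_centroid_predict(syn_x, syn_y, feats[unseen_mask])
    return float((pred == labels[unseen_mask]).mean())


if __name__ == "__main__":
    print("unseen-class accuracy on toy data:", run_demo())
```

In this toy setting the generator sees no unseen-class samples, so the synthesized features are only as good as the correlation learned from seen classes. Shrinking the number of seen classes below sem_dim makes the regression underdetermined, which is a crude analogue of the scarce-data spuriousness the abstract describes.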
Taxonomy
Topics: Domain Adaptation and Few-Shot Learning · Generative Adversarial Networks and Image Synthesis · Face recognition and analysis
