RevCD -- Reversed Conditional Diffusion for Generalized Zero-Shot Learning
William Heyden, Habib Ullah, M. Salman Siddiqui, Fadi Al Machot

TL;DR
RevCD introduces a reversed diffusion-based model for generalized zero-shot learning, generating semantic features from visual data to improve recognition of unseen categories, leveraging diffusion models' capabilities for complex data modeling.
Contribution
The paper proposes a novel reversed conditional diffusion approach for GZSL, enabling semantic feature generation from visual inputs and improving knowledge transfer efficiency.
Findings
Enhanced zero-shot learning performance across multiple datasets
Effective semantic feature synthesis from visual data
Demonstrated advantages of diffusion models in GZSL
Abstract
In Generalized Zero-Shot Learning (GZSL), we aim to recognize both seen and unseen categories using a model trained only on seen categories. In computer vision, this translates into a classification problem, where knowledge from seen categories is transferred to unseen categories by exploiting the relationships between visual features and available semantic information, such as text corpora or manual annotations. However, learning this joint distribution is costly and requires one-to-one training with corresponding semantic information. We present a reversed conditional Diffusion-based model (RevCD) that mitigates this issue by generating semantic features synthesized from visual inputs by leveraging Diffusion models' conditional mechanisms. Our RevCD model consists of a cross Hadamard-Addition embedding of a sinusoidal time schedule and a multi-headed visual transformer for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGeophysical Methods and Applications · Microwave Imaging and Scattering Analysis · Domain Adaptation and Few-Shot Learning
MethodsDiffusion
