Two-Level Adversarial Visual-Semantic Coupling for Generalized Zero-shot Learning
Shivam Chandhok, Vineeth N Balasubramanian

TL;DR
This paper introduces a two-level joint maximization approach that enhances generative zero-shot learning by improving feature quality and knowledge transfer between visual and semantic domains, leading to better recognition performance.
Contribution
It proposes a novel two-level training framework that captures data modes more effectively and unifies feature synthesis with representation learning for zero-shot classification.
Findings
Outperforms state-of-the-art on four benchmark datasets
Improves feature diversity and data distribution modeling
Enhances cross-modal knowledge transfer
Abstract
The performance of generative zero-shot methods mainly depends on the quality of generated features and how well the model facilitates knowledge transfer between visual and semantic domains. The quality of generated features is a direct consequence of the ability of the model to capture the several modes of the underlying data distribution. To address these issues, we propose a new two-level joint maximization idea to augment the generative network with an inference network during training which helps our model capture the several modes of the data and generate features that better represent the underlying data distribution. This provides strong cross-modal interaction for effective transfer of knowledge between visual and semantic domains. Furthermore, existing methods train the zero-shot classifier either on generate synthetic image features or latent embeddings produced by leveraging…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
