Generative Dual Adversarial Network for Generalized Zero-shot Learning
He Huang, Changhu Wang, Philip S. Yu, Chang-Dong Wang

TL;DR
This paper introduces a unified dual adversarial network framework for generalized zero-shot learning, effectively generating visual features from class embeddings and improving classification accuracy on both seen and unseen classes.
Contribution
The paper proposes a novel model combining visual-semantic mapping, metric learning, and dual adversarial training with cyclic consistency, unifying multiple approaches for zero-shot learning.
Findings
Outperforms existing models on unseen class classification
Maintains high accuracy on seen classes
Uses cyclic consistency and dual adversarial loss for training
Abstract
This paper studies the problem of generalized zero-shot learning which requires the model to train on image-label pairs from some seen classes and test on the task of classifying new images from both seen and unseen classes. Most previous models try to learn a fixed one-directional mapping between visual and semantic space, while some recently proposed generative methods try to generate image features for unseen classes so that the zero-shot learning problem becomes a traditional fully-supervised classification problem. In this paper, we propose a novel model that provides a unified framework for three different approaches: visual-> semantic mapping, semantic->visual mapping, and metric learning. Specifically, our proposed model consists of a feature generator that can generate various visual features given class embeddings as input, a regressor that maps each visual feature back to its…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · COVID-19 diagnosis using AI · Multimodal Machine Learning Applications
