Two-Level Adversarial Visual-Semantic Coupling for Generalized Zero-shot   Learning

Shivam Chandhok; Vineeth N Balasubramanian

arXiv:2007.07757·cs.CV·December 1, 2020

Two-Level Adversarial Visual-Semantic Coupling for Generalized Zero-shot Learning

Shivam Chandhok, Vineeth N Balasubramanian

PDF

TL;DR

This paper introduces a two-level joint maximization approach that enhances generative zero-shot learning by improving feature quality and knowledge transfer between visual and semantic domains, leading to better recognition performance.

Contribution

It proposes a novel two-level training framework that captures data modes more effectively and unifies feature synthesis with representation learning for zero-shot classification.

Findings

01

Outperforms state-of-the-art on four benchmark datasets

02

Improves feature diversity and data distribution modeling

03

Enhances cross-modal knowledge transfer

Abstract

The performance of generative zero-shot methods mainly depends on the quality of generated features and how well the model facilitates knowledge transfer between visual and semantic domains. The quality of generated features is a direct consequence of the ability of the model to capture the several modes of the underlying data distribution. To address these issues, we propose a new two-level joint maximization idea to augment the generative network with an inference network during training which helps our model capture the several modes of the data and generate features that better represent the underlying data distribution. This provides strong cross-modal interaction for effective transfer of knowledge between visual and semantic domains. Furthermore, existing methods train the zero-shot classifier either on generate synthetic image features or latent embeddings produced by leveraging…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.