Class Knowledge Overlay to Visual Feature Learning for Zero-Shot Image   Classification

Cheng Xie; Ting Zeng; Hongxin Xiang; Keqin Li; Yun Yang; Qing Liu

arXiv:2102.13322·cs.CV·March 1, 2021·1 cites

Class Knowledge Overlay to Visual Feature Learning for Zero-Shot Image Classification

Cheng Xie, Ting Zeng, Hongxin Xiang, Keqin Li, Yun Yang, Qing Liu

PDF

Open Access

TL;DR

This paper introduces GAN-CST, a novel zero-shot learning method that leverages class knowledge overlay, semi-supervised learning, and triplet loss to improve the generation of semantically consistent visual features for unseen categories.

Contribution

The paper proposes GAN-CST, which enhances zero-shot image classification by integrating class knowledge overlay and semi-supervised learning to generate more accurate visual features.

Findings

01

Outperforms state-of-the-art methods on benchmark datasets.

02

Generates more semantically consistent visual features.

03

Improves accuracy in zero-shot image classification.

Abstract

New categories can be discovered by transforming semantic features into synthesized visual features without corresponding training samples in zero-shot image classification. Although significant progress has been made in generating high-quality synthesized visual features using generative adversarial networks, guaranteeing semantic consistency between the semantic features and visual features remains very challenging. In this paper, we propose a novel zero-shot learning approach, GAN-CST, based on class knowledge to visual feature learning to tackle the problem. The approach consists of three parts, class knowledge overlay, semi-supervised learning and triplet loss. It applies class knowledge overlay (CKO) to obtain knowledge not only from the corresponding class but also from other classes that have the knowledge overlay. It ensures that the knowledge-to-visual learning process has…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Advanced Image and Video Retrieval Techniques