SEGA: Semantic Guided Attention on Visual Prototype for Few-Shot   Learning

Fengyuan Yang; Ruiping Wang; Xilin Chen

arXiv:2111.04316·cs.CV·November 9, 2021·5 cites

SEGA: Semantic Guided Attention on Visual Prototype for Few-Shot Learning

Fengyuan Yang, Ruiping Wang, Xilin Chen

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces SEGA, a semantic-guided attention mechanism that leverages semantic prior knowledge to improve visual feature discrimination in few-shot learning, achieving superior results on multiple benchmarks.

Contribution

The paper proposes a novel semantic-guided attention method that enhances visual prototypes using semantic knowledge, significantly improving few-shot learning performance.

Findings

01

Outperforms state-of-the-art on miniImageNet and tieredImageNet

02

Effectively utilizes semantic prior to focus on discriminative features

03

Enhances visual prototypes with semantic-guided attention

Abstract

Teaching machines to recognize a new category based on few training samples especially only one remains challenging owing to the incomprehensive understanding of the novel category caused by the lack of data. However, human can learn new classes quickly even given few samples since human can tell what discriminative features should be focused on about each category based on both the visual and semantic prior knowledge. To better utilize those prior knowledge, we propose the SEmantic Guided Attention (SEGA) mechanism where the semantic knowledge is used to guide the visual perception in a top-down manner about what visual features should be paid attention to when distinguishing a category from the others. As a result, the embedding of the novel class even with few samples can be more discriminative. Concretely, a feature extractor is trained to embed few images of each novel class into a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

martayang/sega
pytorchOfficial

Videos

SEGA: Semantic Guided Attention on Visual Prototype for Few-Shot Learning· youtube

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Image Processing Techniques and Applications · Multimodal Machine Learning Applications

MethodsFeature Selection