SEGA: Semantic Guided Attention on Visual Prototype for Few-Shot Learning
Fengyuan Yang, Ruiping Wang, Xilin Chen

TL;DR
This paper introduces SEGA, a semantic-guided attention mechanism that leverages semantic prior knowledge to improve visual feature discrimination in few-shot learning, achieving superior results on multiple benchmarks.
Contribution
The paper proposes a novel semantic-guided attention method that enhances visual prototypes using semantic knowledge, significantly improving few-shot learning performance.
Findings
Outperforms state-of-the-art on miniImageNet and tieredImageNet
Effectively utilizes semantic prior to focus on discriminative features
Enhances visual prototypes with semantic-guided attention
Abstract
Teaching machines to recognize a new category based on few training samples especially only one remains challenging owing to the incomprehensive understanding of the novel category caused by the lack of data. However, human can learn new classes quickly even given few samples since human can tell what discriminative features should be focused on about each category based on both the visual and semantic prior knowledge. To better utilize those prior knowledge, we propose the SEmantic Guided Attention (SEGA) mechanism where the semantic knowledge is used to guide the visual perception in a top-down manner about what visual features should be paid attention to when distinguishing a category from the others. As a result, the embedding of the novel class even with few samples can be more discriminative. Concretely, a feature extractor is trained to embed few images of each novel class into a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
SEGA: Semantic Guided Attention on Visual Prototype for Few-Shot Learning· youtube
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Image Processing Techniques and Applications · Multimodal Machine Learning Applications
MethodsFeature Selection
