LPN: Language-guided Prototypical Network for few-shot classification

Kaihui Cheng; Chule Yang; Xiao Liu; Naiyang Guan; Zhiyuan Wang

arXiv:2307.01515·cs.CV·September 13, 2024

LPN: Language-guided Prototypical Network for few-shot classification

Kaihui Cheng, Chule Yang, Xiao Liu, Naiyang Guan, Zhiyuan Wang

PDF

Open Access

TL;DR

This paper introduces LPN, a novel few-shot classification model that integrates vision and language modalities using a pre-trained text encoder and a language-guided decoder to improve classification accuracy.

Contribution

LPN leverages multi-modality information with a dual-branch architecture, combining visual and language features for enhanced few-shot learning performance.

Findings

01

LPN outperforms state-of-the-art methods on benchmark datasets.

02

The integration of language features improves classification robustness.

03

The proposed method effectively aligns visual and textual features for better prototypes.

Abstract

Few-shot classification aims to adapt to new tasks with limited labeled examples. To fully use the accessible data, recent methods explore suitable measures for the similarity between the query and support images and better high-dimensional features with meta-training and pre-training strategies. However, the potential of multi-modality information has barely been explored, which may bring promising improvement for few-shot classification. In this paper, we propose a Language-guided Prototypical Network (LPN) for few-shot classification, which leverages the complementarity of vision and language modalities via two parallel branches to improve the classifier. Concretely, to introduce language modality with limited samples in the visual task, we leverage a pre-trained text encoder to extract class-level text features directly from class names while processing images with a conventional…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Cancer-related molecular mechanisms research

MethodsALIGN