Learning to Focus: Cascaded Feature Matching Network for Few-shot Image   Recognition

Mengting Chen; Xinggang Wang; Heng Luo; Yifeng Geng; Wenyu; Liu

arXiv:2101.05018·cs.CV·January 14, 2021

Learning to Focus: Cascaded Feature Matching Network for Few-shot Image Recognition

Mengting Chen, Xinggang Wang, Heng Luo, Yifeng Geng, Wenyu, Liu

PDF

TL;DR

The paper introduces a Cascaded Feature Matching Network (CFMN) that improves few-shot image recognition by focusing on highly correlated features and incorporating multi-scale information, leading to better generalization and performance.

Contribution

It proposes a novel feature matching block for deep metric learning in few-shot recognition, enhancing adaptability and multi-scale feature integration.

Findings

01

Effective on miniImageNet and Omniglot datasets.

02

Outperforms existing methods in few-shot learning tasks.

03

First to study multi-label few-shot learning on COCO.

Abstract

Deep networks can learn to accurately recognize objects of a category by training on a large number of annotated images. However, a meta-learning challenge known as a low-shot image recognition task comes when only a few images with annotations are available for learning a recognition model for one category. The objects in testing/query and training/support images are likely to be different in size, location, style, and so on. Our method, called Cascaded Feature Matching Network (CFMN), is proposed to solve this problem. We train the meta-learner to learn a more fine-grained and adaptive deep distance metric by focusing more on the features that have high correlations between compared images by the feature matching block which can align associated features together and naturally ignore those non-discriminative features. By applying the proposed feature matching block in different layers…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.