Few-shot Learning with Noisy Labels
Kevin J Liang, Samrudhdhi B. Rangrej, Vladan Petrovic, Tal Hassner

TL;DR
This paper addresses the challenge of label noise in few-shot learning by proposing feature aggregation enhancements and a Transformer-based model, TraNFS, which effectively identifies and mitigates mislabeled samples, improving robustness.
Contribution
It introduces a new Transformer model, TraNFS, and improved feature aggregation methods to enhance robustness of FSL methods against noisy labels.
Findings
TraNFS performs comparably to leading methods on clean data.
TraNFS significantly outperforms others with noisy support sets.
Proposed methods improve robustness to label noise in FSL.
Abstract
Few-shot learning (FSL) methods typically assume clean support sets with accurately labeled samples when training on novel classes. This assumption can often be unrealistic: support sets, no matter how small, can still include mislabeled samples. Robustness to label noise is therefore essential for FSL methods to be practical, but this problem surprisingly remains largely unexplored. To address mislabeled samples in FSL settings, we make several technical contributions. (1) We offer simple, yet effective, feature aggregation methods, improving the prototypes used by ProtoNet, a popular FSL technique. (2) We describe a novel Transformer model for Noisy Few-Shot Learning (TraNFS). TraNFS leverages a transformer's attention mechanism to weigh mislabeled versus correct samples. (3) Finally, we extensively test these methods on noisy versions of MiniImageNet and TieredImageNet. Our results…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Absolute Position Encodings · Byte Pair Encoding · Position-Wise Feed-Forward Layer · Dense Connections · Layer Normalization · Residual Connection · Softmax
