TL;DR
This paper introduces a novel few-shot classification method that pseudo-labels a large dataset to hallucinate novel classes, then fine-tunes the model with distillation, outperforming existing methods on multiple benchmarks.
Contribution
The proposed approach uniquely combines pseudo-labeling and distillation to improve few-shot learning, addressing overfitting and underfitting issues effectively.
Findings
Outperforms state-of-the-art on four benchmarks
Effectively leverages large datasets for few-shot learning
Balances overfitting and underfitting in model training
Abstract
Few-shot classification requires adapting knowledge learned from a large annotated base dataset to recognize novel unseen classes, each represented by few labeled examples. In such a scenario, pretraining a network with high capacity on the large dataset and then finetuning it on the few examples causes severe overfitting. At the same time, training a simple linear classifier on top of "frozen" features learned from the large labeled dataset fails to adapt the model to the properties of the novel classes, effectively inducing underfitting. In this paper we propose an alternative approach to both of these two popular strategies. First, our method pseudo-labels the entire large dataset using the linear classifier trained on the novel classes. This effectively "hallucinates" the novel classes in the large dataset, despite the novel categories not being present in the base database (novel…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
MethodsBalanced Selection
