Self-supervised Knowledge Distillation for Few-shot Learning

Jathushan Rajasegaran; Salman Khan; Munawar Hayat; Fahad Shahbaz Khan,; Mubarak Shah

arXiv:2006.09785·cs.CV·August 5, 2020·70 cites

Self-supervised Knowledge Distillation for Few-shot Learning

Jathushan Rajasegaran, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan,, Mubarak Shah

PDF

Open Access 2 Repos

TL;DR

This paper introduces a two-stage self-supervised knowledge distillation method to enhance feature representations for few-shot learning, outperforming existing approaches through entropy maximization and distillation.

Contribution

It proposes a novel two-stage training process combining entropy maximization and student-teacher distillation to improve few-shot learning performance.

Findings

01

Self-supervised pretraining outperforms current state-of-the-art methods.

02

Two-stage process yields significant improvements in few-shot tasks.

03

Code availability facilitates reproducibility and further research.

Abstract

Real-world contains an overwhelmingly large number of object classes, learning all of which at once is infeasible. Few shot learning is a promising learning paradigm due to its ability to learn out of order distributions quickly with only a few samples. Recent works [7, 41] show that simply learning a good feature embedding can outperform more sophisticated meta-learning and metric learning algorithms for few-shot learning. In this paper, we propose a simple approach to improve the representation capacity of deep neural networks for few-shot learning tasks. We follow a two-stage learning process: First, we train a neural network to maximize the entropy of the feature embedding, thus creating an optimal output manifold using a self-supervised auxiliary loss. In the second stage, we minimize the entropy on feature embedding by bringing self-supervised twins together, while constraining…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications