Meta-Transfer Learning for Few-Shot Learning
Qianru Sun, Yaoyao Liu, Tat-Seng Chua, Bernt Schiele

TL;DR
This paper introduces meta-transfer learning (MTL), a novel approach that adapts deep neural networks for few-shot learning by learning task-specific scaling and shifting functions, significantly improving performance on benchmarks.
Contribution
The paper proposes a new meta-transfer learning method that enables deep neural networks to effectively learn from few samples by learning to adapt weights through scaling and shifting functions.
Findings
MTL achieves top performance on miniImageNet and Fewshot-CIFAR100 benchmarks.
The hard task meta-batch scheme improves learning efficiency.
Both the transfer mechanism and meta-batch scheme contribute to high accuracy.
Abstract
Meta-learning has been proposed as a framework to address the challenging few-shot learning setting. The key idea is to leverage a large number of similar few-shot tasks in order to learn how to adapt a base-learner to a new task for which only a few labeled samples are available. As deep neural networks (DNNs) tend to overfit using a few samples only, meta-learning typically uses shallow neural networks (SNNs), thus limiting its effectiveness. In this paper we propose a novel few-shot learning method called meta-transfer learning (MTL) which learns to adapt a deep NN for few shot learning tasks. Specifically, "meta" refers to training multiple tasks, and "transfer" is achieved by learning scaling and shifting functions of DNN weights for each task. In addition, we introduce the hard task (HT) meta-batch scheme as an effective learning curriculum for MTL. We conduct experiments using…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Advanced Neural Network Applications
