Rapid Learning or Feature Reuse? Towards Understanding the Effectiveness of MAML
Aniruddh Raghu, Maithra Raghu, Samy Bengio, Oriol Vinyals

TL;DR
This paper investigates why MAML is effective in few-shot learning, concluding that feature reuse is the main factor, leading to the development of the simpler ANIL algorithm that maintains performance with improved efficiency.
Contribution
The paper demonstrates that feature reuse, rather than rapid learning, drives MAML's success, introduces ANIL as a simplified alternative, and analyzes the roles of network components in meta-learning.
Findings
Feature reuse dominates MAML's effectiveness.
ANIL matches MAML's performance on benchmarks.
Removing the network head still retains high performance.
Abstract
An important research direction in machine learning has centered around developing meta-learning algorithms to tackle few-shot learning. An especially successful algorithm has been Model Agnostic Meta-Learning (MAML), a method that consists of two optimization loops, with the outer loop finding a meta-initialization, from which the inner loop can efficiently learn new tasks. Despite MAML's popularity, a fundamental open question remains -- is the effectiveness of MAML due to the meta-initialization being primed for rapid learning (large, efficient changes in the representations) or due to feature reuse, with the meta initialization already containing high quality features? We investigate this question, via ablation studies and analysis of the latent representations, finding that feature reuse is the dominant factor. This leads to the ANIL (Almost No Inner Loop) algorithm, a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Data Classification · Multimodal Machine Learning Applications
MethodsModel-Agnostic Meta-Learning
