Meta-Learning without Memorization
Mingzhang Yin, George Tucker, Mingyuan Zhou, Sergey Levine, Chelsea, Finn

TL;DR
This paper introduces a meta-regularization method based on information theory that enables meta-learning models to adapt effectively without requiring mutually-exclusive tasks, broadening applicability and improving performance.
Contribution
The authors propose a novel meta-regularization objective that allows meta-learning from non-mutually-exclusive tasks, overcoming a key limitation of prior meta-learning algorithms.
Findings
Outperforms standard meta-learning in practical, non-mutually-exclusive task settings
Applicable to both contextual and gradient-based meta-learning algorithms
Enables effective learning in domains where traditional meta-learning fails
Abstract
The ability to learn new concepts with small amounts of data is a critical aspect of intelligence that has proven challenging for deep learning methods. Meta-learning has emerged as a promising technique for leveraging data from previous tasks to enable efficient learning of new tasks. However, most meta-learning algorithms implicitly require that the meta-training tasks be mutually-exclusive, such that no single model can solve all of the tasks at once. For example, when creating tasks for few-shot image classification, prior work uses a per-task random assignment of image classes to N-way classification labels. If this is not done, the meta-learner can ignore the task training data and learn a single model that performs all of the meta-training tasks zero-shot, but does not adapt effectively to new image classes. This requirement means that the user must take great care in designing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Advanced Neural Network Applications
