Meta-Regularization by Enforcing Mutual-Exclusiveness
Edwin Pan, Pankaj Rajak, Shubham Shrivastava

TL;DR
This paper introduces a regularization technique for meta-learning that reduces task overfitting by maximizing the distance between task-specific summaries, improving accuracy on few-shot classification tasks.
Contribution
It proposes a novel regularization method that enforces mutual exclusiveness among tasks, addressing overfitting and memorization issues in meta-learning models.
Findings
Achieves approximately 36% accuracy improvement on Omniglot dataset.
Effective for both black-box and optimization-based meta-learning models.
Enhances generalization to unseen tasks in few-shot learning scenarios.
Abstract
Meta-learning models have two objectives. First, they need to be able to make predictions over a range of task distributions while utilizing only a small amount of training data. Second, they also need to adapt to new novel unseen tasks at meta-test time again by using only a small amount of training data from that task. It is the second objective where meta-learning models fail for non-mutually exclusive tasks due to task overfitting. Given that guaranteeing mutually exclusive tasks is often difficult, there is a significant need for regularization methods that can help reduce the impact of task-memorization in meta-learning. For example, in the case of N-way, K-shot classification problems, tasks becomes non-mutually exclusive when the labels associated with each task is fixed. Under this design, the model will simply memorize the class labels of all the training tasks, and thus will…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Data Classification · Adversarial Robustness in Machine Learning
