Improving Meta-Learning Generalization with Activation-Based Early-Stopping
Simon Guiroy, Christopher Pal, Gon\c{c}alo Mordido, Sarath Chandar

TL;DR
This paper introduces Activation Based Early-Stopping (ABE), a novel method for meta-learning that uses activation statistics from unlabelled target support examples to determine optimal stopping points, improving generalization in few-shot transfer tasks.
Contribution
The paper proposes a new activation-based early-stopping method that does not rely on labeled validation data, enhancing meta-learning performance across diverse transfer learning scenarios.
Findings
Activation statistics effectively estimate target generalization.
ABE improves early-stopping accuracy across datasets.
Method is applicable to various meta-learning algorithms.
Abstract
Meta-Learning algorithms for few-shot learning aim to train neural networks capable of generalizing to novel tasks using only a few examples. Early-stopping is critical for performance, halting model training when it reaches optimal generalization to the new task distribution. Early-stopping mechanisms in Meta-Learning typically rely on measuring the model performance on labeled examples from a meta-validation set drawn from the training (source) dataset. This is problematic in few-shot transfer learning settings, where the meta-test set comes from a different target dataset (OOD) and can potentially have a large distributional shift with the meta-validation set. In this work, we propose Activation Based Early-stopping (ABE), an alternative to using validation-based early-stopping for meta-learning. Specifically, we analyze the evolution, during meta-training, of the neural activations…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and ELM · Machine Learning and Data Classification
