TL;DR
This paper introduces a context-agnostic meta-learning method that improves generalization to unseen contexts by incorporating a context-adversarial component, leading to better performance in few-shot classification and energy prediction tasks.
Contribution
It proposes a novel context-adversarial meta-learning approach that enhances the robustness of initializations against irrelevant contextual information.
Findings
Improves Omniglot classification accuracy by 4.3% on unseen alphabets.
Reduces energy expenditure prediction error by 30%.
Enhances meta-learning performance across multiple algorithms and tasks.
Abstract
Meta-learning approaches have addressed few-shot problems by finding initialisations suited for fine-tuning to target tasks. Often there are additional properties within training data (which we refer to as context), not relevant to the target task, which act as a distractor to meta-learning, particularly when the target task contains examples from a novel context not seen during training. We address this oversight by incorporating a context-adversarial component into the meta-learning process. This produces an initialisation for fine-tuning to target which is both context-agnostic and task-generalised. We evaluate our approach on three commonly used meta-learning algorithms and two problems. We demonstrate our context-agnostic meta-learning improves results in each case. First, we report on Omniglot few-shot character classification, using alphabets as context. An average improvement of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
