On the ERM Principle in Meta-Learning
Yannay Alon, Steve Hanneke, Shay Moran, Uri Shalit

TL;DR
This paper characterizes the behavior of meta-learning algorithms' error surfaces, revealing how the number of tasks and examples per task influence the ability to learn effectively, including cases where few examples suffice.
Contribution
It provides a theoretical analysis of the meta-ERM principle, establishing conditions under which a finite or increasing number of examples per task guarantees vanishing error.
Findings
Number of tasks must increase inversely with error for consistent learning.
A dichotomy exists where either examples per task grow inversely with error or remain finite.
Developed necessary and sufficient conditions for meta-learnability with limited examples.
Abstract
Classic supervised learning involves algorithms trained on labeled examples to produce a hypothesis aimed at performing well on unseen examples. Meta-learning extends this by training across tasks, with examples per task, producing a hypothesis class within some meta-class . This setting applies to many modern problems such as in-context learning, hypernetworks, and learning-to-learn. A common method for evaluating the performance of supervised learning algorithms is through their learning curve, which depicts the expected error as a function of the number of training examples. In meta-learning, the learning curve becomes a two-dimensional learning surface, which evaluates the expected error on unseen domains for varying values of (number of tasks) and (number of training examples). Our findings characterize the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Stream Mining Techniques · Machine Learning and Data Classification · Online Learning and Analytics
