A Distribution-Dependent Analysis of Meta-Learning
Mikhail Konobeev, Ilja Kuzborskij, Csaba Szepesv\'ari

TL;DR
This paper provides a distribution-dependent analysis of meta-learning in Gaussian linear regression, establishing lower bounds on transfer risk and proposing an EM-based method that achieves these bounds, unifying different meta-learning paradigms.
Contribution
It introduces distribution-dependent lower bounds on transfer risk and a weighted biased regularized regression method that matches these bounds, unifying parameter sharing and representation learning.
Findings
Weighted regression matches lower bounds in experiments.
EM algorithm effectively learns representations and attains bounds.
Meta-learning difficulty characterized in Gaussian setting.
Abstract
A key problem in the theory of meta-learning is to understand how the task distributions influence transfer risk, the expected error of a meta-learner on a new task drawn from the unknown task distribution. In this paper, focusing on fixed design linear regression with Gaussian noise and a Gaussian task (or parameter) distribution, we give distribution-dependent lower bounds on the transfer risk of any algorithm, while we also show that a novel, weighted version of the so-called biased regularized regression method is able to match these lower bounds up to a fixed constant factor. Notably, the weighting is derived from the covariance of the Gaussian task distribution. Altogether, our results provide a precise characterization of the difficulty of meta-learning in this Gaussian setting. While this problem setting may appear simple, we show that it is rich enough to unify the "parameter…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Data Classification · Gaussian Processes and Bayesian Inference
MethodsLinear Regression
