A Distribution-Dependent Analysis of Meta-Learning

Mikhail Konobeev; Ilja Kuzborskij; Csaba Szepesv\'ari

arXiv:2011.00344·stat.ML·June 15, 2021·1 cites

A Distribution-Dependent Analysis of Meta-Learning

Mikhail Konobeev, Ilja Kuzborskij, Csaba Szepesv\'ari

PDF

Open Access

TL;DR

This paper provides a distribution-dependent analysis of meta-learning in Gaussian linear regression, establishing lower bounds on transfer risk and proposing an EM-based method that achieves these bounds, unifying different meta-learning paradigms.

Contribution

It introduces distribution-dependent lower bounds on transfer risk and a weighted biased regularized regression method that matches these bounds, unifying parameter sharing and representation learning.

Findings

01

Weighted regression matches lower bounds in experiments.

02

EM algorithm effectively learns representations and attains bounds.

03

Meta-learning difficulty characterized in Gaussian setting.

Abstract

A key problem in the theory of meta-learning is to understand how the task distributions influence transfer risk, the expected error of a meta-learner on a new task drawn from the unknown task distribution. In this paper, focusing on fixed design linear regression with Gaussian noise and a Gaussian task (or parameter) distribution, we give distribution-dependent lower bounds on the transfer risk of any algorithm, while we also show that a novel, weighted version of the so-called biased regularized regression method is able to match these lower bounds up to a fixed constant factor. Notably, the weighting is derived from the covariance of the Gaussian task distribution. Altogether, our results provide a precise characterization of the difficulty of meta-learning in this Gaussian setting. While this problem setting may appear simple, we show that it is rich enough to unify the "parameter…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Data Classification · Gaussian Processes and Bayesian Inference

MethodsLinear Regression