First-order ANIL provably learns representations despite   overparametrization

O\u{g}uz Kaan Y\"uksel; Etienne Boursier; Nicolas Flammarion

arXiv:2303.01335·cs.LG·July 24, 2024·1 cites

First-order ANIL provably learns representations despite overparametrization

O\u{g}uz Kaan Y\"uksel, Etienne Boursier, Nicolas Flammarion

PDF

Open Access

TL;DR

This paper proves that first-order ANIL, a meta-learning method, can theoretically learn shared linear representations even with overparametrization, ensuring effective adaptation to new tasks.

Contribution

It provides the first theoretical proof that first-order ANIL learns shared representations under overparametrization in a linear setting.

Findings

01

First-order ANIL learns low-rank shared representations.

02

Overparametrization leads to asymptotically low-rank solutions.

03

Shared representations enable effective one-step adaptation.

Abstract

Due to its empirical success in few-shot classification and reinforcement learning, meta-learning has recently received significant interest. Meta-learning methods leverage data from previous tasks to learn a new task in a sample-efficient manner. In particular, model-agnostic methods look for initialization points from which gradient descent quickly adapts to any new task. Although it has been empirically suggested that such methods perform well by learning shared representations during pretraining, there is limited theoretical evidence of such behavior. More importantly, it has not been shown that these methods still learn a shared structure, despite architectural misspecifications. In this direction, this work shows, in the limit of an infinite number of tasks, that first-order ANIL with a linear two-layer network architecture successfully learns linear shared representations. This…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Data Classification · Machine Learning and ELM

MethodsTest