A New First-Order Meta-Learning Algorithm with Convergence Guarantees

El Mahdi Chayti; Martin Jaggi

arXiv:2409.03682·cs.LG·September 6, 2024·2 cites

A New First-Order Meta-Learning Algorithm with Convergence Guarantees

El Mahdi Chayti, Martin Jaggi

PDF

Open Access

TL;DR

This paper introduces a new first-order meta-learning algorithm based on MAML that guarantees convergence to a stationary point and addresses computational issues, supported by theoretical analysis and synthetic experiments.

Contribution

A novel first-order MAML variant with proven convergence guarantees and insights into the smoothness properties of the MAML objective.

Findings

01

Proposed algorithm converges to a stationary point.

02

MAML's smoothness constant depends on the gradient norm.

03

Validation through synthetic experiments supports theoretical claims.

Abstract

Learning new tasks by drawing on prior experience gathered from other (related) tasks is a core property of any intelligent system. Gradient-based meta-learning, especially MAML and its variants, has emerged as a viable solution to accomplish this goal. One problem MAML encounters is its computational and memory burdens needed to compute the meta-gradients. We propose a new first-order variant of MAML that we prove converges to a stationary point of the MAML objective, unlike other first-order variants. We also show that the MAML objective does not satisfy the smoothness assumption assumed in previous works; we show instead that its smoothness constant grows with the norm of the meta-gradient, which theoretically suggests the use of normalized or clipped-gradient methods compared to the plain gradient method used in previous works. We validate our theory on a synthetic experiment.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Sensor and Control Systems

MethodsModel-Agnostic Meta-Learning