Theoretical Convergence of Multi-Step Model-Agnostic Meta-Learning

Kaiyi Ji; Junjie Yang; Yingbin Liang

arXiv:2002.07836·cs.LG·July 14, 2020·32 cites

Theoretical Convergence of Multi-Step Model-Agnostic Meta-Learning

Kaiyi Ji, Junjie Yang, Yingbin Liang

PDF

Open Access 2 Repos

TL;DR

This paper provides the first theoretical convergence analysis for multi-step MAML, establishing convergence rates and complexity bounds for both resampling and finite-sum cases in nonconvex settings.

Contribution

It introduces a new theoretical framework that guarantees convergence of multi-step MAML in practical scenarios, addressing both resampling and finite-sum cases.

Findings

01

Inner stepsize should be inversely proportional to the number of inner steps for convergence.

02

Characterized convergence rate and complexity for multi-step MAML in nonconvex settings.

03

Developed novel techniques for nested meta gradient analysis.

Abstract

As a popular meta-learning approach, the model-agnostic meta-learning (MAML) algorithm has been widely used due to its simplicity and effectiveness. However, the convergence of the general multi-step MAML still remains unexplored. In this paper, we develop a new theoretical framework to provide such convergence guarantee for two types of objective functions that are of interest in practice: (a) resampling case (e.g., reinforcement learning), where loss functions take the form in expectation and new data are sampled as the algorithm runs; and (b) finite-sum case (e.g., supervised learning), where loss functions take the finite-sum form with given samples. For both cases, we characterize the convergence rate and the computational complexity to attain an $ϵ$ -accurate solution for multi-step MAML in the general nonconvex setting. In particular, our results suggest that an inner-stage…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Data Classification · Machine Learning and Algorithms

MethodsModel-Agnostic Meta-Learning