Convergence of Meta-Learning with Task-Specific Adaptation over Partial   Parameters

Kaiyi Ji; Jason D. Lee; Yingbin Liang; H. Vincent Poor

arXiv:2006.09486·cs.LG·October 26, 2020·25 cites

Convergence of Meta-Learning with Task-Specific Adaptation over Partial Parameters

Kaiyi Ji, Jason D. Lee, Yingbin Liang, H. Vincent Poor

PDF

Open Access 1 Video

TL;DR

This paper analyzes the convergence and efficiency of the ANIL meta-learning algorithm, showing how the geometric properties of the inner-loop loss affect its performance and providing theoretical and empirical validation.

Contribution

It provides the first theoretical convergence analysis of ANIL, revealing how inner-loop loss geometry impacts its convergence rate and efficiency compared to MAML.

Findings

01

ANIL converges faster with strongly-convex inner-loop loss as inner steps increase

02

ANIL's convergence slows with nonconvex inner-loop loss as inner steps increase

03

Theoretical analysis quantifies ANIL's improved efficiency over MAML

Abstract

Although model-agnostic meta-learning (MAML) is a very successful algorithm in meta-learning practice, it can have high computational cost because it updates all model parameters over both the inner loop of task-specific adaptation and the outer-loop of meta initialization training. A more efficient algorithm ANIL (which refers to almost no inner loop) was proposed recently by Raghu et al. 2019, which adapts only a small subset of parameters in the inner loop and thus has substantially less computational cost than MAML as demonstrated by extensive experiments. However, the theoretical convergence of ANIL has not been studied yet. In this paper, we characterize the convergence rate and the computational complexity for ANIL under two representative inner-loop loss geometries, i.e., strongly-convexity and nonconvexity. Our results show that such a geometric property can significantly…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Convergence of Meta-Learning with Task-Specific Adaptation over Partial Parameters· slideslive

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and ELM · Machine Learning and Data Classification

MethodsModel-Agnostic Meta-Learning