Convergence of Meta-Learning with Task-Specific Adaptation over Partial Parameters
Kaiyi Ji, Jason D. Lee, Yingbin Liang, H. Vincent Poor

TL;DR
This paper analyzes the convergence and efficiency of the ANIL meta-learning algorithm, showing how the geometric properties of the inner-loop loss affect its performance and providing theoretical and empirical validation.
Contribution
It provides the first theoretical convergence analysis of ANIL, revealing how inner-loop loss geometry impacts its convergence rate and efficiency compared to MAML.
Findings
ANIL converges faster with strongly-convex inner-loop loss as inner steps increase
ANIL's convergence slows with nonconvex inner-loop loss as inner steps increase
Theoretical analysis quantifies ANIL's improved efficiency over MAML
Abstract
Although model-agnostic meta-learning (MAML) is a very successful algorithm in meta-learning practice, it can have high computational cost because it updates all model parameters over both the inner loop of task-specific adaptation and the outer-loop of meta initialization training. A more efficient algorithm ANIL (which refers to almost no inner loop) was proposed recently by Raghu et al. 2019, which adapts only a small subset of parameters in the inner loop and thus has substantially less computational cost than MAML as demonstrated by extensive experiments. However, the theoretical convergence of ANIL has not been studied yet. In this paper, we characterize the convergence rate and the computational complexity for ANIL under two representative inner-loop loss geometries, i.e., strongly-convexity and nonconvexity. Our results show that such a geometric property can significantly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and ELM · Machine Learning and Data Classification
MethodsModel-Agnostic Meta-Learning
