Provable Guarantees for Gradient-Based Meta-Learning

Mikhail Khodak; Maria-Florina Balcan; Ameet Talwalkar

arXiv:1902.10644·cs.LG·May 17, 2019·34 cites

Provable Guarantees for Gradient-Based Meta-Learning

Mikhail Khodak, Maria-Florina Balcan, Ameet Talwalkar

PDF

Open Access 1 Repo

TL;DR

This paper introduces a meta-learning algorithm based on online convex optimization that offers strong theoretical guarantees, good sample efficiency, and scalability to deep learning, with empirical validation.

Contribution

It presents the first scalable meta-learning method with theoretical guarantees that improve with task similarity and matches lower bounds under natural assumptions.

Findings

01

The algorithm achieves sample efficiency in convex settings.

02

It generalizes better as task similarity increases.

03

Experimental results confirm theoretical predictions in convex and deep learning contexts.

Abstract

We study the problem of meta-learning through the lens of online convex optimization, developing a meta-algorithm bridging the gap between popular gradient-based meta-learning and classical regularization-based multi-task transfer methods. Our method is the first to simultaneously satisfy good sample efficiency guarantees in the convex setting, with generalization bounds that improve with task-similarity, while also being computationally scalable to modern deep learning architectures and the many-task setting. Despite its simplicity, the algorithm matches, up to a constant factor, a lower bound on the performance of any such parameter-transfer method under natural task similarity assumptions. We use experiments in both convex and deep learning settings to verify and demonstrate the applicability of our theory.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mkhodak/FMRL
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Sparse and Compressive Sensing Techniques · Multimodal Machine Learning Applications