Adaptive Gradient-Based Meta-Learning Methods

Mikhail Khodak; Maria-Florina Balcan; Ameet Talwalkar

arXiv:1906.02717·cs.LG·December 10, 2019·22 cites

Adaptive Gradient-Based Meta-Learning Methods

Mikhail Khodak, Maria-Florina Balcan, Ameet Talwalkar

PDF

Open Access 1 Repo

TL;DR

This paper develops a theoretical framework for adaptive gradient-based meta-learning that learns task similarity, improves transfer-risk bounds, and enhances performance in few-shot and federated learning scenarios.

Contribution

It introduces a unified theory for meta-learning that incorporates task similarity, leading to sharper bounds and improved algorithms for dynamic and structured task environments.

Findings

01

Sharper transfer-risk bounds in statistical learning-to-learn.

02

Improved meta-test-time performance in few-shot learning.

03

Enhanced algorithms for federated learning.

Abstract

We build a theoretical framework for designing and understanding practical meta-learning methods that integrates sophisticated formalizations of task-similarity with the extensive literature on online convex optimization and sequential prediction algorithms. Our approach enables the task-similarity to be learned adaptively, provides sharper transfer-risk bounds in the setting of statistical learning-to-learn, and leads to straightforward derivations of average-case regret bounds for efficient algorithms in settings where the task-environment changes dynamically or the tasks share a certain geometric structure. We use our theory to modify several popular meta-learning algorithms and improve their meta-test-time performance on standard problems in few-shot learning and federated learning.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mkhodak/ARUBA
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Sparse and Compressive Sensing Techniques · Machine Learning and ELM