Meta-Learning with Adaptive Hyperparameters

Sungyong Baik; Myungsub Choi; Janghoon Choi; Heewon Kim; Kyoung Mu Lee

arXiv:2011.00209·cs.LG·December 9, 2020·6 cites

Meta-Learning with Adaptive Hyperparameters

Sungyong Baik, Myungsub Choi, Janghoon Choi, Heewon Kim, Kyoung Mu Lee

PDF

Open Access 2 Repos 1 Video

TL;DR

This paper introduces ALFA, a meta-network that adaptively generates hyperparameters for inner-loop optimization in MAML, significantly improving fast adaptation and outperforming traditional MAML even from random initialization.

Contribution

The paper proposes ALFA, a novel method that adaptively generates hyperparameters during fast adaptation, enhancing MAML's effectiveness especially when test tasks differ from training tasks.

Findings

01

ALFA outperforms MAML in few-shot learning tasks.

02

Fast adaptation with ALFA can surpass MAML even from random initialization.

03

Adaptive hyperparameters are crucial for effective meta-learning.

Abstract

Despite its popularity, several recent works question the effectiveness of MAML when test tasks are different from training tasks, thus suggesting various task-conditioned methodology to improve the initialization. Instead of searching for better task-aware initialization, we focus on a complementary factor in MAML framework, inner-loop optimization (or fast adaptation). Consequently, we propose a new weight update rule that greatly enhances the fast adaptation process. Specifically, we introduce a small meta-network that can adaptively generate per-step hyperparameters: learning rate and weight decay coefficients. The experimental results validate that the Adaptive Learning of hyperparameters for Fast Adaptation (ALFA) is the equally important ingredient that was often neglected in the recent few-shot learning approaches. Surprisingly, fast adaptation from random initialization with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

Meta-Learning with Adaptive Hyperparameters· slideslive

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Data Classification · Machine Learning and ELM

MethodsWeight Decay · Model-Agnostic Meta-Learning