Online Learning of a Memory for Learning Rates

Franziska Meier; Daniel Kappler; Stefan Schaal

arXiv:1709.06709·cs.LG·March 28, 2018

Online Learning of a Memory for Learning Rates

Franziska Meier, Daniel Kappler, Stefan Schaal

PDF

1 Repo

TL;DR

This paper presents an online meta-learning algorithm that creates and updates a memory of optimal learning rates, enabling faster learning across tasks by predicting gradient scaling, applicable in various optimization scenarios.

Contribution

Introduces a computationally efficient online meta-learning method that learns and updates a memory of learning rates to accelerate task-specific learning.

Findings

01

Speeds up MNIST classification learning.

02

Improves learning control tasks in batch and online settings.

03

Can be combined with any gradient-based optimizer.

Abstract

The promise of learning to learn for robotics rests on the hope that by extracting some information about the learning process itself we can speed up subsequent similar learning tasks. Here, we introduce a computationally efficient online meta-learning algorithm that builds and optimizes a memory model of the optimal learning rate landscape from previously observed gradient behaviors. While performing task specific optimization, this memory of learning rates predicts how to scale currently observed gradients. After applying the gradient scaling our meta-learner updates its internal memory based on the observed effect its prediction had. Our meta-learner can be combined with any gradient-based optimizer, learns on the fly and can be transferred to new optimization tasks. In our evaluations we show that our meta-learning algorithm speeds up learning of MNIST classification and a variety…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

fmeier/online-meta-learning
tf

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings