Meta-Gradients in Non-Stationary Environments

Jelena Luketina; Sebastian Flennerhag; Yannick Schroecker; David Abel,; Tom Zahavy; Satinder Singh

arXiv:2209.06159·cs.LG·September 14, 2022

Meta-Gradients in Non-Stationary Environments

Jelena Luketina, Sebastian Flennerhag, Yannick Schroecker, David Abel,, Tom Zahavy, Satinder Singh

PDF

Open Access

TL;DR

This paper systematically studies meta-gradient methods in non-stationary reinforcement learning, showing that providing contextual information to meta-optimizers enhances adaptation and performance in changing environments.

Contribution

It introduces a framework for incorporating context features into meta-gradient methods and demonstrates their benefits for adaptation in non-stationary settings.

Findings

01

Adding context improves meta-gradient adaptation speed.

02

Contextual meta-gradients outperform baselines in non-stationary environments.

03

Without context, meta-gradients offer limited advantages.

Abstract

Meta-gradient methods (Xu et al., 2018; Zahavy et al., 2020) offer a promising solution to the problem of hyperparameter selection and adaptation in non-stationary reinforcement learning problems. However, the properties of meta-gradients in such environments have not been systematically studied. In this work, we bring new clarity to meta-gradients in non-stationary environments. Concretely, we ask: (i) how much information should be given to the learned optimizers, so as to enable faster adaptation and generalization over a lifetime, (ii) what meta-optimizer functions are learned in this process, and (iii) whether meta-gradient methods provide a bigger advantage in highly non-stationary environments. To study the effect of information provided to the meta-optimizer, as in recent works (Flennerhag et al., 2021; Almeida et al., 2021), we replace the tuned meta-parameters of fixed update…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Human Pose and Action Recognition · Reinforcement Learning in Robotics