Can Learned Optimization Make Reinforcement Learning Less Difficult?

Alexander David Goldie; Chris Lu; Matthew Thomas Jackson; Shimon; Whiteson; Jakob Nicolaus Foerster

arXiv:2407.07082·cs.LG·April 16, 2025

Can Learned Optimization Make Reinforcement Learning Less Difficult?

Alexander David Goldie, Chris Lu, Matthew Thomas Jackson, Shimon, Whiteson, Jakob Nicolaus Foerster

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces OPEN, a learned optimizer designed to address key challenges in reinforcement learning such as non-stationarity, exploration, and plasticity loss, demonstrating improved performance and generalization across various environments.

Contribution

The paper presents a novel meta-learned optimizer, OPEN, tailored to mitigate RL difficulties and capable of generalizing across different tasks and architectures.

Findings

01

OPEN outperforms traditional optimizers in tested environments.

02

OPEN generalizes well across diverse environments and agent architectures.

03

The method effectively incorporates stochasticity for exploration.

Abstract

While reinforcement learning (RL) holds great potential for decision making in the real world, it suffers from a number of unique difficulties which often need specific consideration. In particular: it is highly non-stationary; suffers from high degrees of plasticity loss; and requires exploration to prevent premature convergence to local optima and maximize return. In this paper, we consider whether learned optimization can help overcome these problems. Our method, Learned Optimization for Plasticity, Exploration and Non-stationarity (OPEN), meta-learns an update rule whose input features and output structure are informed by previously proposed solutions to these difficulties. We show that our parameterization is flexible enough to enable meta-learning in diverse learning contexts, including the ability to use stochasticity for exploration. Our experiments demonstrate that when…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

alexgoldie/rl-learned-optimization
jaxOfficial

Videos

Can Learned Optimization Make Reinforcement Learning Less Difficult?· slideslive

Taxonomy

TopicsReinforcement Learning in Robotics