Real-Time Recurrent Learning using Trace Units in Reinforcement Learning

Esraa Elelimy; Adam White; Michael Bowling; Martha White

arXiv:2409.01449·cs.LG·October 31, 2024

Real-Time Recurrent Learning using Trace Units in Reinforcement Learning

Esraa Elelimy, Adam White, Michael Bowling, Martha White

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces Recurrent Trace Units (RTUs), a lightweight modification of linear recurrent architectures, that significantly improve online reinforcement learning performance with efficient real-time recurrent learning.

Contribution

The paper proposes RTUs, a novel recurrent architecture that enhances RTRL efficiency and effectiveness in online reinforcement learning scenarios.

Findings

01

RTUs outperform other recurrent architectures in partially observable environments.

02

RTUs require less computation while achieving better performance.

03

RTUs demonstrate significant improvements over LRUs.

Abstract

Recurrent Neural Networks (RNNs) are used to learn representations in partially observable environments. For agents that learn online and continually interact with the environment, it is desirable to train RNNs with real-time recurrent learning (RTRL); unfortunately, RTRL is prohibitively expensive for standard RNNs. A promising direction is to use linear recurrent architectures (LRUs), where dense recurrent weights are replaced with a complex-valued diagonal, making RTRL efficient. In this work, we build on these insights to provide a lightweight but effective approach for training RNNs in online RL. We introduce Recurrent Trace Units (RTUs), a small modification on LRUs that we nonetheless find to have significant performance benefits over LRUs when trained with RTRL. We find RTUs significantly outperform other recurrent architectures across several partially observable environments…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

esraaelelimy/rtus
jaxOfficial

Videos

Real-Time Recurrent Learning using Trace Units in Reinforcement Learning· slideslive

Taxonomy

TopicsData Stream Mining Techniques · Reinforcement Learning in Robotics