Tempo Adaptation in Non-stationary Reinforcement Learning

Hyunin Lee; Yuhao Ding; Jongmin Lee; Ming Jin; Javad Lavaei; Somayeh; Sojoudi

arXiv:2309.14989·cs.LG·October 31, 2023

Tempo Adaptation in Non-stationary Reinforcement Learning

Hyunin Lee, Yuhao Ding, Jongmin Lee, Ming Jin, Javad Lavaei, Somayeh, Sojoudi

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper addresses the challenge of time synchronization in non-stationary reinforcement learning by proposing a framework that optimally schedules interaction times to improve policy performance in changing environments.

Contribution

It introduces the ProST framework that computes an optimal sequence of interaction times to balance training and environmental change, reducing regret in non-stationary RL.

Findings

01

ProST outperforms existing methods in high-dimensional non-stationary environments.

02

Theoretical analysis shows sublinear dynamic regret with ProST.

03

Optimal scheduling improves online returns in experiments.

Abstract

We first raise and tackle a ``time synchronization'' issue between the agent and the environment in non-stationary reinforcement learning (RL), a crucial factor hindering its real-world applications. In reality, environmental changes occur over wall-clock time ( $t$ ) rather than episode progress ( $k$ ), where wall-clock time signifies the actual elapsed time within the fixed duration $t \in [0, T]$ . In existing works, at episode $k$ , the agent rolls a trajectory and trains a policy before transitioning to episode $k + 1$ . In the context of the time-desynchronized environment, however, the agent at time $t_{k}$ allocates $Δ t$ for trajectory generation and training, subsequently moves to the next episode at $t_{k + 1} = t_{k} + Δ t$ . Despite a fixed total number of episodes ( $K$ ), the agent accumulates different trajectories influenced by the choice of interaction times…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hyunin-lee/TempoRL
pytorchOfficial

Videos

Tempo Adaptation in Non-stationary Reinforcement Learning· slideslive

Taxonomy

TopicsNeural Networks and Reservoir Computing · Reinforcement Learning in Robotics · Smart Grid Energy Management