Deep Reinforcement Learning amidst Lifelong Non-Stationarity

Annie Xie; James Harrison; Chelsea Finn

arXiv:2006.10701·cs.LG·June 19, 2020·25 cites

Deep Reinforcement Learning amidst Lifelong Non-Stationarity

Annie Xie, James Harrison, Chelsea Finn

PDF

Open Access

TL;DR

This paper introduces a novel off-policy reinforcement learning algorithm designed to handle lifelong non-stationarity by learning environment representations from past experiences, outperforming existing methods in dynamic settings.

Contribution

The authors formalize lifelong non-stationarity in RL and develop a latent variable-based off-policy algorithm that adapts to persistent environment changes.

Findings

01

Our method significantly outperforms non-adaptive approaches in non-stationary environments.

02

The approach effectively learns environment representations that capture ongoing changes.

03

Empirical results demonstrate robustness to continuous environment shifts.

Abstract

As humans, our goals and our environment are persistently changing throughout our lifetime based on our experiences, actions, and internal and external drives. In contrast, typical reinforcement learning problem set-ups consider decision processes that are stationary across episodes. Can we develop reinforcement learning algorithms that can cope with the persistent change in the former, more realistic problem settings? While on-policy algorithms such as policy gradients in principle can be extended to non-stationary settings, the same cannot be said for more efficient off-policy algorithms that replay past experiences when learning. In this work, we formalize this problem setting, and draw upon ideas from the online learning and probabilistic inference literature to derive an off-policy RL algorithm that can reason about and tackle such lifelong non-stationarity. Our method leverages…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Data Stream Mining Techniques · Mental Health Research Topics