Dynamics-Adaptive Continual Reinforcement Learning via Progressive   Contextualization

Tiantian Zhang; Zichuan Lin; Yuxing Wang; Deheng Ye; Qiang Fu; Wei; Yang; Xueqian Wang; Bin Liang; Bo Yuan; and Xiu Li

arXiv:2209.00347·cs.LG·May 25, 2023

Dynamics-Adaptive Continual Reinforcement Learning via Progressive Contextualization

Tiantian Zhang, Zichuan Lin, Yuxing Wang, Deheng Ye, Qiang Fu, Wei, Yang, Xueqian Wang, Bin Liang, Bo Yuan, and Xiu Li

PDF

Open Access

TL;DR

DaCoRL introduces a dynamics-adaptive continual reinforcement learning framework that incrementally clusters environment contexts and adaptively expands neural networks to improve stability and generalization in dynamic environments.

Contribution

It proposes a novel context inference method using Bayesian clustering and an expandable neural network architecture for continual RL in changing environments.

Findings

01

Outperforms existing methods in stability and overall performance.

02

Effectively classifies and adapts to new environment contexts.

03

Demonstrates superior generalization in robot navigation and MuJoCo tasks.

Abstract

A key challenge of continual reinforcement learning (CRL) in dynamic environments is to promptly adapt the RL agent's behavior as the environment changes over its lifetime, while minimizing the catastrophic forgetting of the learned information. To address this challenge, in this article, we propose DaCoRL, i.e., dynamics-adaptive continual RL. DaCoRL learns a context-conditioned policy using progressive contextualization, which incrementally clusters a stream of stationary tasks in the dynamic environment into a series of contexts and opts for an expandable multihead neural network to approximate the policy. Specifically, we define a set of tasks with similar dynamics as an environmental context and formalize context inference as a procedure of online Bayesian infinite Gaussian mixture clustering on environment features, resorting to online Bayesian inference to infer the posterior…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Domain Adaptation and Few-Shot Learning · Human Pose and Action Recognition

MethodsKnowledge Distillation