Dynamics-Adaptive Continual Reinforcement Learning via Progressive Contextualization
Tiantian Zhang, Zichuan Lin, Yuxing Wang, Deheng Ye, Qiang Fu, Wei, Yang, Xueqian Wang, Bin Liang, Bo Yuan, and Xiu Li

TL;DR
DaCoRL introduces a dynamics-adaptive continual reinforcement learning framework that incrementally clusters environment contexts and adaptively expands neural networks to improve stability and generalization in dynamic environments.
Contribution
It proposes a novel context inference method using Bayesian clustering and an expandable neural network architecture for continual RL in changing environments.
Findings
Outperforms existing methods in stability and overall performance.
Effectively classifies and adapts to new environment contexts.
Demonstrates superior generalization in robot navigation and MuJoCo tasks.
Abstract
A key challenge of continual reinforcement learning (CRL) in dynamic environments is to promptly adapt the RL agent's behavior as the environment changes over its lifetime, while minimizing the catastrophic forgetting of the learned information. To address this challenge, in this article, we propose DaCoRL, i.e., dynamics-adaptive continual RL. DaCoRL learns a context-conditioned policy using progressive contextualization, which incrementally clusters a stream of stationary tasks in the dynamic environment into a series of contexts and opts for an expandable multihead neural network to approximate the policy. Specifically, we define a set of tasks with similar dynamics as an environmental context and formalize context inference as a procedure of online Bayesian infinite Gaussian mixture clustering on environment features, resorting to online Bayesian inference to infer the posterior…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Domain Adaptation and Few-Shot Learning · Human Pose and Action Recognition
MethodsKnowledge Distillation
