Self-adapting Robotic Agents through Online Continual Reinforcement Learning with World Model Feedback
Fabian Domberg, Georg Schildbach

TL;DR
This paper introduces a biologically inspired online continual reinforcement learning framework that enables robotic agents to adapt during deployment by detecting out-of-distribution events and finetuning autonomously.
Contribution
It extends DreamerV3 with world model feedback to facilitate automatic online adaptation without external supervision or domain knowledge.
Findings
Effective adaptation on continuous control tasks
Successful deployment on real-world robotic vehicle
Demonstrated autonomous self-improvement capabilities
Abstract
As learning-based robotic controllers are typically trained offline and deployed with fixed parameters, their ability to cope with unforeseen changes during operation is limited. Biologically inspired, this work presents a framework for online Continual Reinforcement Learning that enables automated adaptation during deployment. Building on DreamerV3, a model-based Reinforcement Learning algorithm, the proposed method leverages world model prediction residuals to detect out-of-distribution events and automatically trigger finetuning. Adaptation progress is monitored using both task-level performance signals and internal training metrics, allowing convergence to be assessed without external supervision and domain knowledge. The approach is validated on a variety of contemporary continuous control problems, including a quadruped robot in high-fidelity simulation, and a real-world model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Adversarial Robustness in Machine Learning · Autonomous Vehicle Technology and Safety
