Self-adapting Robotic Agents through Online Continual Reinforcement Learning with World Model Feedback

Fabian Domberg; Georg Schildbach

arXiv:2603.04029·cs.RO·March 5, 2026

Self-adapting Robotic Agents through Online Continual Reinforcement Learning with World Model Feedback

Fabian Domberg, Georg Schildbach

PDF

Open Access

TL;DR

This paper introduces a biologically inspired online continual reinforcement learning framework that enables robotic agents to adapt during deployment by detecting out-of-distribution events and finetuning autonomously.

Contribution

It extends DreamerV3 with world model feedback to facilitate automatic online adaptation without external supervision or domain knowledge.

Findings

01

Effective adaptation on continuous control tasks

02

Successful deployment on real-world robotic vehicle

03

Demonstrated autonomous self-improvement capabilities

Abstract

As learning-based robotic controllers are typically trained offline and deployed with fixed parameters, their ability to cope with unforeseen changes during operation is limited. Biologically inspired, this work presents a framework for online Continual Reinforcement Learning that enables automated adaptation during deployment. Building on DreamerV3, a model-based Reinforcement Learning algorithm, the proposed method leverages world model prediction residuals to detect out-of-distribution events and automatically trigger finetuning. Adaptation progress is monitored using both task-level performance signals and internal training metrics, allowing convergence to be assessed without external supervision and domain knowledge. The approach is validated on a variety of contemporary continuous control problems, including a quadruped robot in high-fidelity simulation, and a real-world model…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Adversarial Robustness in Machine Learning · Autonomous Vehicle Technology and Safety