ARROW: Augmented Replay for RObust World models

Abdulaziz Alyahya; Abdallah Al Siyabi; Markus R. Ernst; Luke Yang; Levin Kuhlmann; Gideon Kowadlo

arXiv:2603.11395·cs.LG·May 19, 2026

ARROW: Augmented Replay for RObust World models

Abdulaziz Alyahya, Abdallah Al Siyabi, Markus R. Ernst, Luke Yang, Levin Kuhlmann, Gideon Kowadlo

PDF

TL;DR

ARROW introduces a memory-efficient, bio-inspired replay buffer for continual reinforcement learning, significantly reducing forgetting and enhancing knowledge retention across diverse tasks.

Contribution

It extends DreamerV3 with a dual-buffer system for better experience retention, inspired by neuroscience, improving continual RL performance.

Findings

01

ARROW shows less forgetting on Atari tasks without shared structure.

02

ARROW maintains comparable forward transfer to baselines.

03

The approach is effective in both shared and non-shared task settings.

Abstract

Continual reinforcement learning challenges agents to acquire new skills while retaining previously learned ones with the goal of improving performance in both past and future tasks. Most existing approaches rely on model-free methods with replay buffers to mitigate catastrophic forgetting; however, these solutions often face significant scalability challenges due to large memory demands. Drawing inspiration from neuroscience, where the brain replays experiences to a predictive World Model rather than directly to the policy, we present ARROW (Augmented Replay for RObust World models), a model-based continual RL algorithm that extends DreamerV3 with a memory-efficient, distribution-matching replay buffer. Unlike standard fixed-size FIFO buffers, ARROW maintains two complementary buffers: a short-term buffer for recent experiences and a long-term buffer that preserves task diversity…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications