Augmented World Models Facilitate Zero-Shot Dynamics Generalization From a Single Offline Environment
Philip J. Ball, Cong Lu, Jack Parker-Holder, Stephen Roberts

TL;DR
This paper introduces Augmented World Models (AugWM), which enhance offline reinforcement learning policies to adapt to changing dynamics in new environments, significantly improving zero-shot generalization capabilities.
Contribution
The paper proposes augmenting learned dynamics models with simple transformations and self-supervised context learning to handle environment changes, improving zero-shot transfer in RL.
Findings
Significant improvement in zero-shot generalization across 100+ dynamic settings.
AugWM often outperforms baseline methods where they fail.
Simple transformations effectively capture environmental changes.
Abstract
Reinforcement learning from large-scale offline datasets provides us with the ability to learn policies without potentially unsafe or impractical exploration. Significant progress has been made in the past few years in dealing with the challenge of correcting for differing behavior between the data collection and learned policies. However, little attention has been paid to potentially changing dynamics when transferring a policy to the online setting, where performance can be up to 90% reduced for existing methods. In this paper we address this problem with Augmented World Models (AugWM). We augment a learned dynamics model with simple transformations that seek to capture potential changes in physical properties of the robot, leading to more robust policies. We not only train our policy in this new setting, but also provide it with the sampled augmentation as a context, allowing it to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsReinforcement Learning in Robotics · Model Reduction and Neural Networks · Adversarial Robustness in Machine Learning
