Model-Based Reinforcement Learning with Isolated Imaginations
Minting Pan, Xiangming Zhu, Yitao Zheng, Yunbo Wang and, Xiaokang Yang

TL;DR
Iso-Dream++ advances model-based reinforcement learning by isolating controllable and noncontrollable dynamics, enabling better long-horizon visuomotor control in complex environments like autonomous driving.
Contribution
It introduces a novel approach to decouple controllable and noncontrollable states via inverse dynamics optimization and policy learning on decoupled latent imaginations.
Findings
Outperforms existing models on CARLA and DeepMind Control benchmarks.
Effectively isolates mixed environment dynamics for improved long-term control.
Addresses training collapse and sparse dependency issues in state decoupling.
Abstract
World models learn the consequences of actions in vision-based interactive systems. However, in practical scenarios like autonomous driving, noncontrollable dynamics that are independent or sparsely dependent on action signals often exist, making it challenging to learn effective world models. To address this issue, we propose Iso-Dream++, a model-based reinforcement learning approach that has two main contributions. First, we optimize the inverse dynamics to encourage the world model to isolate controllable state transitions from the mixed spatiotemporal variations of the environment. Second, we perform policy optimization based on the decoupled latent imaginations, where we roll out noncontrollable states into the future and adaptively associate them with the current controllable state. This enables long-horizon visuomotor control tasks to benefit from isolating mixed dynamics sources…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Autonomous Vehicle Technology and Safety · Functional Brain Connectivity Studies
MethodsEntropy Regularization · Proximal Policy Optimization · CARLA: An Open Urban Driving Simulator
