Model-Based Reinforcement Learning with Isolated Imaginations

Minting Pan; Xiangming Zhu; Yitao Zheng; Yunbo Wang and; Xiaokang Yang

arXiv:2303.14889·cs.LG·November 20, 2023·1 cites

Model-Based Reinforcement Learning with Isolated Imaginations

Minting Pan, Xiangming Zhu, Yitao Zheng, Yunbo Wang and, Xiaokang Yang

PDF

Open Access 1 Repo

TL;DR

Iso-Dream++ advances model-based reinforcement learning by isolating controllable and noncontrollable dynamics, enabling better long-horizon visuomotor control in complex environments like autonomous driving.

Contribution

It introduces a novel approach to decouple controllable and noncontrollable states via inverse dynamics optimization and policy learning on decoupled latent imaginations.

Findings

01

Outperforms existing models on CARLA and DeepMind Control benchmarks.

02

Effectively isolates mixed environment dynamics for improved long-term control.

03

Addresses training collapse and sparse dependency issues in state decoupling.

Abstract

World models learn the consequences of actions in vision-based interactive systems. However, in practical scenarios like autonomous driving, noncontrollable dynamics that are independent or sparsely dependent on action signals often exist, making it challenging to learn effective world models. To address this issue, we propose Iso-Dream++, a model-based reinforcement learning approach that has two main contributions. First, we optimize the inverse dynamics to encourage the world model to isolate controllable state transitions from the mixed spatiotemporal variations of the environment. Second, we perform policy optimization based on the decoupled latent imaginations, where we roll out noncontrollable states into the future and adaptively associate them with the current controllable state. This enables long-horizon visuomotor control tasks to benefit from isolating mixed dynamics sources…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

panmt/mbrl_with_isolated_imaginations
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Autonomous Vehicle Technology and Safety · Functional Brain Connectivity Studies

MethodsEntropy Regularization · Proximal Policy Optimization · CARLA: An Open Urban Driving Simulator