ED2: Environment Dynamics Decomposition World Models for Continuous   Control

Jianye Hao; Yifu Yuan; Cong Wang; Zhen Wang

arXiv:2112.02817·cs.LG·February 16, 2024·1 cites

ED2: Environment Dynamics Decomposition World Models for Continuous Control

Jianye Hao, Yifu Yuan, Cong Wang, Zhen Wang

PDF

Open Access 1 Repo

TL;DR

ED2 introduces a novel environment decomposition framework for model-based reinforcement learning, significantly reducing model error and improving sample efficiency and performance in continuous control tasks.

Contribution

The paper proposes ED2, a new framework that decomposes environment dynamics into sub-dynamics, enabling more accurate world models and better integration with existing MBRL algorithms.

Findings

01

ED2 reduces model prediction error.

02

ED2 improves sample efficiency in continuous control tasks.

03

ED2 achieves higher asymptotic performance.

Abstract

Model-based reinforcement learning (MBRL) achieves significant sample efficiency in practice in comparison to model-free RL, but its performance is often limited by the existence of model prediction error. To reduce the model error, standard MBRL approaches train a single well-designed network to fit the entire environment dynamics, but this wastes rich information on multiple sub-dynamics which can be modeled separately, allowing us to construct the world model more accurately. In this paper, we propose the Environment Dynamics Decomposition (ED2), a novel world model construction framework that models the environment in a decomposing manner. ED2 contains two key components: sub-dynamics discovery (SD2) and dynamics decomposition prediction (D2P). SD2 discovers the sub-dynamics in an environment automatically and then D2P constructs the decomposed world model following the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ed2-source-code/ed2
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Muscle activation and electromyography studies · Fuel Cells and Related Materials