ED2: Environment Dynamics Decomposition World Models for Continuous Control
Jianye Hao, Yifu Yuan, Cong Wang, Zhen Wang

TL;DR
ED2 introduces a novel environment decomposition framework for model-based reinforcement learning, significantly reducing model error and improving sample efficiency and performance in continuous control tasks.
Contribution
The paper proposes ED2, a new framework that decomposes environment dynamics into sub-dynamics, enabling more accurate world models and better integration with existing MBRL algorithms.
Findings
ED2 reduces model prediction error.
ED2 improves sample efficiency in continuous control tasks.
ED2 achieves higher asymptotic performance.
Abstract
Model-based reinforcement learning (MBRL) achieves significant sample efficiency in practice in comparison to model-free RL, but its performance is often limited by the existence of model prediction error. To reduce the model error, standard MBRL approaches train a single well-designed network to fit the entire environment dynamics, but this wastes rich information on multiple sub-dynamics which can be modeled separately, allowing us to construct the world model more accurately. In this paper, we propose the Environment Dynamics Decomposition (ED2), a novel world model construction framework that models the environment in a decomposing manner. ED2 contains two key components: sub-dynamics discovery (SD2) and dynamics decomposition prediction (D2P). SD2 discovers the sub-dynamics in an environment automatically and then D2P constructs the decomposed world model following the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Muscle activation and electromyography studies · Fuel Cells and Related Materials
