DayDreamer: World Models for Physical Robot Learning
Philipp Wu, Alejandro Escontrela, Danijar Hafner, Ken Goldberg, Pieter, Abbeel

TL;DR
This paper demonstrates that the Dreamer algorithm can be effectively applied to real robots for online learning without simulators, enabling rapid adaptation and skill acquisition in complex, real-world environments.
Contribution
The paper shows for the first time that Dreamer can be used directly on physical robots for online learning, achieving fast, reset-free skill acquisition in real-world settings.
Findings
Quadruped robot learns to stand and walk in 1 hour.
Robots adapt within 10 minutes to perturbations.
Robots learn object manipulation and navigation from images.
Abstract
To solve tasks in complex environments, robots need to learn from experience. Deep reinforcement learning is a common approach to robot learning but requires a large amount of trial and error to learn, limiting its deployment in the physical world. As a consequence, many advances in robot learning rely on simulators. On the other hand, learning inside of simulators fails to capture the complexity of the real world, is prone to simulator inaccuracies, and the resulting behaviors do not adapt to changes in the world. The Dreamer algorithm has recently shown great promise for learning from small amounts of interaction by planning within a learned world model, outperforming pure reinforcement learning in video games. Learning a world model to predict the outcomes of potential actions enables planning in imagination, reducing the amount of trial and error needed in the real environment.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Human Pose and Action Recognition · Machine Learning and Data Classification
