Mastering Diverse Domains through World Models

Danijar Hafner; Jurgis Pasukonis; Jimmy Ba; Timothy Lillicrap

arXiv:2301.04104·cs.AI·April 18, 2024·92 cites

Mastering Diverse Domains through World Models

Danijar Hafner, Jurgis Pasukonis, Jimmy Ba, Timothy Lillicrap

PDF

Open Access 5 Repos

TL;DR

DreamerV3 is a versatile reinforcement learning algorithm that learns environment models and imagines future scenarios, enabling it to outperform specialized methods across diverse tasks with a single configuration.

Contribution

It introduces DreamerV3, a general algorithm capable of solving over 150 tasks without task-specific tuning, including complex open-world challenges like Minecraft.

Findings

01

Outperforms specialized methods across 150+ tasks

02

Successfully collects diamonds in Minecraft from scratch

03

Achieves stable learning across diverse domains

Abstract

Developing a general algorithm that learns to solve tasks across a wide range of applications has been a fundamental challenge in artificial intelligence. Although current reinforcement learning algorithms can be readily applied to tasks similar to what they have been developed for, configuring them for new application domains requires significant human expertise and experimentation. We present DreamerV3, a general algorithm that outperforms specialized methods across over 150 diverse tasks, with a single configuration. Dreamer learns a model of the environment and improves its behavior by imagining future scenarios. Robustness techniques based on normalization, balancing, and transformations enable stable learning across domains. Applied out of the box, Dreamer is the first algorithm to collect diamonds in Minecraft from scratch without human data or curricula. This achievement has…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Anomaly Detection Techniques and Applications · Explainable Artificial Intelligence (XAI)