Diffusion for World Modeling: Visual Details Matter in Atari

Eloi Alonso; Adam Jelley; Vincent Micheli; Anssi Kanervisto; Amos; Storkey; Tim Pearce; Fran\c{c}ois Fleuret

arXiv:2405.12399·cs.LG·October 31, 2024·2 cites

Diffusion for World Modeling: Visual Details Matter in Atari

Eloi Alonso, Adam Jelley, Vincent Micheli, Anssi Kanervisto, Amos, Storkey, Tim Pearce, Fran\c{c}ois Fleuret

PDF

Open Access 1 Repo 1 Models 1 Datasets 1 Video

TL;DR

This paper introduces DIAMOND, a diffusion-based world model for reinforcement learning that captures visual details more effectively, leading to improved performance on Atari benchmarks and potential as an interactive neural game engine.

Contribution

It proposes a novel diffusion-based world model for reinforcement learning, demonstrating enhanced visual detail preservation and superior performance on Atari benchmarks.

Findings

01

DIAMOND achieves a mean human normalized score of 1.46 on Atari 100k.

02

Diffusion models improve visual detail retention in world models.

03

DIAMOND can serve as an interactive neural game engine.

Abstract

World models constitute a promising approach for training reinforcement learning agents in a safe and sample-efficient manner. Recent world models predominantly operate on sequences of discrete latent variables to model environment dynamics. However, this compression into a compact discrete representation may ignore visual details that are important for reinforcement learning. Concurrently, diffusion models have become a dominant approach for image generation, challenging well-established methods modeling discrete latents. Motivated by this paradigm shift, we introduce DIAMOND (DIffusion As a Model Of eNvironment Dreams), a reinforcement learning agent trained in a diffusion world model. We analyze the key design choices that are required to make diffusion suitable for world modeling, and demonstrate how improved visual details can lead to improved agent performance. DIAMOND achieves a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

eloialonso/diamond
pytorchOfficial

Models

🤗
eloialonso/diamond
model· ♡ 28
♡ 28

Datasets

TeaPearce/CounterStrike_Deathmatch
dataset· 667 dl
667 dl

Videos

Diffusion for World Modeling: Visual Details Matter in Atari· slideslive

Taxonomy

TopicsData Visualization and Analytics · Artificial Intelligence in Games

MethodsDiffusion