Efficient World Models with Context-Aware Tokenization

Vincent Micheli; Eloi Alonso; Fran\c{c}ois Fleuret

arXiv:2406.19320·cs.LG·June 28, 2024

Efficient World Models with Context-Aware Tokenization

Vincent Micheli, Eloi Alonso, Fran\c{c}ois Fleuret

PDF

Open Access 1 Repo 1 Models

TL;DR

This paper introduces $\Delta$-IRIS, a novel world model architecture that uses context-aware tokenization to improve efficiency and performance in deep RL, achieving state-of-the-art results with faster training.

Contribution

The paper presents a new agent with a world model combining a discrete autoencoder for delta encoding and an autoregressive transformer, significantly reducing training time and improving performance.

Findings

01

Sets new state-of-the-art in Crafter benchmark.

02

Faster training compared to previous attention-based models.

03

Effective environment simulation with fewer tokens.

Abstract

Scaling up deep Reinforcement Learning (RL) methods presents a significant challenge. Following developments in generative modelling, model-based RL positions itself as a strong contender. Recent advances in sequence modelling have led to effective transformer-based world models, albeit at the price of heavy computations due to the long sequences of tokens required to accurately simulate environments. In this work, we propose $Δ$ -IRIS, a new agent with a world model architecture composed of a discrete autoencoder that encodes stochastic deltas between time steps and an autoregressive transformer that predicts future deltas by summarizing the current state of the world with continuous tokens. In the Crafter benchmark, $Δ$ -IRIS sets a new state of the art at multiple frame budgets, while being an order of magnitude faster to train than previous attention-based approaches. We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

vmicheli/delta-iris
pytorchOfficial

Models

🤗
vmicheli/delta-iris
model· ♡ 1
♡ 1

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsData Management and Algorithms · Advanced Database Systems and Queries · Graph Theory and Algorithms