Multi-Game Decision Transformers

Kuang-Huei Lee; Ofir Nachum; Mengjiao Yang; Lisa Lee; Daniel Freeman,; Winnie Xu; Sergio Guadarrama; Ian Fischer; Eric Jang; Henryk Michalewski,; Igor Mordatch

arXiv:2205.15241·cs.AI·October 18, 2022·27 cites

Multi-Game Decision Transformers

Kuang-Huei Lee, Ofir Nachum, Mengjiao Yang, Lisa Lee, Daniel Freeman,, Winnie Xu, Sergio Guadarrama, Ian Fischer, Eric Jang, Henryk Michalewski,, Igor Mordatch

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper demonstrates that a single transformer model trained offline can learn to play up to 46 Atari games at near-human performance, showing scalability and adaptability similar to language and vision models.

Contribution

It introduces a multi-game decision transformer that learns multiple Atari games simultaneously with a single set of weights, advancing generalist reinforcement learning.

Findings

01

Single transformer can play multiple Atari games at near-human levels

02

Performance scales with model size and improves with fine-tuning on new games

03

Multi-game models outperform other RL and behavioral cloning approaches

Abstract

A longstanding goal of the field of AI is a method for learning a highly capable, generalist agent from diverse experience. In the subfields of vision and language, this was largely achieved by scaling up transformer-based models and training them on large, diverse datasets. Motivated by this progress, we investigate whether the same strategy can be used to produce generalist reinforcement learning agents. Specifically, we show that a single transformer-based model - with a single set of weights - trained purely offline can play a suite of up to 46 Atari games simultaneously at close-to-human performance. When trained and evaluated appropriately, we find that the same trends observed in language and vision hold, including scaling of performance with model size and rapid adaptation to new games via fine-tuning. We compare several approaches in this multi-game setting, such as online and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

google-research/google-research/tree/master/multi_game_dt
jaxOfficial

Videos

Multi-Game Decision Transformers· slideslive

Taxonomy

TopicsReinforcement Learning in Robotics · Explainable Artificial Intelligence (XAI)

MethodsAttention Is All You Need · Linear Layer · Layer Normalization · Softmax · Dense Connections · Absolute Position Encodings · Dropout · Position-Wise Feed-Forward Layer · Multi-Head Attention · Byte Pair Encoding