Multi-Game Decision Transformers
Kuang-Huei Lee, Ofir Nachum, Mengjiao Yang, Lisa Lee, Daniel Freeman,, Winnie Xu, Sergio Guadarrama, Ian Fischer, Eric Jang, Henryk Michalewski,, Igor Mordatch

TL;DR
This paper demonstrates that a single transformer model trained offline can learn to play up to 46 Atari games at near-human performance, showing scalability and adaptability similar to language and vision models.
Contribution
It introduces a multi-game decision transformer that learns multiple Atari games simultaneously with a single set of weights, advancing generalist reinforcement learning.
Findings
Single transformer can play multiple Atari games at near-human levels
Performance scales with model size and improves with fine-tuning on new games
Multi-game models outperform other RL and behavioral cloning approaches
Abstract
A longstanding goal of the field of AI is a method for learning a highly capable, generalist agent from diverse experience. In the subfields of vision and language, this was largely achieved by scaling up transformer-based models and training them on large, diverse datasets. Motivated by this progress, we investigate whether the same strategy can be used to produce generalist reinforcement learning agents. Specifically, we show that a single transformer-based model - with a single set of weights - trained purely offline can play a suite of up to 46 Atari games simultaneously at close-to-human performance. When trained and evaluated appropriately, we find that the same trends observed in language and vision hold, including scaling of performance with model size and rapid adaptation to new games via fine-tuning. We compare several approaches in this multi-game setting, such as online and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsReinforcement Learning in Robotics · Explainable Artificial Intelligence (XAI)
MethodsAttention Is All You Need · Linear Layer · Layer Normalization · Softmax · Dense Connections · Absolute Position Encodings · Dropout · Position-Wise Feed-Forward Layer · Multi-Head Attention · Byte Pair Encoding
