AlphaViT: A flexible game-playing AI for multiple games and variable board sizes
Kazuhisa Fujita

TL;DR
This paper introduces AlphaViT, a transformer-based game-playing AI capable of handling multiple games and varying board sizes with a single shared neural network, outperforming traditional algorithms and approaching AlphaZero's performance.
Contribution
It presents novel transformer-based architectures (AlphaViT, AlphaViD, AlphaVDA) that enable multi-game and variable board size play within a unified neural network framework.
Findings
AlphaViT performs strongly across multiple games.
Pre-training on small games accelerates convergence.
Multi-game training can surpass single-game performance.
Abstract
We present three game-playing agents incorporating Vision Transformers (ViT) into the AlphaZero framework: AlphaViT, AlphaViD (AlphaViT with a transformer decoder), and AlphaVDA (AlphaViD with learnable action embeddings). These agents can play multiple board games of varying sizes using a single neural network with shared weights, thus overcoming AlphaZero's limitation of fixed board sizes. AlphaViT employs only a transformer encoder, whereas AlphaViD and AlphaVDA incorporate both a transformer encoder and a decoder. In AlphaViD, the decoder processes outputs from the encoder, whereas AlphaVDA uses learnable embeddings as the decoder inputs. The additional decoder in AlphaViD and AlphaVDA provides flexibility to adapt to various action spaces and board sizes. Experimental results show that the proposed agents, trained on either individual games or on multiple games simultaneously,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEducational Games and Gamification
MethodsAttention Is All You Need · Residual Connection · Softmax · Adam · Label Smoothing · Dropout · Dense Connections · Linear Layer · Layer Normalization · Vision Transformer
