Mastering Chess with a Transformer Model
Daniel Monroe, Philip A. Chalmers

TL;DR
This paper introduces Chessformer, a transformer-based chess model that achieves grandmaster-level performance with significantly less computation by emphasizing expressive position representations and domain-specific enhancements.
Contribution
The paper presents Chessformer, a novel transformer architecture with improved position representation that outperforms traditional models and reduces computational requirements in chess AI.
Findings
Chessformer matches grandmaster-level performance with 30x less computation.
It detects high-level positional features unlike traditional engines.
The model outperforms AlphaZero in strength and puzzle solving ability.
Abstract
Transformer models have demonstrated impressive capabilities when trained at scale, excelling at difficult cognitive tasks requiring complex reasoning and rational decision-making. In this paper, we explore the application of transformers to chess, focusing on the critical role of the position representation within the attention mechanism. We show that transformers endowed with a sufficiently expressive position representation can match existing chess-playing models at a fraction of the computational cost. Our architecture, which we call the Chessformer, significantly outperforms AlphaZero in both playing strength and puzzle solving ability with 8x less computation and matches prior grandmaster-level transformer-based agents in those metrics with 30x less computation. Our models also display an understanding of chess dissimilar and orthogonal to that of top traditional engines,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Games · Sports Analytics and Performance · Time Series Analysis and Forecasting
MethodsSoftmax · Attention Is All You Need · AlphaZero
