Out-of-distribution Tests Reveal Compositionality in Chess Transformers

Anna M\'esz\'aros; Patrik Reizinger; Ferenc Husz\'ar

arXiv:2510.20783·cs.LG·October 24, 2025

Out-of-distribution Tests Reveal Compositionality in Chess Transformers

Anna M\'esz\'aros, Patrik Reizinger, Ferenc Husz\'ar

PDF

Open Access

TL;DR

This study demonstrates that large chess Transformers can generalize rules and strategies to out-of-distribution scenarios, showing compositional understanding and rule adherence, with some limitations compared to symbolic AI methods.

Contribution

The paper provides evidence that chess Transformers exhibit compositional generalization and rule adherence in out-of-distribution scenarios, revealing emergent understanding of chess.

Findings

01

Transformers adhere to fundamental chess rules in OOD scenarios.

02

Models generate high-quality moves even in novel situations.

03

Performance gap exists between Transformers and symbolic AI in complex variants.

Abstract

Chess is a canonical example of a task that requires rigorous reasoning and long-term planning. Modern decision Transformers - trained similarly to LLMs - are able to learn competent gameplay, but it is unclear to what extent they truly capture the rules of chess. To investigate this, we train a 270M parameter chess Transformer and test it on out-of-distribution scenarios, designed to reveal failures of systematic generalization. Our analysis shows that Transformers exhibit compositional generalization, as evidenced by strong rule extrapolation: they adhere to fundamental syntactic rules of the game by consistently choosing valid moves even in situations very different from the training data. Moreover, they also generate high-quality moves for OOD puzzles. In a more challenging test, we evaluate the models on variants including Chess960 (Fischer Random Chess) - a variant of chess where…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Games · Robot Manipulation and Learning · Reinforcement Learning in Robotics