Emergent World Models and Latent Variable Estimation in Chess-Playing Language Models
Adam Karvonen

TL;DR
This paper investigates whether language models trained on chess can develop internal world models and latent variable estimations, demonstrating that they learn to represent game states and player skills, which enhances their predictive performance.
Contribution
The study extends prior work from synthetic to real chess data, showing that language models learn internal representations of game states and latent variables like player skill, and demonstrates how adding skill vectors improves performance.
Findings
Models learn internal representations of board states.
Models estimate latent variables such as player skill.
Adding skill vectors improves prediction accuracy by up to 2.6 times.
Abstract
Language models have shown unprecedented capabilities, sparking debate over the source of their performance. Is it merely the outcome of learning syntactic patterns and surface level statistics, or do they extract semantics and a world model from the text? Prior work by Li et al. investigated this by training a GPT model on synthetic, randomly generated Othello games and found that the model learned an internal representation of the board state. We extend this work into the more complex domain of chess, training on real games and investigating our model's internal representations using linear probes and contrastive activations. The model is given no a priori knowledge of the game and is solely trained on next character prediction, yet we find evidence of internal representations of board state. We validate these internal representations by using them to make interventions on the model's…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Sports Analytics and Performance
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Attention Dropout · Residual Connection · Cosine Annealing · Multi-Head Attention · Linear Warmup With Cosine Annealing · Softmax · Discriminative Fine-Tuning
