Emergent World Models and Latent Variable Estimation in Chess-Playing   Language Models

Adam Karvonen

arXiv:2403.15498·cs.LG·July 16, 2024·3 cites

Emergent World Models and Latent Variable Estimation in Chess-Playing Language Models

Adam Karvonen

PDF

Open Access 1 Repo

TL;DR

This paper investigates whether language models trained on chess can develop internal world models and latent variable estimations, demonstrating that they learn to represent game states and player skills, which enhances their predictive performance.

Contribution

The study extends prior work from synthetic to real chess data, showing that language models learn internal representations of game states and latent variables like player skill, and demonstrates how adding skill vectors improves performance.

Findings

01

Models learn internal representations of board states.

02

Models estimate latent variables such as player skill.

03

Adding skill vectors improves prediction accuracy by up to 2.6 times.

Abstract

Language models have shown unprecedented capabilities, sparking debate over the source of their performance. Is it merely the outcome of learning syntactic patterns and surface level statistics, or do they extract semantics and a world model from the text? Prior work by Li et al. investigated this by training a GPT model on synthetic, randomly generated Othello games and found that the model learned an internal representation of the board state. We extend this work into the more complex domain of chess, training on real games and investigating our model's internal representations using linear probes and contrastive activations. The model is given no a priori knowledge of the game and is solely trained on next character prediction, yet we find evidence of internal representations of board state. We validate these internal representations by using them to make interventions on the model's…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

adamkarvonen/chess_llm_interpretability
jaxOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Sports Analytics and Performance

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Attention Dropout · Residual Connection · Cosine Annealing · Multi-Head Attention · Linear Warmup With Cosine Annealing · Softmax · Discriminative Fine-Tuning