A Causal World Model Underlying Next Token Prediction: Exploring GPT in a Controlled Environment

Raanan Y. Rohekar; Yaniv Gurwicz; Sungduk Yu; Estelle Aflalo; Vasudev Lal

arXiv:2412.07446·cs.AI·July 8, 2025

A Causal World Model Underlying Next Token Prediction: Exploring GPT in a Controlled Environment

Raanan Y. Rohekar, Yaniv Gurwicz, Sungduk Yu, Estelle Aflalo, Vasudev Lal

PDF

Open Access 1 Repo

TL;DR

This paper reveals that GPT models implicitly learn a causal world model through their attention mechanism, enabling zero-shot causal inference in controlled game environments, with high confidence in legal move predictions.

Contribution

It provides a causal interpretation of GPT's attention, introduces a method for zero-shot causal structure learning, and empirically tests this in strategic game settings.

Findings

01

GPT captures causal structures in attention for legal move prediction

02

High confidence correlates with correct causal inference

03

Fails to capture causal structure when illegal moves are generated

Abstract

Are generative pre-trained transformer (GPT) models, trained only to predict the next token, implicitly learning a world model from which sequences are generated one token at a time? We address this question by deriving a causal interpretation of the attention mechanism in GPT and presenting a causal world model that arises from this interpretation. Furthermore, we propose that GPT models, at inference time, can be utilized for zero-shot causal structure learning for input sequences, and introduce a corresponding confidence score. Empirical tests were conducted in controlled environments using the setups of the Othello and Chess strategy games. A GPT, pre-trained on real-world games played with the intention of winning, was tested on out-of-distribution synthetic data consisting of sequences of random legal moves. We find that the GPT model is likely to generate legal next moves for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

IntelLabs/causality-lab
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsScientific Computing and Data Management · Bayesian Modeling and Causal Inference · Machine Learning in Healthcare

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Dropout · Attention Dropout · Softmax · Cosine Annealing · Byte Pair Encoding · Linear Layer · Linear Warmup With Cosine Annealing · Discriminative Fine-Tuning