Markovian Circuit Tracing for Transformer State Dynamic

Abdullah X

arXiv:2605.20824·cs.LG·May 21, 2026

Markovian Circuit Tracing for Transformer State Dynamic

Abdullah X

PDF

TL;DR

This paper introduces Markovian Circuit Tracing (MCT), a benchmark and framework for analyzing transformer internal states and their coarse state-transition structures using synthetic HMM tasks.

Contribution

It provides a controlled benchmark and evaluation pipeline for interpreting transformer state dynamics, demonstrating that tiny transformers learn near-Bayes predictors and recover coarse transition signals.

Findings

01

Transformers learn near-Bayes next-token predictors with minimal excess loss.

02

State abstractions recover coarse transition signals, especially in persistent and lower-state regimes.

03

Patching recovered states significantly reduces KL divergence to true HMM targets.

Abstract

Many sequence computations are easier to study as movement through internal states than as isolated local circuits. We introduce Markovian Circuit Tracing (MCT), a diagnostic pipeline for testing whether transformer activations contain coarse state-transition structure. The benchmark uses synthetic Hidden Markov Model (HMM) tasks where latent states, transition matrices, Bayesian belief vectors, Bayes-optimal predictions, and forced-state counterfactual targets are known exactly. Across six HMM families and three seeds per family, tiny causal transformers learn near-Bayes next-token predictors, with mean excess loss over Bayes of 0.0138. Residual activations contain partial Bayesian belief information in this controlled synthetic benchmark. State abstractions extracted from these activations recover coarse transition signal, strongest in persistent and lower-state regimes, and weaker in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.