Decomposing Prediction Mechanisms for In-Context Recall

Sultan Daniels; Dylan Davis; Dhruv Gautam; Wentinn Liao; Gireeja Ranade; Anant Sahai

arXiv:2507.01414·cs.LG·July 3, 2025

Decomposing Prediction Mechanisms for In-Context Recall

Sultan Daniels, Dylan Davis, Dhruv Gautam, Wentinn Liao, Gireeja Ranade, Anant Sahai

PDF

Open Access 3 Reviews

TL;DR

This paper investigates how transformer models perform in a toy problem combining in-context learning and associative recall, revealing two distinct mechanisms with different learning dynamics and demonstrating similar phenomena in translation tasks.

Contribution

The study introduces a toy problem to analyze in-context recall, identifying two separate mechanisms and their different emergence patterns, and extends findings to real translation tasks.

Findings

01

Two mechanisms for next-token prediction identified

02

First mechanism uses symbolic labels for recall, second performs Bayesian prediction

03

Distinct learning dynamics and phase transitions observed

Abstract

We introduce a new family of toy problems that combine features of linear-regression-style continuous in-context learning (ICL) with discrete associative recall. We pretrain transformer models on sample traces from this toy, specifically symbolically-labeled interleaved state observations from randomly drawn linear deterministic dynamical systems. We study if the transformer models can recall the state of a sequence previously seen in its context when prompted to do so with the corresponding in-context label. Taking a closer look at this task, it becomes clear that the model must perform two functions: (1) identify which system's state should be recalled and apply that system to its last seen state, and (2) continuing to apply the correct system to predict the subsequent states. Training dynamics reveal that the first capability emerges well into a model's training. Surprisingly, the…

Peer Reviews

Decision·Submitted to ICLR 2026

Reviewer 01Rating 4Confidence 4

Strengths

The paper proposes a new synthetic needle in a haystack task, which is interesting and novel. The experimental design of using 1) misdirected SPL and 2) synchronized observation are well-motivated. They also extend their analysis on OLMo checkpoints on translation tasks, which ground their findings with real-world evidence.

Weaknesses

I find that many of the claims in the paper could be considerably strengthened and simplified. * I am not fully convinced that the label-based recall hypothesis (H1) is decisively ruled out. One could explain Figure 1b) simply from the fact that the model sees more observation tokens (the 1-after and 2-after query) than the open SPL token itself. It would be valuable to test whether increasing the representational weight or length of the SPL (e.g., by replacing each open-label with a multi-toke

Reviewer 02Rating 2Confidence 3

Strengths

- The paper provides a new family of well-specified toy problems to study mechanisms used in Transformers for in-context recall

Weaknesses

- The setup, as motivated in Section 1.1, appears quite specific. I was missing a motivation of why the setup is of broader relevance or interest, e.g. to language models, or the transformer architecture, etc. This is a concern especially as the paper mainly concerns empirical studies of toy models trained on a toy task. - Interpretation of Section 3: Section 3.3, line 400: "0% edge overlap between the 1-after query and 2-after query circuits": As far as I understood the description in the secti

Reviewer 03Rating 6Confidence 4

Strengths

- This new tasks used to test ICL is appealing. It provides a nice link between standard ICL problems while keeping almost everything continuous, hence interpretable. The dynamics are quite intuitive yet retain significant depth to make an interesting analysis. - The experiments investigate a variety of different interventions to test the hypotheses H1 and H2 and show, clearly, that models will learn to perform the correct task on the first token after the new-task identifier, and then relying o

Weaknesses

- Much of the work (specifically the figures) is focused on the training dynamics of these model. While interesting, and should certainly be highlighted, the claims of the resulting model having these two distinct mechanisms to understand these linear-dynamics inputs typically are most important at the very end of training. This wasn't particularly central in the played results and rather had to be pulled out from the training dynamics - The paper focuses on one specific task and found a propert

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGaussian Processes and Bayesian Inference · Generative Adversarial Networks and Image Synthesis · Child and Animal Learning Development