Task Schema and Binding: A Double Dissociation Study of In-Context Learning

Chaeha Kim

arXiv:2512.17325·cs.LG·December 22, 2025

Task Schema and Binding: A Double Dissociation Study of In-Context Learning

Chaeha Kim

PDF

Open Access

TL;DR

This paper demonstrates that in-context learning in models decomposes into two distinct mechanisms, Task Schema and Binding, which are causally validated through extensive experiments across multiple architectures, revealing their separability and implications for prompt engineering.

Contribution

The study provides causal mechanistic validation of the dual mechanisms underlying in-context learning, establishing their separability and generality across architectures, and offers insights for improving prompt engineering.

Findings

01

Task Schema transfers at 100% via late MLP patching.

02

Binding transfers at 62% via residual stream patching.

03

Models rely on Task Schema when prior knowledge is absent.

Abstract

We provide causal mechanistic validation that in-context learning (ICL) decomposes into two separable mechanisms: Task Schema (abstract task type recognition) and Binding (specific input-output associations). Through activation patching experiments across 9 models from 7 Transformer families plus Mamba (370M-13B parameters), we establish three key findings: 1. Double dissociation: Task Schema transfers at 100% via late MLP patching; Binding transfers at 62% via residual stream patching -- proving separable mechanisms 2. Prior-Schema trade-off: Schema reliance inversely correlates with prior knowledge (Spearman rho = -0.596, p < 0.001, N=28 task-model pairs) 3. Architecture generality: The mechanism operates across all tested architectures including the non-Transformer Mamba These findings offer a mechanistic account of the ICL puzzle that contrasts with prior views treating ICL…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning · EEG and Brain-Computer Interfaces