Task Schema and Binding: A Double Dissociation Study of In-Context Learning
Chaeha Kim

TL;DR
This paper demonstrates that in-context learning in models decomposes into two distinct mechanisms, Task Schema and Binding, which are causally validated through extensive experiments across multiple architectures, revealing their separability and implications for prompt engineering.
Contribution
The study provides causal mechanistic validation of the dual mechanisms underlying in-context learning, establishing their separability and generality across architectures, and offers insights for improving prompt engineering.
Findings
Task Schema transfers at 100% via late MLP patching.
Binding transfers at 62% via residual stream patching.
Models rely on Task Schema when prior knowledge is absent.
Abstract
We provide causal mechanistic validation that in-context learning (ICL) decomposes into two separable mechanisms: Task Schema (abstract task type recognition) and Binding (specific input-output associations). Through activation patching experiments across 9 models from 7 Transformer families plus Mamba (370M-13B parameters), we establish three key findings: 1. Double dissociation: Task Schema transfers at 100% via late MLP patching; Binding transfers at 62% via residual stream patching -- proving separable mechanisms 2. Prior-Schema trade-off: Schema reliance inversely correlates with prior knowledge (Spearman rho = -0.596, p < 0.001, N=28 task-model pairs) 3. Architecture generality: The mechanism operates across all tested architectures including the non-Transformer Mamba These findings offer a mechanistic account of the ICL puzzle that contrasts with prior views treating ICL…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning · EEG and Brain-Computer Interfaces
