In-Context Learning Operates as Concept Subspace Learning
Wei Tang, Xinyan Jiang, Fakhri Karray, Lijie Hu

TL;DR
This paper proposes a concept subspace framework for understanding in-context learning, showing that task information is concentrated in low-dimensional, task-aligned activation subspaces, enabling effective prediction and concept manipulation.
Contribution
It introduces a mechanistic, subspace-based view of in-context learning, demonstrating how structured demonstrations induce low-dimensional, task-specific concept subspaces in neural activations.
Findings
Prediction decomposes into concept-coordinate regression and off-subspace leakage.
A low-dimensional subspace (68-73 dimensions) captures most task information in Llama-3-8B.
Concept swaps redirect predictions, confirming the role of concept subspaces.
Abstract
Regression and Bayesian accounts of in-context learning (ICL) explain how demonstrations can induce predictors, while mechanistic analyses often identify compact activation directions that steer prompted behavior. However, it remains unclear whether structured demonstrations induce low-dimensional concept inference. We study this question through a concept-subspace view of ICL, in which tasks vary only along intrinsic concept coordinates, although inputs are observed in a high-dimensional ambient space. For ridge and least-squares ICL proxies, prediction decomposes exactly into concept-coordinate regression and off-subspace leakage. Under block-diagonal or near-block-diagonal covariance assumptions, the leading estimation and nuisance-sensitivity terms scale with the dimension of the concept subspace, while residual effects are controlled by cross-subspace coupling. This separation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
