In-Context Learning Operates as Concept Subspace Learning

Wei Tang; Xinyan Jiang; Fakhri Karray; Lijie Hu

arXiv:2605.18830·cs.LG·May 20, 2026

In-Context Learning Operates as Concept Subspace Learning

Wei Tang, Xinyan Jiang, Fakhri Karray, Lijie Hu

PDF

TL;DR

This paper proposes a concept subspace framework for understanding in-context learning, showing that task information is concentrated in low-dimensional, task-aligned activation subspaces, enabling effective prediction and concept manipulation.

Contribution

It introduces a mechanistic, subspace-based view of in-context learning, demonstrating how structured demonstrations induce low-dimensional, task-specific concept subspaces in neural activations.

Findings

01

Prediction decomposes into concept-coordinate regression and off-subspace leakage.

02

A low-dimensional subspace (68-73 dimensions) captures most task information in Llama-3-8B.

03

Concept swaps redirect predictions, confirming the role of concept subspaces.

Abstract

Regression and Bayesian accounts of in-context learning (ICL) explain how demonstrations can induce predictors, while mechanistic analyses often identify compact activation directions that steer prompted behavior. However, it remains unclear whether structured demonstrations induce low-dimensional concept inference. We study this question through a concept-subspace view of ICL, in which tasks vary only along intrinsic concept coordinates, although inputs are observed in a high-dimensional ambient space. For ridge and least-squares ICL proxies, prediction decomposes exactly into concept-coordinate regression and off-subspace leakage. Under block-diagonal or near-block-diagonal covariance assumptions, the leading estimation and nuisance-sensitivity terms scale with the dimension of the concept subspace, while residual effects are controlled by cross-subspace coupling. This separation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.