Language Models Struggle to Use Representations Learned In-Context

Michael A. Lepori; Tal Linzen; Ann Yuan; Katja Filippova

arXiv:2602.04212·cs.CL·May 4, 2026

Language Models Struggle to Use Representations Learned In-Context

Michael A. Lepori, Tal Linzen, Ann Yuan, Katja Filippova

PDF

TL;DR

This paper investigates whether large language models can effectively use in-context learned representations for downstream tasks, revealing significant struggles even in state-of-the-art models, and aims to inspire new methods for better deployment of in-context information.

Contribution

It demonstrates that current open- and closed-source LLMs struggle to utilize in-context representations for task completion, highlighting a gap in flexible deployment of learned in-context information.

Findings

01

Open-weights LLMs struggle with in-context representations for next-token prediction.

02

State-of-the-art reasoning models cannot reliably leverage novel in-context patterns.

03

Models encode in-context information but fail to deploy it effectively for downstream tasks.

Abstract

Though large language models (LLMs) have enabled great success across a wide variety of tasks, they still appear to fall short of one of the loftier goals of artificial intelligence research: creating an artificial system that can adapt its behavior to radically new contexts upon deployment. One important step towards this goal is to create systems that can induce rich representations of data that are seen in-context, and then flexibly deploy these representations to accomplish goals. Recently, Park et al. (2024) demonstrated that current LLMs are indeed capable of inducing such representation from context (i.e., in-context representation learning). The present study investigates whether LLMs can use these representations to complete simple downstream tasks. We first assess whether open-weights LLMs can use in-context representations for next-token prediction, and then probe models…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.