Learning Task Representations from In-Context Learning

Baturay Saglam; Xinyang Hu; Zhuoran Yang; Dionysis Kalogerias; Amin Karbasi

arXiv:2502.05390·cs.CL·November 11, 2025

Learning Task Representations from In-Context Learning

Baturay Saglam, Xinyang Hu, Zhuoran Yang, Dionysis Kalogerias, Amin Karbasi

PDF

Open Access

TL;DR

This paper introduces a method to encode task information in in-context learning prompts by leveraging attention heads in transformers, improving task generalization across text and regression modalities.

Contribution

It presents an automated approach to derive task vectors from attention heads, enhancing understanding and generalization of task representations in large language models.

Findings

01

Effective extraction of task-specific information from demonstrations

02

Improved generalization across text and regression tasks

03

A new benchmark for evaluating task fidelity in ICL

Abstract

Large language models (LLMs) have demonstrated remarkable proficiency in in-context learning (ICL), where models adapt to new tasks through example-based prompts without requiring parameter updates. However, understanding how tasks are internally encoded and generalized remains a challenge. To address some of the empirical and technical gaps in the literature, we introduce an automated formulation for encoding task information in ICL prompts as a function of attention heads within the transformer architecture. This approach computes a single task vector as a weighted sum of attention heads, with the weights optimized causally via gradient descent. Our findings show that existing methods fail to generalize effectively to modalities beyond text. In response, we also design a benchmark to evaluate whether a task vector can preserve task fidelity in functional regression tasks. The proposed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIntelligent Tutoring Systems and Adaptive Learning · Online Learning and Analytics