A Survey to Recent Progress Towards Understanding In-Context Learning

Haitao Mao; Guangliang Liu; Yao Ma; Rongrong Wang; Kristen Johnson,; Jiliang Tang

arXiv:2402.02212·cs.CL·January 28, 2025·1 cites

A Survey to Recent Progress Towards Understanding In-Context Learning

Haitao Mao, Guangliang Liu, Yao Ma, Rongrong Wang, Kristen Johnson,, Jiliang Tang

PDF

Open Access

TL;DR

This survey reviews recent progress in understanding In-Context Learning (ICL) in large language models, proposing a data generation perspective to clarify its mechanisms and identify future research directions.

Contribution

It introduces a systematic data generation perspective to reinterpret ICL, defining skill recognition and skill learning, and analyzes their strengths, weaknesses, and commonalities.

Findings

01

Provides a conceptual framework for ICL mechanisms.

02

Highlights the potential of data generation perspective.

03

Suggests future research directions in ICL.

Abstract

In-Context Learning (ICL) empowers Large Language Models (LLMs) with the ability to learn from a few examples provided in the prompt, enabling downstream generalization without the requirement for gradient updates. Despite encouragingly empirical success, the underlying mechanism of ICL remains unclear. Existing research remains ambiguous with various viewpoints, utilizing intuition-driven and ad-hoc technical solutions to interpret ICL. In this paper, we leverage a data generation perspective to reinterpret recent efforts from a systematic angle, demonstrating the potential broader usage of these popular technical solutions. For a conceptual definition, we rigorously adopt the terms of skill recognition and skill learning. Skill recognition selects one learned data generation function previously seen during pre-training while skill learning can learn new data generation functions from…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsOnline Learning and Analytics