Towards the Effect of Examples on In-Context Learning: A Theoretical Case Study

Pengfei He; Yingqian Cui; Han Xu; Hui Liu; Makoto Yamada; Jiliang Tang; Yue Xing

arXiv:2410.09411·cs.LG·June 3, 2025

Towards the Effect of Examples on In-Context Learning: A Theoretical Case Study

Pengfei He, Yingqian Cui, Han Xu, Hui Liu, Makoto Yamada, Jiliang Tang, Yue Xing

PDF

Open Access

TL;DR

This paper provides a theoretical analysis of in-context learning in large language models, quantifying how pre-training knowledge, example label distribution, and noise influence classification accuracy.

Contribution

It introduces a probabilistic model extending Gaussian mixtures to analyze the impact of various factors on ICL performance in binary classification.

Findings

01

Contradictions between pre-training knowledge and examples affect reliance on each.

02

Label frequency and noise levels influence prediction accuracy.

03

Simulations and real-data experiments support theoretical insights.

Abstract

In-context learning (ICL) has emerged as a powerful capability for large language models (LLMs) to adapt to downstream tasks by leveraging a few (demonstration) examples. Despite its effectiveness, the mechanism behind ICL remains underexplored. To better understand how ICL integrates the examples with the knowledge learned by the LLM during pre-training (i.e., pre-training knowledge) and how the examples impact ICL, this paper conducts a theoretical study in binary classification tasks. In particular, we introduce a probabilistic model extending from the Gaussian mixture model to exactly quantify the impact of pre-training knowledge, label frequency, and label noise on the prediction accuracy. Based on our analysis, when the pre-training knowledge contradicts the knowledge in the examples, whether ICL prediction relies more on the pre-training knowledge or the examples depends on the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIntelligent Tutoring Systems and Adaptive Learning · Speech and dialogue systems

MethodsALIGN