Semantic Anchors in In-Context Learning: Why Small LLMs Cannot Flip Their Labels

Anantha Padmanaban Krishna Kumar (Boston University)

arXiv:2511.21038·cs.CL·November 27, 2025

Semantic Anchors in In-Context Learning: Why Small LLMs Cannot Flip Their Labels

Anantha Padmanaban Krishna Kumar (Boston University)

PDF

Open Access

TL;DR

This paper investigates whether in-context learning (ICL) in large language models can override pre-trained label semantics, finding that small models primarily adjust input projections onto stable semantic directions and cannot flip label meanings.

Contribution

The study introduces a semantic override rate metric and demonstrates that small LLMs cannot fundamentally change label semantics through ICL, highlighting fundamental limits of few-shot prompting.

Findings

01

Models improve accuracy with natural demonstrations while maintaining prior alignment.

02

Models cannot learn anti-semantic classifiers; prompt alignment increases at the expense of accuracy.

03

Semantic override rates remain zero in small to medium models, indicating stable semantic anchors.

Abstract

Can in-context learning (ICL) override pre-trained label semantics, or does it merely refine an existing semantic backbone? We address this question by treating LLMs as prompt-induced classifiers and contrasting their behavior under \emph{natural} demonstrations (with correct labels) and \emph{inverted} demonstrations (systematically flipping label meanings). We decompose ICL behavior into three alignment metrics (truth, prior, and prompt alignment) and introduce a semantic override rate, defined as correctness under flipped semantics. Across eight classification tasks and eight open-source LLMs (1--12B parameters), we find consistent evidence for a semantic anchor view. With natural demonstrations, ICL improves accuracy while maintaining strong prior alignment; most correct predictions coincide with zero-shot behavior, even when the prior is weak. With inverted demonstrations, models…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Artificial Intelligence in Healthcare and Education · Language and cultural evolution