Semantic Anchors in In-Context Learning: Why Small LLMs Cannot Flip Their Labels
Anantha Padmanaban Krishna Kumar (Boston University)

TL;DR
This paper investigates whether in-context learning (ICL) in large language models can override pre-trained label semantics, finding that small models primarily adjust input projections onto stable semantic directions and cannot flip label meanings.
Contribution
The study introduces a semantic override rate metric and demonstrates that small LLMs cannot fundamentally change label semantics through ICL, highlighting fundamental limits of few-shot prompting.
Findings
Models improve accuracy with natural demonstrations while maintaining prior alignment.
Models cannot learn anti-semantic classifiers; prompt alignment increases at the expense of accuracy.
Semantic override rates remain zero in small to medium models, indicating stable semantic anchors.
Abstract
Can in-context learning (ICL) override pre-trained label semantics, or does it merely refine an existing semantic backbone? We address this question by treating LLMs as prompt-induced classifiers and contrasting their behavior under \emph{natural} demonstrations (with correct labels) and \emph{inverted} demonstrations (systematically flipping label meanings). We decompose ICL behavior into three alignment metrics (truth, prior, and prompt alignment) and introduce a semantic override rate, defined as correctness under flipped semantics. Across eight classification tasks and eight open-source LLMs (1--12B parameters), we find consistent evidence for a semantic anchor view. With natural demonstrations, ICL improves accuracy while maintaining strong prior alignment; most correct predictions coincide with zero-shot behavior, even when the prior is weak. With inverted demonstrations, models…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Artificial Intelligence in Healthcare and Education · Language and cultural evolution
