Learn to Select: Exploring Label Distribution Divergence for In-Context Demonstration Selection in Text Classification
Ye Jiang, Taihang Wang, Youzheng Liu, Yimin Wang, Yuhan Xia, Yunfei Long

TL;DR
This paper introduces a novel demonstration selection method for in-context learning in text classification that considers both semantic similarity and label distribution alignment, improving performance across benchmarks.
Contribution
It proposes a two-stage selection method leveraging label distribution divergence, addressing the gap in existing methods that ignore label distribution alignment.
Findings
Our method outperforms previous strategies on seven benchmarks.
Label distribution alignment positively correlates with LLM performance.
Using a fine-tuned SLM improves demonstration selection accuracy.
Abstract
In-context learning (ICL) for text classification, which uses a few input-label demonstrations to describe a task, has demonstrated impressive performance on large language models (LLMs). However, the selection of in-context demonstrations plays a crucial role and can significantly affect LLMs' performance. Most existing demonstration selection methods primarily focus on semantic similarity between test inputs and demonstrations, often overlooking the importance of label distribution alignment. To address this limitation, we propose a two-stage demonstration selection method, TopK + Label Distribution Divergence (L2D), which leverages a fine-tuned BERT-like small language model (SLM) to generate label distributions and calculate their divergence for both test inputs and candidate demonstrations. This enables the selection of demonstrations that are not only semantically similar but also…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText and Document Classification Technologies · Machine Learning and Data Classification · Sentiment Analysis and Opinion Mining
