Label Set Optimization via Activation Distribution Kurtosis for Zero-shot Classification with Generative Models

Yue Li; Zhixue Zhao; Carolina Scarton

arXiv:2410.19195·cs.CL·August 27, 2025

Label Set Optimization via Activation Distribution Kurtosis for Zero-shot Classification with Generative Models

Yue Li, Zhixue Zhao, Carolina Scarton

PDF

Open Access 1 Video

TL;DR

This paper introduces LOADS, a method that optimizes label set selection for zero-shot classification with large language models by analyzing neuron activation kurtosis, leading to significant performance improvements.

Contribution

The study systematically examines how label design impacts zero-shot ICL and proposes a kurtosis-based method for label selection that enhances performance without additional training.

Findings

01

Label choice significantly affects model performance and sensitivity.

02

Optimal labels activate fewer outlier neurons in LLMs.

03

LOADS improves zero-shot classification accuracy across tasks and languages.

Abstract

In-context learning (ICL) performance is highly sensitive to prompt design, yet the impact of class label options (e.g. lexicon or order) in zero-shot classification remains underexplored. This study proposes LOADS (Label set Optimization via Activation Distribution kurtosiS), a post-hoc method for selecting optimal label sets in zero-shot ICL with large language models (LLMs). LOADS is built upon the observations in our empirical analysis, the first to systematically examine how label option design (i.e., lexical choice, order, and elaboration) impacts classification performance. This analysis shows that the lexical choice of the labels in the prompt (such as agree vs. support in stance classification) plays an important role in both model performance and model's sensitivity to the label order. A further investigation demonstrates that optimal label words tend to activate fewer outlier…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Label Set Optimization via Activation Distribution Kurtosis for Zero-Shot Classification with Generative Models· underline

Taxonomy

TopicsMachine Learning and Data Classification

MethodsSparse Evolutionary Training