TL;DR
This paper studies how large language models handle multi-label classification, revealing their tendency to focus on one label at a time and proposing methods to improve label distribution alignment and classification accuracy.
Contribution
It uncovers the behavior of LLMs in multi-label tasks, introduces distribution alignment techniques, and proposes a simple method to enhance classification performance.
Findings
LLMs tend to suppress all but one label at each generation step.
Model scale increases confidence and reduces entropy in token distributions.
Max probability over all label distributions improves classification and alignment.
Abstract
Multi-label classification is prevalent in real-world settings, but the behavior of Large Language Models (LLMs) in this setting is understudied. We investigate how autoregressive LLMs perform multi-label classification, focusing on subjective tasks, by analyzing the output distributions of the models at each label generation step. We find that the initial probability distribution for the first label often does not reflect the eventual final output, even in terms of relative order and find LLMs tend to suppress all but one label at each generation step. We further observe that as model scale increases, their token distributions exhibit lower entropy and higher single-label confidence, but the internal relative ranking of the labels improves. Finetuning methods such as supervised finetuning and reinforcement learning amplify this phenomenon. We introduce the task of distribution…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
