A Little Confidence Goes a Long Way
John Scoville, Shang Gao, Devanshu Agrawal, Javed Qadrud-Din

TL;DR
This paper presents resource-efficient methods for binary classification in large language models by leveraging hidden state probes, enabling high performance without labeled data or extensive computation.
Contribution
It introduces novel unsupervised probing techniques and confidence scoring methods that match large LLM performance with significantly less resources.
Findings
Comparable accuracy to state-of-the-art LLMs
Requires no labeled data for training
Uses significantly less computational resources
Abstract
We introduce a group of related methods for binary classification tasks using probes of the hidden state activations in large language models (LLMs). Performance is on par with the largest and most advanced LLMs currently available, but requiring orders of magnitude fewer computational resources and not requiring labeled data. This approach involves translating class labels into a semantically rich description, spontaneous symmetry breaking of multilayer perceptron probes for unsupervised learning and inference, training probes to generate confidence scores (prior probabilities) from hidden state activations subject to known constraints via entropy maximization, and selecting the most confident probe model from an ensemble for prediction. These techniques are evaluated on four datasets using five base LLMs.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Machine Learning and Algorithms · Machine Learning and Data Classification
MethodsBalanced Selection
