A Little Confidence Goes a Long Way

John Scoville; Shang Gao; Devanshu Agrawal; Javed Qadrud-Din

arXiv:2408.11239·cs.LG·August 22, 2024

A Little Confidence Goes a Long Way

John Scoville, Shang Gao, Devanshu Agrawal, Javed Qadrud-Din

PDF

Open Access

TL;DR

This paper presents resource-efficient methods for binary classification in large language models by leveraging hidden state probes, enabling high performance without labeled data or extensive computation.

Contribution

It introduces novel unsupervised probing techniques and confidence scoring methods that match large LLM performance with significantly less resources.

Findings

01

Comparable accuracy to state-of-the-art LLMs

02

Requires no labeled data for training

03

Uses significantly less computational resources

Abstract

We introduce a group of related methods for binary classification tasks using probes of the hidden state activations in large language models (LLMs). Performance is on par with the largest and most advanced LLMs currently available, but requiring orders of magnitude fewer computational resources and not requiring labeled data. This approach involves translating class labels into a semantically rich description, spontaneous symmetry breaking of multilayer perceptron probes for unsupervised learning and inference, training probes to generate confidence scores (prior probabilities) from hidden state activations subject to known constraints via entropy maximization, and selecting the most confident probe model from an ensemble for prediction. These techniques are evaluated on four datasets using five base LLMs.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Machine Learning and Algorithms · Machine Learning and Data Classification

MethodsBalanced Selection