Uncertainty Estimation and Quantification for LLMs: A Simple Supervised   Approach

Linyu Liu; Yu Pan; Xiaocheng Li; Guanting Chen

arXiv:2404.15993·cs.LG·October 24, 2024·6 cites

Uncertainty Estimation and Quantification for LLMs: A Simple Supervised Approach

Linyu Liu, Yu Pan, Xiaocheng Li, Guanting Chen

PDF

Open Access 1 Repo

TL;DR

This paper introduces a supervised method for uncertainty estimation in large language models, leveraging hidden activations to improve calibration and transferability across tasks and model access levels.

Contribution

It proposes a novel supervised approach that utilizes hidden neuron activations for uncertainty estimation in LLMs, addressing a gap in existing calibration methods.

Findings

01

Enhanced uncertainty estimation using hidden activations.

02

Improved calibration performance with the proposed method.

03

Robust transferability in out-of-distribution scenarios.

Abstract

In this paper, we study the problem of uncertainty estimation and calibration for LLMs. We begin by formulating the uncertainty estimation problem, a relevant yet underexplored area in existing literature. We then propose a supervised approach that leverages labeled datasets to estimate the uncertainty in LLMs' responses. Based on the formulation, we illustrate the difference between the uncertainty estimation for LLMs and that for standard ML models and explain why the hidden neurons of the LLMs may contain uncertainty information. Our designed approach demonstrates the benefits of utilizing hidden activations to enhance uncertainty estimation across various tasks and shows robust transferability in out-of-distribution settings. We distinguish the uncertainty estimation task from the uncertainty calibration task and show that better uncertainty estimation leads to better calibration…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

LoveCatc/supervised-llm-uncertainty-estimation
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsScientific Measurement and Uncertainty Evaluation