Teaching Models to Express Their Uncertainty in Words

Stephanie Lin; Jacob Hilton; Owain Evans

arXiv:2205.14334·cs.CL·June 14, 2022·54 cites

Teaching Models to Express Their Uncertainty in Words

Stephanie Lin, Jacob Hilton, Owain Evans

PDF

Open Access 1 Repo

TL;DR

This paper demonstrates that GPT-3 can learn to naturally express calibrated uncertainty about its answers in words, maintaining accuracy and calibration even under distribution shifts, without relying on logits.

Contribution

It introduces a method for GPT-3 to verbalize its uncertainty, showing calibration in natural language and under distribution shifts, a novel capability not previously demonstrated.

Findings

01

GPT-3 can generate well-calibrated uncertainty levels in natural language.

02

Uncertainty expressed verbally remains calibrated under distribution shifts.

03

Calibration depends on pre-trained latent representations related to epistemic uncertainty.

Abstract

We show that a GPT-3 model can learn to express uncertainty about its own answers in natural language -- without use of model logits. When given a question, the model generates both an answer and a level of confidence (e.g. "90% confidence" or "high confidence"). These levels map to probabilities that are well calibrated. The model also remains moderately calibrated under distribution shift, and is sensitive to uncertainty in its own answers, rather than imitating human examples. To our knowledge, this is the first time a model has been shown to express calibrated uncertainty about its own answers in natural language. For testing calibration, we introduce the CalibratedMath suite of tasks. We compare the calibration of uncertainty expressed in words ("verbalized probability") to uncertainty extracted from model logits. Both kinds of uncertainty are capable of generalizing calibration…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

sylinrl/calibratedmath
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Machine Learning and Algorithms

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · {Dispute@FaQ-s}How to file a dispute with Expedia? · Attention Is All You Need · Linear Layer · Multi-Head Attention · Attention Dropout · Adam · Residual Connection · Layer Normalization · Softmax