Generating with Confidence: Uncertainty Quantification for Black-box   Large Language Models

Zhen Lin; Shubhendu Trivedi; Jimeng Sun

arXiv:2305.19187·cs.CL·May 21, 2024·35 cites

Generating with Confidence: Uncertainty Quantification for Black-box Large Language Models

Zhen Lin, Shubhendu Trivedi, Jimeng Sun

PDF

Open Access 2 Repos

TL;DR

This paper explores uncertainty quantification methods for black-box large language models in natural language generation, proposing measures to assess response reliability and improve trustworthiness without needing model access.

Contribution

It introduces and compares uncertainty and confidence measures for black-box LLMs, focusing on selective NLG to identify unreliable outputs.

Findings

01

Semantic dispersion measure predicts response quality effectively

02

Uncertainty measures help in filtering unreliable responses

03

Results assist practitioners in managing LLM response trustworthiness

Abstract

Large language models (LLMs) specializing in natural language generation (NLG) have recently started exhibiting promising capabilities across a variety of domains. However, gauging the trustworthiness of responses generated by LLMs remains an open challenge, with limited research on uncertainty quantification (UQ) for NLG. Furthermore, existing literature typically assumes white-box access to language models, which is becoming unrealistic either due to the closed-source nature of the latest LLMs or computational constraints. In this work, we investigate UQ in NLG for *black-box* LLMs. We first differentiate *uncertainty* vs *confidence*: the former refers to the ``dispersion'' of the potential predictions for a fixed input, and the latter refers to the confidence on a particular prediction/generation. We then propose and compare several confidence/uncertainty measures, applying them to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Explainable Artificial Intelligence (XAI)