Finetuning Language Models to Emit Linguistic Expressions of Uncertainty
Arslan Chaudhry, Sridhar Thiagarajan, Dilan Gorur

TL;DR
This paper investigates finetuning large language models to produce linguistically expressed uncertainty, improving their calibration and reliability in conveying confidence levels in answers.
Contribution
It introduces a supervised finetuning approach that enhances LLMs' ability to generate calibrated expressions of uncertainty based on their confidence assessments.
Findings
LLMs can be calibrated to assess their prediction confidence.
Supervised finetuning improves the accuracy of expressed uncertainty.
Calibrated models better align confidence with correctness.
Abstract
Large language models (LLMs) are increasingly employed in information-seeking and decision-making tasks. Despite their broad utility, LLMs tend to generate information that conflicts with real-world facts, and their persuasive style can make these inaccuracies appear confident and convincing. As a result, end-users struggle to consistently align the confidence expressed by LLMs with the accuracy of their predictions, often leading to either blind trust in all outputs or a complete disregard for their reliability. In this work, we explore supervised finetuning on uncertainty-augmented predictions as a method to develop models that produce linguistic expressions of uncertainty. Specifically, we measure the calibration of pre-trained models and then fine-tune language models to generate calibrated linguistic expressions of uncertainty. Through experiments on various question-answering…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsALIGN
