"I'm Not Sure, But...": Examining the Impact of Large Language Models' Uncertainty Expression on User Reliance and Trust
Sunnie S. Y. Kim, Q. Vera Liao, Mihaela Vorvoreanu, Stephanie, Ballard, Jennifer Wortman Vaughan

TL;DR
This study investigates how natural language expressions of uncertainty in large language models influence user reliance, trust, and accuracy, revealing that explicit uncertainty expressions can reduce overreliance and improve decision-making.
Contribution
It provides empirical evidence on the effects of natural language uncertainty expressions in LLMs, highlighting their potential to mitigate overreliance and improve user accuracy.
Findings
First-person uncertainty expressions decrease reliance and increase accuracy.
General perspective uncertainty expressions have weaker, non-significant effects.
Natural language uncertainty communication can effectively reduce overreliance.
Abstract
Widely deployed large language models (LLMs) can produce convincing yet incorrect outputs, potentially misleading users who may rely on them as if they were correct. To reduce such overreliance, there have been calls for LLMs to communicate their uncertainty to end users. However, there has been little empirical work examining how users perceive and act upon LLMs' expressions of uncertainty. We explore this question through a large-scale, pre-registered, human-subject experiment (N=404) in which participants answer medical questions with or without access to responses from a fictional LLM-infused search engine. Using both behavioral and self-reported measures, we examine how different natural language expressions of uncertainty impact participants' reliance, trust, and overall task performance. We find that first-person expressions (e.g., "I'm not sure, but...") decrease participants'…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
