"I'm Not Sure, But...": Examining the Impact of Large Language Models'   Uncertainty Expression on User Reliance and Trust

Sunnie S. Y. Kim; Q. Vera Liao; Mihaela Vorvoreanu; Stephanie; Ballard; Jennifer Wortman Vaughan

arXiv:2405.00623·cs.HC·May 16, 2024

"I'm Not Sure, But...": Examining the Impact of Large Language Models' Uncertainty Expression on User Reliance and Trust

Sunnie S. Y. Kim, Q. Vera Liao, Mihaela Vorvoreanu, Stephanie, Ballard, Jennifer Wortman Vaughan

PDF

TL;DR

This study investigates how natural language expressions of uncertainty in large language models influence user reliance, trust, and accuracy, revealing that explicit uncertainty expressions can reduce overreliance and improve decision-making.

Contribution

It provides empirical evidence on the effects of natural language uncertainty expressions in LLMs, highlighting their potential to mitigate overreliance and improve user accuracy.

Findings

01

First-person uncertainty expressions decrease reliance and increase accuracy.

02

General perspective uncertainty expressions have weaker, non-significant effects.

03

Natural language uncertainty communication can effectively reduce overreliance.

Abstract

Widely deployed large language models (LLMs) can produce convincing yet incorrect outputs, potentially misleading users who may rely on them as if they were correct. To reduce such overreliance, there have been calls for LLMs to communicate their uncertainty to end users. However, there has been little empirical work examining how users perceive and act upon LLMs' expressions of uncertainty. We explore this question through a large-scale, pre-registered, human-subject experiment (N=404) in which participants answer medical questions with or without access to responses from a fictional LLM-infused search engine. Using both behavioral and self-reported measures, we examine how different natural language expressions of uncertainty impact participants' reliance, trust, and overall task performance. We find that first-person expressions (e.g., "I'm not sure, but...") decrease participants'…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.