"I Don't Know" -- Towards Appropriate Trust with Certainty-Aware Retrieval Augmented Generation

Daan Di Scala; Maaike de Boer; P{\i}nar Yolum

arXiv:2605.00957·cs.IR·May 5, 2026

"I Don't Know" -- Towards Appropriate Trust with Certainty-Aware Retrieval Augmented Generation

Daan Di Scala, Maaike de Boer, P{\i}nar Yolum

PDF

TL;DR

This paper introduces CERTA, a retrieval-augmented system that conveys uncertainty in LLM answers to foster appropriate trust, supported by a new benchmark with diverse question-context pairs.

Contribution

The paper presents a novel CERTA system that integrates relevance-based uncertainty reflection and a comprehensive benchmark for non-objective questions.

Findings

01

CERTA improves identification of uncertain answers

02

It reduces over-agreement in responses

03

CERTA promotes cautiousness in moral judgments

Abstract

Achieving the right amount of trust in AI systems is important, but challenging. The problem is exacerbated with the rise of Large Language Models (LLMs) as they provide human-level communication capabilities, but potentially hallucinate in the content that they generate. Moreover, they express over-confidence in their answers, making it difficult for users to judge their truthfulness. An important human value that users seek is benevolence, which can be met by LLM's self-reflection leading to reliable and honest answers. Accordingly, this paper proposes conveying appropriate levels of self-reflected certainty to build appropriate trust. Our contributions are twofold: 1) We develop CERTA (Certainty Enhanced RAG for Trustworthy Answers), a specialized Retrieval Augmented Generation (RAG) system that incorporates the relevance between question, context, and answer to reflect its…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.