"I Don't Know" -- Towards Appropriate Trust with Certainty-Aware Retrieval Augmented Generation
Daan Di Scala, Maaike de Boer, P{\i}nar Yolum

TL;DR
This paper introduces CERTA, a retrieval-augmented system that conveys uncertainty in LLM answers to foster appropriate trust, supported by a new benchmark with diverse question-context pairs.
Contribution
The paper presents a novel CERTA system that integrates relevance-based uncertainty reflection and a comprehensive benchmark for non-objective questions.
Findings
CERTA improves identification of uncertain answers
It reduces over-agreement in responses
CERTA promotes cautiousness in moral judgments
Abstract
Achieving the right amount of trust in AI systems is important, but challenging. The problem is exacerbated with the rise of Large Language Models (LLMs) as they provide human-level communication capabilities, but potentially hallucinate in the content that they generate. Moreover, they express over-confidence in their answers, making it difficult for users to judge their truthfulness. An important human value that users seek is benevolence, which can be met by LLM's self-reflection leading to reliable and honest answers. Accordingly, this paper proposes conveying appropriate levels of self-reflected certainty to build appropriate trust. Our contributions are twofold: 1) We develop CERTA (Certainty Enhanced RAG for Trustworthy Answers), a specialized Retrieval Augmented Generation (RAG) system that incorporates the relevance between question, context, and answer to reflect its…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
