Contextual Candor: Enhancing LLM Trustworthiness Through Hierarchical Unanswerability Detection

Steven Robinson; Antonio Carlos Rivera

arXiv:2506.01104·cs.CL·June 3, 2025

Contextual Candor: Enhancing LLM Trustworthiness Through Hierarchical Unanswerability Detection

Steven Robinson, Antonio Carlos Rivera

PDF

Open Access

TL;DR

This paper presents Reinforced Unanswerability Learning (RUL), a hybrid training method that improves large language models' ability to detect unanswerable questions and generate trustworthy responses, enhancing AI reliability and user trust.

Contribution

Introduction of RUL, a novel hybrid training paradigm combining hierarchical unanswerability detection with reinforcement learning and a new annotated dataset, improving LLM trustworthiness.

Findings

01

RUL significantly improves unanswerability detection accuracy.

02

RUL increases appropriate refusal responses for unanswerable questions.

03

Human evaluations show enhanced helpfulness and trustworthiness.

Abstract

The pervasive deployment of large language models (LLMs) in conversational AI systems has revolutionized information access, yet their propensity for generating factually unsupported or hallucinated responses remains a critical impediment to trustworthiness and widespread adoption. This paper introduces Reinforced Unanswerability Learning (RUL), a novel hybrid training paradigm designed to imbue LLMs with the intrinsic capability to accurately detect unanswerable questions and generate reliably appropriate responses. Unlike conventional approaches that rely on external classifiers or simple prompting, RUL integrates a discriminative unanswerability prediction head with the LLM's generative core, guided by a multi-stage learning strategy. This includes supervised fine-tuning on a novel, richly annotated dataset, Enhanced-CAsT-Answerability (ECA), which features hierarchical answerability…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Access Control and Trust · Cloud Data Security Solutions