Robust Uncertainty Quantification for Factual Generation of Large Language Models

Yuhao Zhang; Zhongliang Yang; Linna Zhou

arXiv:2601.00348·cs.CL·January 5, 2026

Robust Uncertainty Quantification for Factual Generation of Large Language Models

Yuhao Zhang, Zhongliang Yang, Linna Zhou

PDF

Open Access

TL;DR

This paper introduces a novel uncertainty quantification method (RU) to improve the detection of hallucinations in large language models during factual generation, especially under adversarial questioning, enhancing reliability.

Contribution

The study proposes a robust uncertainty quantification approach tailored for factual generation, addressing limitations of existing methods in non-canonical and adversarial scenarios.

Findings

01

RU outperforms baseline methods in trap question tests

02

Average ROCAUC increase of 0.1-0.2 across four models

03

Effective in detecting hallucinations in factual generation

Abstract

The rapid advancement of large language model(LLM) technology has facilitated its integration into various domains of professional and daily life. However, the persistent challenge of LLM hallucination has emerged as a critical limitation, significantly compromising the reliability and trustworthiness of AI-generated content. This challenge has garnered significant attention within the scientific community, prompting extensive research efforts in hallucination detection and mitigation strategies. Current methodological frameworks reveal a critical limitation: traditional uncertainty quantification approaches demonstrate effectiveness primarily within conventional question-answering paradigms, yet exhibit notable deficiencies when confronted with non-canonical or adversarial questioning strategies. This performance gap raises substantial concerns regarding the dependability of LLM…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Misinformation and Its Impacts · Artificial Intelligence in Healthcare and Education