Robust Uncertainty Quantification for Factual Generation of Large Language Models
Yuhao Zhang, Zhongliang Yang, Linna Zhou

TL;DR
This paper introduces a novel uncertainty quantification method (RU) to improve the detection of hallucinations in large language models during factual generation, especially under adversarial questioning, enhancing reliability.
Contribution
The study proposes a robust uncertainty quantification approach tailored for factual generation, addressing limitations of existing methods in non-canonical and adversarial scenarios.
Findings
RU outperforms baseline methods in trap question tests
Average ROCAUC increase of 0.1-0.2 across four models
Effective in detecting hallucinations in factual generation
Abstract
The rapid advancement of large language model(LLM) technology has facilitated its integration into various domains of professional and daily life. However, the persistent challenge of LLM hallucination has emerged as a critical limitation, significantly compromising the reliability and trustworthiness of AI-generated content. This challenge has garnered significant attention within the scientific community, prompting extensive research efforts in hallucination detection and mitigation strategies. Current methodological frameworks reveal a critical limitation: traditional uncertainty quantification approaches demonstrate effectiveness primarily within conventional question-answering paradigms, yet exhibit notable deficiencies when confronted with non-canonical or adversarial questioning strategies. This performance gap raises substantial concerns regarding the dependability of LLM…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Misinformation and Its Impacts · Artificial Intelligence in Healthcare and Education
