DynHD: Hallucination Detection for Diffusion Large Language Models via Denoising Dynamics Deviation Learning
Yanyu Qian, Yue Tan, Yixin Liu, Wang Yu, Shirui Pan

TL;DR
DynHD is a novel method for hallucination detection in diffusion large language models that leverages both token-level semantic signals and the evolution of uncertainty during denoising to improve reliability.
Contribution
It introduces a semantic-aware evidence construction and a deviation-based detection approach that models denoising dynamics for more accurate hallucination detection.
Findings
Outperforms state-of-the-art baselines in multiple benchmarks
Achieves higher detection accuracy and efficiency
Effectively models denoising dynamics for hallucination detection
Abstract
Diffusion large language models (D-LLMs) have emerged as a promising alternative to auto-regressive models due to their iterative refinement capabilities. However, hallucinations remain a critical issue that hinders their reliability. To detect hallucination responses from model outputs, token-level uncertainty (e.g., entropy) has been widely used as an effective signal to indicate potential factual errors. Nevertheless, the fixed-length generation paradigm of D-LLMs implies that tokens contribute unevenly to hallucination detection, with only a small subset providing meaningful signals. Moreover, the evolution trend of uncertainty throughout the diffusion process can also provide important signals, highlighting the necessity of modeling its denoising dynamics for hallucination detection. In this paper, we propose DynHD that bridge these gaps from both spatial (token sequence) and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMental Health via Writing · Adversarial Robustness in Machine Learning · Misinformation and Its Impacts
