Uncertainty Quantification for Large Language Diffusion Models

Artem Vazhentsev; Vladislav Smirnov; David Li; Maxim Panov; Timothy Baldwin; Artem Shelmanov

arXiv:2605.14570·cs.CL·May 15, 2026

Uncertainty Quantification for Large Language Diffusion Models

Artem Vazhentsev, Vladislav Smirnov, David Li, Maxim Panov, Timothy Baldwin, Artem Shelmanov

PDF

TL;DR

This paper introduces a novel, efficient uncertainty quantification method for Large Language Diffusion Models, enabling reliable hallucination detection with minimal computational cost.

Contribution

It presents the first systematic UQ approach for LLDMs, leveraging iterative denoising signals and adapting a state-of-the-art method for improved efficiency.

Findings

01

Achieves near state-of-the-art uncertainty detection with up to 100x less computation.

02

Utilizes intermediate generations and denoising dynamics for zero-shot UQ signals.

03

Demonstrates effectiveness across multiple tasks, datasets, and models.

Abstract

Large Language Diffusion Models (LLDMs) are emerging as an alternative to autoregressive models, offering faster inference through higher parallelism. Similar to autoregressive LLMs, they remain prone to hallucinations, making reliable uncertainty quantification (UQ) crucial for safe deployment. However, existing UQ methods are fundamentally misaligned with this new paradigm: they assume autoregressive factorization or use expensive repeated sampling, negating the efficiency of LLDMs. In this work, we present the first systematic study of UQ for LLDMs and propose lightweight, zero-shot uncertainty signals derived from the iterative denoising process, leveraging intermediate generations, token remasking dynamics, and denoising complexity. We further adapt a state-of-the-art UQ method to LLDMs by combining masked diffusion likelihoods with trajectory-based semantic dissimilarity. We prove…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.