Loopholing Discrete Diffusion: Deterministic Bypass of the Sampling Wall

Mingyu Jo; Jaesik Yoon; Justin Deschenaux; Caglar Gulcehre; Sungjin Ahn

arXiv:2510.19304·cs.LG·May 14, 2026

Loopholing Discrete Diffusion: Deterministic Bypass of the Sampling Wall

Mingyu Jo, Jaesik Yoon, Justin Deschenaux, Caglar Gulcehre, Sungjin Ahn

PDF

1 Repo 1 Video

TL;DR

The paper introduces Loopholing Discrete Diffusion Models (LDDMs), a deterministic mechanism that preserves distributional information in discrete diffusion models, significantly improving text generation quality and reasoning task performance.

Contribution

It proposes a novel loopholing mechanism with a self-conditioning training strategy, reducing the sampling wall in discrete diffusion models and enhancing generative and reasoning capabilities.

Findings

01

Reduced perplexity by up to 61% over baselines.

02

Achieved comparable or better performance than autoregressive models.

03

Improved coherence and reasoning in generated text.

Abstract

Discrete diffusion models offer a promising alternative to autoregressive generation through parallel decoding, but they suffer from a sampling wall: once categorical sampling occurs, rich distributional information collapses into one-hot vectors and cannot be propagated across steps, forcing subsequent steps to operate with limited information. To mitigate this problem, we introduce Loopholing, a novel and simple mechanism that preserves this information via a deterministic latent pathway, leading to Loopholing Discrete Diffusion Models (LDDMs). Trained efficiently with a self-conditioning strategy that avoids unrolling the full denoising trajectory, LDDMs achieve substantial gains-reducing generative perplexity by up to 61% over prior baselines, thereby closing (and in some cases surpassing) the gap with autoregressive models, and producing more coherent text. Applied to reasoning…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ahn-ml/lddm
github

Videos

Loopholing Discrete Diffusion: Deterministic Bypass of the Sampling Wall· slideslive