Reversible Diffusion Decoding for Diffusion Language Models

Xinyun Wang; Min Zhang; Sen Cui; Zhikang Chen; Bo Jiang; Kun Kuang; Mingbao Lin

arXiv:2602.00150·cs.CL·February 3, 2026

Reversible Diffusion Decoding for Diffusion Language Models

Xinyun Wang, Min Zhang, Sen Cui, Zhikang Chen, Bo Jiang, Kun Kuang, Mingbao Lin

PDF

Open Access

TL;DR

Reversible Diffusion Decoding (RDD) introduces reversibility into diffusion language models, enabling recovery from stagnation and errors during parallel token generation, thus improving robustness and quality with minimal extra computation.

Contribution

The paper presents RDD, a novel reversible decoding framework that enhances diffusion language models by allowing backtracking and selective reinitialization during generation.

Findings

01

RDD improves generation robustness and quality.

02

RDD achieves these benefits with minimal computational overhead.

03

Experiments validate RDD's effectiveness over baseline models.

Abstract

Diffusion language models enable parallel token generation through block-wise decoding, but their irreversible commitments can lead to stagnation, where the reverse diffusion process fails to make further progress under a suboptimal context.We propose Reversible Diffusion Decoding (RDD), a decoding framework that introduces reversibility into block-wise diffusion generation. RDD detects stagnation as a state-dependent failure of the reverse process and enables efficient backtracking to earlier blocks without recomputation via cached model states. To avoid repeated failure trajectories, RDD applies confidence-guided re-masking to selectively reinitialize uncertain tokens while preserving reliable context.This reversible formulation allows decoding to recover from early commitment errors while maintaining the parallel efficiency of diffusion-based generation. Experiments show that RDD…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Generative Adversarial Networks and Image Synthesis · Natural Language Processing Techniques