Stop the Flip-Flop: Context-Preserving Verification for Fast Revocable Diffusion Decoding

Yanzheng Xiang; Lan Wei; Yizhen Yao; Qinglin Zhu; Hanqi Yan; Chen Jin; Philip Alexander Teare; Dandan Zhang; Lin Gui; Amrutha Saseendran; Yulan He

arXiv:2602.06161·cs.CL·February 9, 2026

Stop the Flip-Flop: Context-Preserving Verification for Fast Revocable Diffusion Decoding

Yanzheng Xiang, Lan Wei, Yizhen Yao, Qinglin Zhu, Hanqi Yan, Chen Jin, Philip Alexander Teare, Dandan Zhang, Lin Gui, Amrutha Saseendran, Yulan He

PDF

Open Access

TL;DR

COVER introduces a novel verification method that reduces unnecessary revisions in parallel diffusion decoding, significantly speeding up inference while maintaining quality.

Contribution

The paper presents COVER, a new verification scheme that performs leave-one-out verification within a single pass, reducing flip-flop oscillations and improving decoding efficiency.

Findings

01

COVER reduces revision cycles and speeds up decoding.

02

It maintains output quality comparable to existing methods.

03

COVER adapts verification based on stability scores.

Abstract

Parallel diffusion decoding can accelerate diffusion language model inference by unmasking multiple tokens per step, but aggressive parallelism often harms quality. Revocable decoding mitigates this by rechecking earlier tokens, yet we observe that existing verification schemes frequently trigger flip-flop oscillations, where tokens are remasked and later restored unchanged. This behaviour slows inference in two ways: remasking verified positions weakens the conditioning context for parallel drafting, and repeated remask cycles consume the revision budget with little net progress. We propose COVER (Cache Override Verification for Efficient Revision), which performs leave-one-out verification and stable drafting within a single forward pass. COVER constructs two attention views via KV cache override: selected seeds are masked for verification, while their cached key value states are…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Natural Language Processing Techniques · Machine Learning and Algorithms