Search or Accelerate: Confidence-Switched Position Beam Search for Diffusion Language Models

Mingyu Cao; Alvaro H.C. Correia; Christos Louizos; Shiwei Liu; Lu Yin

arXiv:2602.10953·cs.CL·February 26, 2026

Search or Accelerate: Confidence-Switched Position Beam Search for Diffusion Language Models

Mingyu Cao, Alvaro H.C. Correia, Christos Louizos, Shiwei Liu, Lu Yin

PDF

Open Access

TL;DR

SOAR is a confidence-adaptive decoding algorithm for diffusion language models that improves reasoning and code generation quality by dynamically balancing search breadth and speed based on model confidence.

Contribution

It introduces a training-free, confidence-switched decoding method that enhances diffusion language model performance without additional training.

Findings

01

Improves reasoning and code generation benchmarks.

02

Balances quality and efficiency effectively.

03

Maintains competitive inference speed.

Abstract

Diffusion Language Models (DLMs) generate text by iteratively denoising a masked sequence, repeatedly deciding which positions to commit at each step. Standard decoding follows a greedy rule: unmask the most confident positions, yet this local choice can lock the model into a suboptimal unmasking order, especially on reasoning-heavy prompts. We present SOAR, a training-free decoding algorithm that adapts its behavior to the model's uncertainty. When confidence is low, SOAR briefly widens the search over alternative unmasking decisions to avoid premature commitments; when confidence is high, it collapses the search and decodes many positions in parallel to reduce the number of denoising iterations. Across mathematical reasoning and code generation benchmarks (GSM8K, MBPP, HumanEval) on Dream-7B and LLaDA-8B, SOAR improves generation quality while maintaining competitive inference speed,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis