dVoting: Fast Voting for dLLMs
Sicheng Feng, Zigeng Chen, Xinyin Ma, Gongfan Fang, Xinchao Wang

TL;DR
dVoting is a novel technique that enhances the reasoning ability of diffusion large language models by iterative token refinement without additional training, significantly improving performance on multiple benchmarks.
Contribution
This paper introduces dVoting, a fast voting method that leverages the parallel token generation of dLLMs to improve reasoning accuracy through iterative refinement without retraining.
Findings
Achieves up to 7.66% improvement on GSM8K.
Improves performance on MATH500, ARC-C, and MMLU benchmarks.
Demonstrates consistent performance gains across various tasks.
Abstract
Diffusion Large Language Models (dLLMs) represent a new paradigm beyond autoregressive modeling, offering competitive performance while naturally enabling a flexible decoding process. Specifically, dLLMs can generate tokens at arbitrary positions in parallel, endowing them with significant potential for parallel test-time scaling, which was previously constrained by severe inefficiency in autoregressive modeling. In this work, we introduce dVoting, a fast voting technique that boosts reasoning capability without training, with only an acceptable extra computational overhead. dVoting is motivated by the observation that, across multiple samples for the same prompt, token predictions remain largely consistent, whereas performance is determined by a small subset of tokens exhibiting cross-sample variability. Leveraging the arbitrary-position generation capability of dLLMs, dVoting performs…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Speech Recognition and Synthesis · Machine Learning in Healthcare
