Free Lunch for Pass@$k$? Low Cost Diverse Sampling for Diffusion Language Models
Sean Lamont, Christian Walder, Paul Montague, Amir Dezfouli, Michael Norrish

TL;DR
This paper introduces a simple, training-free method to increase diversity in diffusion language model outputs, significantly improving Pass@$k$ performance on reasoning benchmarks without additional training or high computational costs.
Contribution
The authors propose a novel, low-cost sampling intervention that actively penalizes redundancy among samples in diffusion language models, enhancing diversity without retraining.
Findings
Improved diversity and Pass@$k$ metrics on HumanEval and GSM8K benchmarks.
Significant performance gains across various temperature settings.
Method incurs negligible computational overhead.
Abstract
Diverse outputs in text generation are necessary for effective exploration in complex reasoning tasks, such as code generation and mathematical problem solving. Such Pass@ problems benefit from distinct candidates covering the solution space. However, traditional sampling approaches often waste computational resources on repetitive failure modes. While Diffusion Language Models have emerged as a competitive alternative to the prevailing Autoregressive paradigm, they remain susceptible to this redundancy, with independent samples frequently collapsing into similar modes. To address this, we propose a training free, low cost intervention to enhance generative diversity in Diffusion Language Models. Our approach modifies intermediate samples in a batch sequentially, where each sample is repelled from the feature space of previous samples, actively penalising redundancy. Unlike prior…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification
