LogicDiff: Logic-Guided Denoising Improves Zero-Shot Reasoning in Masked Diffusion Language Models
Shaik Aman

TL;DR
LogicDiff enhances zero-shot reasoning in masked diffusion language models by using logic-role-guided unmasking, significantly improving accuracy with minimal speed impact.
Contribution
Introduces LogicDiff, a novel inference-time method that replaces confidence-based unmasking with logic-role-guided unmasking, improving reasoning performance in MDLMs.
Findings
LogicDiff improves GSM8K accuracy from 22.0% to 60.7%.
LogicDiff enhances MATH-500 accuracy from 23.6% to 29.2%.
Less than 6% speed overhead with LogicDiff.
Abstract
Masked diffusion language models (MDLMs) generate text by iteratively unmasking tokens from a fully masked sequence. Their standard confidence-based unmasking strategy systematically defers high-entropy logical connective tokens, degrading reasoning performance. We introduce LogicDiff, an inference-time method that replaces confidence-based unmasking with logic-role-guided unmasking. A lightweight classification head (4.2M parameters, 0.05% of the base model) predicts the logical role of each masked position (premise, connective, derived step, conclusion, or filler) from the base model's hidden states with 98.4% accuracy, and a dependency-ordered scheduler unmasks tokens in logical order. In zero-shot settings, LogicDiff improves LLaDA-8B-Instruct accuracy from 22.0% to 60.7% on GSM8K (+38.7 percentage points) and from 23.6% to 29.2% on MATH-500 (+5.6 pp), with less than 6% speed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
