LogicDiff: Logic-Guided Denoising Improves Zero-Shot Reasoning in Masked Diffusion Language Models

Shaik Aman

arXiv:2603.26771·cs.CL·April 21, 2026

LogicDiff: Logic-Guided Denoising Improves Zero-Shot Reasoning in Masked Diffusion Language Models

Shaik Aman

PDF

TL;DR

LogicDiff enhances zero-shot reasoning in masked diffusion language models by using logic-role-guided unmasking, significantly improving accuracy with minimal speed impact.

Contribution

Introduces LogicDiff, a novel inference-time method that replaces confidence-based unmasking with logic-role-guided unmasking, improving reasoning performance in MDLMs.

Findings

01

LogicDiff improves GSM8K accuracy from 22.0% to 60.7%.

02

LogicDiff enhances MATH-500 accuracy from 23.6% to 29.2%.

03

Less than 6% speed overhead with LogicDiff.

Abstract

Masked diffusion language models (MDLMs) generate text by iteratively unmasking tokens from a fully masked sequence. Their standard confidence-based unmasking strategy systematically defers high-entropy logical connective tokens, degrading reasoning performance. We introduce LogicDiff, an inference-time method that replaces confidence-based unmasking with logic-role-guided unmasking. A lightweight classification head (4.2M parameters, 0.05% of the base model) predicts the logical role of each masked position (premise, connective, derived step, conclusion, or filler) from the base model's hidden states with 98.4% accuracy, and a dependency-ordered scheduler unmasks tokens in logical order. In zero-shot settings, LogicDiff improves LLaDA-8B-Instruct accuracy from 22.0% to 60.7% on GSM8K (+38.7 percentage points) and from 23.6% to 29.2% on MATH-500 (+5.6 pp), with less than 6% speed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.