Edit-Based Refinement for Parallel Masked Diffusion Language Models

Houxing Ren; Mingjie Zhan; Zimu Lu; Ke Wang; Yunqiao Yang; Haotian Hou; Junting Pan; Hongsheng Li

arXiv:2605.09603·cs.CL·May 12, 2026

TL;DR

This paper introduces ME-DLM, an edit-based refinement framework for parallel masked diffusion language models that enhances multi-token generation quality and efficiency through minimal post-editing steps.

Contribution

It proposes a novel edit-based refinement method that improves sequence-level consistency and robustness in parallel diffusion language models.

Findings

01. Achieves an 11.6-point improvement on HumanEval

02. Achieves a 33.6-point improvement on GSM8K

03. Reaches comparable performance with one-eighth of the diffusion steps

Abstract

Masked diffusion language models enable parallel token generation and offer improved decoding efficiency over autoregressive models. However, their performance degrades significantly when generating multiple tokens simultaneously, due to a mismatch between token-level training objectives and joint sequence consistency. In this paper, we propose ME-DLM, an edit-based refinement framework that augments diffusion generation with lightweight post-editing steps. After producing an initial complete response, the model refines it through minimal edit operations, including replacement, deletion, and insertion, conditioned on the full sequence. Training supervision is derived from edit distance, providing a deterministic signal under a fixed canonicalization scheme for learning minimal corrections. This approach encourages sequence-level consistency through globally conditioned edits while…
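To make the edit-distance supervision concrete, here is a minimal sketch of how a set of replacement, deletion, and insertion operations can be derived deterministically from a draft and a reference sequence. It is not the authors' implementation: the function names `minimal_edits` and `apply_edits`, the `(op, position, token)` tuple format, and the tie-breaking order (keep/replace before delete before insert) are all assumptions chosen for illustration. The abstract says only that a fixed canonicalization scheme is used, not which one.

```python
from typing import List, Tuple

# (operation, position in the draft, token); hypothetical format for illustration.
Edit = Tuple[str, int, str]


def minimal_edits(draft: List[str], target: List[str]) -> List[Edit]:
    """Return one minimal edit script turning `draft` into `target`."""
    n, m = len(draft), len(target)
    # dist[i][j] is the Levenshtein distance between draft[:i] and target[:j].
    dist = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(n + 1):
        dist[i][0] = i
    for j in range(m + 1):
        dist[0][j] = j
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = 0 if draft[i - 1] == target[j - 1] else 1
            dist[i][j] = min(
                dist[i - 1][j - 1] + sub,  # keep or replace
                dist[i - 1][j] + 1,        # delete draft[i-1]
                dist[i][j - 1] + 1,        # insert target[j-1]
            )

    # Backtrace with a fixed preference order so the script is deterministic.
    # Assumed order: keep/replace, then delete, then insert.
    edits: List[Edit] = []
    i, j = n, m
    while i > 0 or j > 0:
        sub = 1
        if i > 0 and j > 0:
            sub = 0 if draft[i - 1] == target[j - 1] else 1
        if i > 0 and j > 0 and dist[i][j] == dist[i - 1][j - 1] + sub:
            if sub == 1:
                edits.append(("replace", i - 1, target[j - 1]))
            i, j = i - 1, j - 1
        elif i > 0 and dist[i][j] == dist[i - 1][j] + 1:
            edits.append(("delete", i - 1, draft[i - 1]))
            i -= 1
        else:
            # Insert before draft position i (i.e. after draft[:i]).
            edits.append(("insert", i, target[j - 1]))
            j -= 1
    edits.reverse()
    return edits


def apply_edits(draft: List[str], edits: List[Edit]) -> List[str]:
    """Apply a script produced by `minimal_edits` to the draft."""
    out: List[str] = []
    k = 0
    for pos in range(len(draft) + 1):
        # Insertions at `pos` are emitted before the draft token at `pos`.
        while k < len(edits) and edits[k][0] == "insert" and edits[k][1] == pos:
            out.append(edits[k][2])
            k += 1
        if pos == len(draft):
            break
        if k < len(edits) and edits[k][1] == pos:
            op, _, tok = edits[k]
            k += 1
            if op == "replace":
                out.append(tok)
            # A delete emits nothing.
        else:
            out.append(draft[pos])
    return out


draft = ["a", "b", "c"]
target = ["a", "x", "c", "d"]
script = minimal_edits(draft, target)
print(script)
print(apply_edits(draft, script))
```

Because ties between equally short scripts are always broken the same way, the same draft and reference pair always yields the same script, which is the property a deterministic training signal needs. A different preference order would give a different but equally minimal script.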

Code & Models

Repositories

renhouxing/ME-DLM
github
