Review, Remask, Refine (R3): Process-Guided Block Diffusion for Text Generation

Nikita Mounier; Parsa Idehpour

arXiv:2507.08018·cs.CL·July 14, 2025

Review, Remask, Refine (R3): Process-Guided Block Diffusion for Text Generation

Nikita Mounier, Parsa Idehpour

PDF

TL;DR

The paper introduces R3, a process-guided framework for iterative text generation that improves output quality by reviewing, remasking, and refining generated text blocks without additional training.

Contribution

R3 is a novel, training-free framework that leverages a Process Reward Model to guide targeted correction in pre-trained masked diffusion models.

Findings

01

Enhanced text generation quality demonstrated

02

Applicable to various pre-trained models

03

No additional training required

Abstract

A key challenge for iterative text generation is enabling models to efficiently identify and correct their own errors. We propose Review, Remask, Refine (R3), a relatively simple yet elegant framework that requires no additional model training and can be applied to any pre-trained masked text diffusion model (e.g., LLaDA or BD3-LM). In R3, a Process Reward Model (PRM) is utilized for the Review of intermediate generated blocks. The framework then translates these PRM scores into a Remask strategy: the lower a block's PRM score, indicating potential mistakes, the greater the proportion of tokens within that block are remasked. Finally, the model is compelled to Refine these targeted segments, focusing its efforts more intensively on specific sub-optimal parts of past generations, leading to improved final output.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.