Generalized Discrete Diffusion with Self-Correction

Linxuan Wang; Ziyi Wang; Yikun Bai; Wei Deng; Guang Lin; Qifan Song

arXiv:2603.02230 · cs.LG · March 4, 2026

Open Access

TL;DR

This paper introduces SCDD, a discrete diffusion model with explicit state transitions and learned self-correction, improving parallel decoding efficiency while maintaining high-quality generation.

Contribution

The paper reformulates pretraining-based self-correction in discrete diffusion models using explicit state transitions learned directly in discrete time, which simplifies training and makes parallel decoding more efficient.

Findings

01. Enables more efficient parallel decoding.

02. Preserves generation quality comparable to prior methods.

03. Simplifies training by removing redundant steps.

Abstract

Self-correction is an effective technique for maintaining parallel sampling in discrete diffusion models with minimal performance degradation. Prior work has explored self-correction at inference time or during post-training; however, such approaches often suffer from limited generalization and may impair reasoning performance. GIDD pioneers pretraining-based self-correction via a multi-step BERT-style uniform-absorbing objective. However, GIDD relies on a continuous interpolation-based pipeline with opaque interactions between uniform transitions and absorbing masks, which complicates hyperparameter tuning and hinders practical performance. In this work, we propose a Self-Correcting Discrete Diffusion (SCDD) model to reformulate pretrained self-correction with explicit state transitions and learn directly in discrete time. Our framework also simplifies the training noise schedule,…
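
To make the abstract's vocabulary concrete, the following is a minimal sketch of one discrete-time forward corruption step that combines absorbing (mask) and uniform transitions as explicit per-token states. It is an illustration only and should not be read as the SCDD or GIDD algorithm: the function name `corrupt_step`, the `MASK` sentinel, and the probabilities `p_mask` and `p_uniform` are hypothetical choices, and the abstract does not specify the actual transition probabilities or noise schedule.

```python
import random

MASK = -1  # hypothetical sentinel id for the absorbing [MASK] state


def corrupt_step(tokens, vocab_size, p_mask, p_uniform, rng=None):
    """Apply one illustrative forward transition to each token independently.

    Each unmasked token is masked with probability p_mask, replaced by a
    uniformly sampled vocabulary token with probability p_uniform, and kept
    otherwise. Tokens that are already masked stay masked (absorbing state).
    """
    if p_mask < 0.0 or p_uniform < 0.0 or p_mask + p_uniform > 1.0:
        raise ValueError("probabilities must be non-negative and sum to at most 1")
    rng = rng or random.Random()
    out = []
    for tok in tokens:
        if tok == MASK:
            out.append(MASK)  # absorbing: once masked, always masked
            continue
        u = rng.random()  # uniform draw in [0, 1)
        if u < p_mask:
            out.append(MASK)  # absorbing transition
        elif u < p_mask + p_uniform:
            out.append(rng.randrange(vocab_size))  # uniform transition
        else:
            out.append(tok)  # token is left unchanged
    return out


if __name__ == "__main__":
    noisy = corrupt_step([5, 7, 2, 9], vocab_size=10, p_mask=0.3, p_uniform=0.2,
                         rng=random.Random(0))
    print(noisy)
```

Under this kind of corruption, a denoiser sees two sorts of errors: masked positions it must fill in, and visible tokens that may have been replaced at random. Learning to revise the second sort is, loosely, what makes self-correction possible; how SCDD weights and schedules the two is left to the paper.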

Taxonomy

Topics: Model Reduction and Neural Networks · Generative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning