Generalized Discrete Diffusion with Self-Correction
Linxuan Wang, Ziyi Wang, Yikun Bai, Wei Deng, Guang Lin, Qifan Song

TL;DR
This paper introduces SCDD, a discrete diffusion model with explicit state transitions and learned self-correction, improving parallel decoding efficiency while maintaining high-quality generation.
Contribution
It reformulates self-correction in discrete diffusion models with explicit states, simplifying training and enhancing decoding efficiency.
Findings
Enables more efficient parallel decoding.
Preserves generation quality comparable to prior methods.
Simplifies training by removing redundant steps.
Abstract
Self-correction is an effective technique for maintaining parallel sampling in discrete diffusion models with minimal performance degradation. Prior work has explored self-correction at inference time or during post-training; however, such approaches often suffer from limited generalization and may impair reasoning performance. GIDD pioneers pretraining-based self-correction via a multi-step BERT-style uniform-absorbing objective. However, GIDD relies on a continuous interpolation-based pipeline with opaque interactions between uniform transitions and absorbing masks, which complicates hyperparameter tuning and hinders practical performance. In this work, we propose a Self-Correcting Discrete Diffusion (SCDD) model to reformulate pretrained self-correction with explicit state transitions and learn directly in discrete time. Our framework also simplifies the training noise schedule,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Generative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning
