Diffusion In Diffusion: Reclaiming Global Coherence in Semi-Autoregressive Diffusion
Linrui Ma, Yufei Cui, Kai Han, Yunhe Wang

TL;DR
This paper introduces a 'draft-then-refine' framework for semi-autoregressive diffusion models that enhances global coherence and reduces perplexity, outperforming previous models with less fine-tuning.
Contribution
The paper proposes Diffusion in Diffusion, a novel framework combining block diffusion and global bidirectional refinement to improve coherence and efficiency in discrete diffusion models.
Findings
Sets new benchmark on OpenWebText dataset
Reduces perplexity from 25.7 to 21.9 with less fine-tuning
Enhances global contextual understanding in diffusion models
Abstract
One of the most compelling features of global discrete diffusion language models is their global bidirectional contextual capability. However, existing block-based diffusion studies tend to introduce autoregressive priors, which, while offering benefits, can cause models to lose this global coherence at the macro level. To regain global contextual understanding while preserving the advantages of the semi-autoregressive paradigm, we propose Diffusion in Diffusion, a 'draft-then-refine' framework designed to overcome the irreversibility and myopia problems inherent in block diffusion models. Our approach first employs block diffusion to generate rapid drafts using small blocks, then refines these drafts through global bidirectional diffusion with a larger bidirectional receptive field. We utilize snapshot confidence remasking to identify the most critical tokens that require modification,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Language and cultural evolution · Generative Adversarial Networks and Image Synthesis
