Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion
Boyuan Chen, Diego Marti Monso, Yilun Du, Max Simchowitz, Russ, Tedrake, Vincent Sitzmann

TL;DR
Diffusion Forcing introduces a novel training paradigm that combines the advantages of next-token prediction and full-sequence diffusion models, enabling flexible, guided sequence generation with improved performance on decision-making tasks.
Contribution
The paper proposes Diffusion Forcing, a new method that trains diffusion models for sequence generation using independent per-token noise, blending next-token prediction with full-sequence diffusion capabilities.
Findings
Enables generation of sequences longer than training horizon.
Provides improved guidance and control during sampling.
Achieves better performance in decision-making and planning tasks.
Abstract
This paper presents Diffusion Forcing, a new training paradigm where a diffusion model is trained to denoise a set of tokens with independent per-token noise levels. We apply Diffusion Forcing to sequence generative modeling by training a causal next-token prediction model to generate one or several future tokens without fully diffusing past ones. Our approach is shown to combine the strengths of next-token prediction models, such as variable-length generation, with the strengths of full-sequence diffusion models, such as the ability to guide sampling to desirable trajectories. Our method offers a range of additional capabilities, such as (1) rolling-out sequences of continuous tokens, such as video, with lengths past the training horizon, where baselines diverge and (2) new sampling and guiding schemes that uniquely profit from Diffusion Forcing's variable-horizon and causal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗Skywork/SkyReels-V2-I2V-14B-540Pmodel· 54 dl· ♡ 8854 dl♡ 88
- 🤗Skywork/SkyReels-V2-DF-14B-540Pmodel· 91 dl· ♡ 2191 dl♡ 21
- 🤗Skywork/SkyReels-V2-T2V-14B-540Pmodel· 170 dl· ♡ 22170 dl♡ 22
- 🤗Skywork/SkyReels-V2-T2V-14B-720Pmodel· 304 dl· ♡ 42304 dl♡ 42
- 🤗Skywork/SkyReels-V2-DF-1.3B-540Pmodel· 397 dl· ♡ 45397 dl♡ 45
- 🤗Skywork/SkyReels-V2-DF-14B-720Pmodel· 441 dl· ♡ 32441 dl♡ 32
- 🤗Skywork/SkyReels-V2-I2V-1.3B-540Pmodel· 104 dl· ♡ 45104 dl♡ 45
- 🤗Skywork/SkyReels-V2-I2V-14B-720Pmodel· 373 dl· ♡ 34373 dl♡ 34
- 🤗jobs-git/SkyReels-V2-DF-14B-720Pmodel· 56 dl56 dl
- 🤗jobs-git/SkyReels-V2-I2V-14B-720Pmodel· 14 dl14 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsForensic and Genetic Research · Nuclear Materials and Properties · Magnetic confinement fusion research
MethodsSparse Evolutionary Training · Diffusion
