One-Step Distillation of Discrete Diffusion Image Generators via Fixed-Point Iteration

Chaoyang Wang; Yunhai Tong

arXiv:2605.21484·cs.CV·May 21, 2026

One-Step Distillation of Discrete Diffusion Image Generators via Fixed-Point Iteration

Chaoyang Wang, Yunhai Tong

PDF

TL;DR

This paper introduces Fixed-Point Distillation (FPD), a novel end-to-end method for one-step discrete diffusion image generation that improves speed and quality by refining a single draft with a teacher model.

Contribution

FPD constructs local correction targets through partial corruption and refinement, using a continuous feature space and a straight-through estimator for effective training and inference.

Findings

01

FPD achieves competitive visual fidelity in a single inference step.

02

It narrows the gap between one-step and multi-step diffusion models.

03

Outperforms existing discrete distillation baselines.

Abstract

Discrete diffusion models excel at visual synthesis but rely on slow, iterative decoding. Existing single-step distillation methods attempt to bypass this bottleneck, either by training auxiliary score networks that effectively double compute, or by introducing specialized parameterizations and multi-stage pipelines that fragment optimization. In this paper, we introduce Fixed-Point Distillation (FPD), an end-to-end framework that constructs local correction targets by partially corrupting the student's one-step draft and refining it with a single teacher step. To compute the training objective in a semantically meaningful space, we lift discrete tokens into continuous features and apply a multi-bandwidth drift loss that iteratively accumulates these corrections. To backpropagate through the discrete bottleneck, we employ a straight-through estimator that feeds exact hard-sampled tokens…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.