DiffRatio: Training One-Step Diffusion Models Without Teacher Supervision
Wenlin Chen, Mingtian Zhang, Jiajun He, Zijing Ou, Jos\'e Miguel Hern\'andez-Lobato, Bernhard Sch\"olkopf, David Barber

TL;DR
DiffRatio introduces a novel method for training one-step diffusion models by directly estimating score differences via learned density ratios, reducing bias and improving efficiency and quality over traditional teacher-supervised distillation methods.
Contribution
The paper proposes a new framework, DiffRatio, that simplifies training, reduces bias, and enhances efficiency by directly estimating score differences with a lightweight density-ratio network.
Findings
Outperforms most teacher-supervised methods on CIFAR-10 and ImageNet.
Reduces gradient estimation bias and auxiliary network size.
Enables inference-time scaling for improved quality without extra training.
Abstract
Score-based distillation methods (e.g., variational score distillation) train one-step diffusion models by first pre-training a teacher score model and then distilling it into a one-step student model. However, the gradient estimator in the distillation stage usually suffers from two sources of bias: (1) biased teacher supervision due to score estimation error incurred during pre-training, and (2) the student model's score estimation error during distillation. These biases can degrade the quality of the resulting one-step diffusion model. To address this, we propose DiffRatio, a new framework for training one-step diffusion models: instead of estimating the teacher and student scores independently and then taking their difference, we directly estimate the score difference as the gradient of a learned log density ratio between the student and data distributions across diffusion time…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsProcess Optimization and Integration · Advanced Control Systems Optimization
MethodsDiffusion
