Accelerating Diffusion Sampling with Optimized Time Steps

Shuchen Xue; Zhaoqiang Liu; Fei Chen; Shifeng Zhang; Tianyang Hu; Enze; Xie; Zhenguo Li

arXiv:2402.17376·cs.CV·July 4, 2024·1 cites

Accelerating Diffusion Sampling with Optimized Time Steps

Shuchen Xue, Zhaoqiang Liu, Fei Chen, Shifeng Zhang, Tianyang Hu, Enze, Xie, Zhenguo Li

PDF

Open Access 1 Repo

TL;DR

This paper introduces a framework for optimizing time steps in diffusion probabilistic models to enhance sampling efficiency and image quality, especially when using fewer steps.

Contribution

It proposes a novel optimization approach for selecting non-uniform time steps tailored to specific ODE solvers in DPMs, improving sampling performance.

Findings

01

Optimized time steps significantly improve FID scores.

02

The method reduces sampling time to under 15 seconds.

03

Enhanced image quality on CIFAR-10 and ImageNet datasets.

Abstract

Diffusion probabilistic models (DPMs) have shown remarkable performance in high-resolution image synthesis, but their sampling efficiency is still to be desired due to the typically large number of sampling steps. Recent advancements in high-order numerical ODE solvers for DPMs have enabled the generation of high-quality images with much fewer sampling steps. While this is a significant development, most sampling methods still employ uniform time steps, which is not optimal when using a small number of steps. To address this issue, we propose a general framework for designing an optimization problem that seeks more appropriate time steps for a specific numerical ODE solver for DPMs. This optimization problem aims to minimize the distance between the ground-truth solution to the ODE and an approximate solution corresponding to the numerical solver. It can be efficiently solved using the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

scxue/DM-NonUniform
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing