OMS-DPM: Optimizing the Model Schedule for Diffusion Probabilistic Models
Enshu Liu, Xuefei Ning, Zinan Lin, Huazhong Yang, Yu Wang

TL;DR
This paper introduces OMS-DPM, a method to optimize the sequence of models used during diffusion probabilistic model generation, significantly improving speed without sacrificing quality by leveraging a model schedule approach.
Contribution
It proposes a predictor-based algorithm to optimize model schedules for DPMs, enabling faster generation with maintained quality across multiple datasets.
Findings
OMS-DPM outperforms prior methods in quality-speed trade-offs.
Achieves 2x acceleration on Stable Diffusion checkpoints.
Effective across diverse datasets like CIFAR-10, CelebA, ImageNet, and LSUN.
Abstract
Diffusion probabilistic models (DPMs) are a new class of generative models that have achieved state-of-the-art generation quality in various domains. Despite the promise, one major drawback of DPMs is the slow generation speed due to the large number of neural network evaluations required in the generation process. In this paper, we reveal an overlooked dimension -- model schedule -- for optimizing the trade-off between generation quality and speed. More specifically, we observe that small models, though having worse generation quality when used alone, could outperform large models in certain generation steps. Therefore, unlike the traditional way of using a single model, using different models in different generation steps in a carefully designed \emph{model schedule} could potentially improve generation quality and speed \emph{simultaneously}. We design OMS-DPM, a predictor-based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Neural Network Applications · Generative Adversarial Networks and Image Synthesis · Explainable Artificial Intelligence (XAI)
MethodsDiffusion · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
