Fewer Denoising Steps or Cheaper Per-Step Inference: Towards Compute-Optimal Diffusion Model Deployment
Zhenbang Du, Yonggan Fu, Lifu Wang, Jiayi Qian, Xiao Luo, Yingyan (Celine) Lin

TL;DR
This paper introduces PostDiff, a training-free framework that enhances diffusion model efficiency by reducing redundancy, showing that lowering per-step inference cost often yields better results than fewer denoising steps.
Contribution
PostDiff provides a novel post-training method combining mixed-resolution denoising and hybrid module caching to improve diffusion model deployment without fine-tuning.
Findings
Reducing per-step inference cost is more effective than fewer denoising steps for efficiency.
PostDiff significantly improves the fidelity-efficiency trade-off of diffusion models.
The framework is training-free and applicable to pre-trained models.
Abstract
Diffusion models have shown remarkable success across generative tasks, yet their high computational demands challenge deployment on resource-limited platforms. This paper investigates a critical question for compute-optimal diffusion model deployment: Under a post-training setting without fine-tuning, is it more effective to reduce the number of denoising steps or to use a cheaper per-step inference? Intuitively, reducing the number of denoising steps increases the variability of the distributions across steps, making the model more sensitive to compression. In contrast, keeping more denoising steps makes the differences smaller, preserving redundancy, and making post-training compression more feasible. To systematically examine this, we propose PostDiff, a training-free framework for accelerating pre-trained diffusion models by reducing redundancy at both the input level and module…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · NMR spectroscopy and applications
