Fewer Denoising Steps or Cheaper Per-Step Inference: Towards Compute-Optimal Diffusion Model Deployment

Zhenbang Du; Yonggan Fu; Lifu Wang; Jiayi Qian; Xiao Luo; Yingyan (Celine) Lin

arXiv:2508.06160·cs.CV·August 11, 2025

Fewer Denoising Steps or Cheaper Per-Step Inference: Towards Compute-Optimal Diffusion Model Deployment

Zhenbang Du, Yonggan Fu, Lifu Wang, Jiayi Qian, Xiao Luo, Yingyan (Celine) Lin

PDF

Open Access

TL;DR

This paper introduces PostDiff, a training-free framework that enhances diffusion model efficiency by reducing redundancy, showing that lowering per-step inference cost often yields better results than fewer denoising steps.

Contribution

PostDiff provides a novel post-training method combining mixed-resolution denoising and hybrid module caching to improve diffusion model deployment without fine-tuning.

Findings

01

Reducing per-step inference cost is more effective than fewer denoising steps for efficiency.

02

PostDiff significantly improves the fidelity-efficiency trade-off of diffusion models.

03

The framework is training-free and applicable to pre-trained models.

Abstract

Diffusion models have shown remarkable success across generative tasks, yet their high computational demands challenge deployment on resource-limited platforms. This paper investigates a critical question for compute-optimal diffusion model deployment: Under a post-training setting without fine-tuning, is it more effective to reduce the number of denoising steps or to use a cheaper per-step inference? Intuitively, reducing the number of denoising steps increases the variability of the distributions across steps, making the model more sensitive to compression. In contrast, keeping more denoising steps makes the differences smaller, preserving redundancy, and making post-training compression more feasible. To systematically examine this, we propose PostDiff, a training-free framework for accelerating pre-trained diffusion models by reducing redundancy at both the input level and module…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsModel Reduction and Neural Networks · NMR spectroscopy and applications