Improving Efficiency of Diffusion Models via Multi-Stage Framework and Tailored Multi-Decoder Architectures
Huijie Zhang, Yifu Lu, Ismail Alkhouri, Saiprasad Ravishankar, Dogyoon Song, Qing Qu

TL;DR
This paper introduces a multi-stage framework with tailored multi-decoder architectures to improve the training and sampling efficiency of diffusion models, addressing their computational challenges while maintaining high performance.
Contribution
The paper proposes a novel multi-stage approach with custom multi-decoder U-net architectures and a timestep clustering algorithm to enhance diffusion models' efficiency.
Findings
Significant improvements in training speed across three state-of-the-art diffusion models.
Enhanced sampling efficiency without sacrificing generative quality.
Effective resource distribution and reduced inter-stage interference.
Abstract
Diffusion models, emerging as powerful deep generative tools, excel in various applications. They operate through a two-steps process: introducing noise into training samples and then employing a model to convert random noise into new samples (e.g., images). However, their remarkable generative performance is hindered by slow training and sampling. This is due to the necessity of tracking extensive forward and reverse diffusion trajectories, and employing a large model with numerous parameters across multiple timesteps (i.e., noise levels). To tackle these challenges, we present a multi-stage framework inspired by our empirical findings. These observations indicate the advantages of employing distinct parameters tailored to each timestep while retaining universal parameters shared across all time steps. Our approach involves segmenting the time interval into multiple stages where we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neuroimaging Techniques and Applications · Advanced Mathematical Modeling in Engineering · Generative Adversarial Networks and Image Synthesis
MethodsConvolution · Concatenated Skip Connection · *Communicated@Fast*How Do I Communicate to Expedia? · Max Pooling · U-Net · Diffusion
