Adaptive Non-uniform Timestep Sampling for Accelerating Diffusion Model Training
Myunsoo Kim, Donghyeon Ki, Seong-Woong Shim, Byung-Jun Lee

TL;DR
This paper introduces an adaptive non-uniform timestep sampling method for diffusion models that accelerates training and enhances performance by focusing on high-variance, critical timesteps during the learning process.
Contribution
The paper presents a novel adaptive sampling technique that dynamically prioritizes important timesteps, improving training efficiency and robustness across multiple datasets and architectures.
Findings
Accelerates diffusion model training by focusing on critical timesteps.
Improves model performance at convergence.
Demonstrates robustness across datasets and architectures.
Abstract
As a highly expressive generative model, diffusion models have demonstrated exceptional success across various domains, including image generation, natural language processing, and combinatorial optimization. However, as data distributions grow more complex, training these models to convergence becomes increasingly computationally intensive. While diffusion models are typically trained using uniform timestep sampling, our research shows that the variance in stochastic gradients varies significantly across timesteps, with high-variance timesteps becoming bottlenecks that hinder faster convergence. To address this issue, we introduce a non-uniform timestep sampling method that prioritizes these more critical timesteps. Our method tracks the impact of gradient updates on the objective for each timestep, adaptively selecting those most likely to minimize the objective effectively.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis
MethodsDiffusion
