Adaptive Non-uniform Timestep Sampling for Accelerating Diffusion Model Training

Myunsoo Kim; Donghyeon Ki; Seong-Woong Shim; Byung-Jun Lee

arXiv:2411.09998·cs.LG·October 27, 2025

Adaptive Non-uniform Timestep Sampling for Accelerating Diffusion Model Training

Myunsoo Kim, Donghyeon Ki, Seong-Woong Shim, Byung-Jun Lee

PDF

Open Access

TL;DR

This paper introduces an adaptive non-uniform timestep sampling method for diffusion models that accelerates training and enhances performance by focusing on high-variance, critical timesteps during the learning process.

Contribution

The paper presents a novel adaptive sampling technique that dynamically prioritizes important timesteps, improving training efficiency and robustness across multiple datasets and architectures.

Findings

01

Accelerates diffusion model training by focusing on critical timesteps.

02

Improves model performance at convergence.

03

Demonstrates robustness across datasets and architectures.

Abstract

As a highly expressive generative model, diffusion models have demonstrated exceptional success across various domains, including image generation, natural language processing, and combinatorial optimization. However, as data distributions grow more complex, training these models to convergence becomes increasingly computationally intensive. While diffusion models are typically trained using uniform timestep sampling, our research shows that the variance in stochastic gradients varies significantly across timesteps, with high-variance timesteps becoming bottlenecks that hinder faster convergence. To address this issue, we introduce a non-uniform timestep sampling method that prioritizes these more critical timesteps. Our method tracks the impact of gradient updates on the objective for each timestep, adaptively selecting those most likely to minimize the objective effectively.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis

MethodsDiffusion