DyWeight: Dynamic Gradient Weighting for Few-Step Diffusion Sampling
Tong Zhao, Mingkun Lei, Liangyu Yuan, Yanming Yang, Chenxi Song, Yang Wang, Beier Zhu, Chi Zhang

TL;DR
DyWeight is a novel learning-based multi-step solver for diffusion models that adaptively combines historical gradients, enabling faster sampling with fewer evaluations while maintaining high visual quality.
Contribution
Introduces DyWeight, a lightweight, implicit multi-step solver that learns adaptive parameters to improve diffusion sampling efficiency and accuracy.
Findings
Achieves superior visual fidelity with fewer function evaluations.
Demonstrates state-of-the-art efficiency across multiple datasets.
Maintains stability and quality with large integration steps.
Abstract
Diffusion Models (DMs) have achieved state-of-the-art generative performance across multiple modalities, yet their sampling process remains prohibitively slow due to the need for hundreds of function evaluations. Recent progress in multi-step ODE solvers has greatly improved efficiency by reusing historical gradients, but existing methods rely on handcrafted coefficients that fail to adapt to the non-stationary dynamics of diffusion sampling. To address this limitation, we propose Dynamic Gradient Weighting (DyWeight), a lightweight, learning-based multi-step solver that introduces a streamlined implicit coupling paradigm. By relaxing classical numerical constraints, DyWeight learns unconstrained time-varying parameters that adaptively aggregate historical gradients while intrinsically scaling the effective step size. This implicit time calibration accurately aligns the solver's…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Model Reduction and Neural Networks · Stochastic Gradient Optimization Techniques
