TL;DR
This paper uncovers collapse errors in deterministic diffusion model samplers, showing they cause data concentration issues, and analyzes their causes and potential mitigation strategies through extensive empirical experiments.
Contribution
It introduces the concept of collapse errors in ODE-based diffusion sampling and provides empirical evidence and analysis of their causes and effects.
Findings
Collapse errors cause data over-concentration in local space.
Score learning in low noise regimes affects high noise regimes adversely.
Existing techniques can mitigate some collapse errors.
Abstract
Despite the widespread adoption of deterministic samplers in diffusion models (DMs), their potential limitations remain largely unexplored. In this paper, we identify collapse errors, a previously unrecognized phenomenon in ODE-based diffusion sampling, where the sampled data is overly concentrated in local data space. To quantify this effect, we introduce a novel metric and demonstrate that collapse errors occur across a variety of settings. When investigating its underlying causes, we observe a see-saw effect, where score learning in low noise regimes adversely impacts the one in high noise regimes. This misfitting in high noise regimes, coupled with the dynamics of deterministic samplers, ultimately causes collapse errors. Guided by these insights, we apply existing techniques from sampling, training, and architecture to empirically support our explanation of collapse errors. This…
Peer Reviews
Decision·ICLR 2026 Conference Withdrawn Submission
- The paper isolates a practical failure mode in widely‑used deterministic samplers, whose popularity stems from speed, controllability, and trajectory “straightness.” Studying their limitations is valuable for real systems and for methods that rely on determinism to learn consistency between long/short trajectories (e.g. consistency and shortcut models). The background clearly motivates why ODE paths are important. - Collapse is shown at the sample level and at the distribution level on synthe
- The paper explicitly does not propose a new sampler or training objective aimed at eliminating collapse, but rather uses existing techniques to validate the cause. - The formal result Proposition 1. supports the see‑saw effect in a specific simple setting; it is not yet a general theorem that deterministic sampling must collapse under realistic assumptions. Theoretical guarantees (or impossibility results) would strengthen the paper. - The ODE is run with 100 steps versus 1000 steps for SDE
The formalization of the collapse error through the TID metric is novel to my knowledge.
1. The collapse error boils down to an underfitting behavior in the training of the neural network score estimator. The authors named it the see-saw effect to emphasize the tradeoff between the low- and high-noise regimes, but all together it reflects the inability of the NN to fully minimize the score matching loss that is aggregated across the different noise levels. Meanwhile, as the authors demonstrate, this issue can be reasonably mitigated by relatively simple techniques such as adding ski
The "collapse" behaviors of the determinstic sampler in diffusion models sampling is indeed important and worth studying. This paper did a good work in investigating the cause of error following Song-SDE 2021 setup of determinstic sampler and score network training, which is a representative baseline to start from. The investigation on the underlying reasons about the score imperfect training is also interesting. The experiments on the Song-SDE 2021 setup is pretty comprehensive.
First, the studied problem is not new and has been widely noticed by the iterature, e.g. in Song SDE. I am especially puzzled about the "collapse" behaviour naming. Usually generation collapse refers to a lack of diversity instead of a large FID/TID score. Here in the paper, the argument is essentially larger FID under determinstic sampler compared to stochastic sampler with the same score network, which has been clearly claimed already by the Song's SDE paper. The paper's contribution lies to i
- **Originality**: A new index that evaluates the collapse error of diffusion models is proposed; causes of collapse error are investigated. - **Quality**: Extensive numerical experiments are conducted to verify the reasoning of the causes of collapse error; the related work is comprehensive. - **Clarity**: The motivation for the new index is clearly articulated, and the analysis of the causes of collapse error is convincingly verified by the experimental results. - **Significance**: Collapse e
- The investigation of the collapse error of diffusion models is only based on empirical experiments, while the theoretical understanding is limited. - The techniques for mitigating collapse error are existing methods, which indicates the contribution in terms of algorithms is limited.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
