Ctrl-Z Sampling: Scaling Diffusion Sampling with Controlled Random Zigzag Explorations
Shunqi Mao, Wei Guo, Chaoyi Zhang, Jieting Long, Ke Xie, Weidong Cai

TL;DR
This paper introduces Ctrl-Z Sampling, a scalable method for improving diffusion model sampling by detecting quality plateaus and exploring alternative denoising paths to enhance generated sample quality.
Contribution
We propose Ctrl-Z Sampling, a novel scalable sampling strategy that detects quality plateaus and adaptively explores alternative trajectories to escape local optima in diffusion models.
Findings
Consistently improves generation quality over existing samplers.
Effective across different NFE budgets and diffusion frameworks.
Offers a scalable compute-quality trade-off.
Abstract
Diffusion models generate conditional samples by progressively denoising Gaussian noise, yet the denoising trajectory can stall at visually plausible but low-quality outcomes with conditional misalignment or structural artifacts. We interpret this behavior as local optima in a surrogate quality landscape: Once early denoising commits to a suboptimal global structure, later steps mainly sharpen details and seldom correct the underlying mistake. While existing inference-time approaches explore alternative diffusion states via re-noising with fixed strength or direction, they exhibit limited capacity to escape steep quality plateaus. We propose Controlled Random Zigzag Sampling (Ctrl-Z Sampling),a scalable sampling strategy that detects plateaus in quality landscape via a surrogate score, and allocates exploration only when a plateau is detected. Upon detection, Ctrl-Z Sampling rolls back…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsDiffusion
