$Z^2$-Sampling: Zero-Cost Zigzag Trajectories for Semantic Alignment in Diffusion Models
Haosen Li, Wenshuo Chen, Shaofeng Liang, Lei Wang, Kaishen Yuan, Yutao Yue

TL;DR
This paper introduces $Z^2$-Sampling, a zero-cost zigzag trajectory method for diffusion models that improves semantic alignment efficiency by algebraically eliminating off-manifold errors, outperforming existing approaches.
Contribution
It proposes Implicit Z-Sampling and $Z^2$-Sampling, reducing computational costs while enhancing semantic exploration in diffusion models through algebraic and theoretical innovations.
Findings
$Z^2$-Sampling restores standard 2-NFE efficiency without losing semantic quality.
It universally applies across architectures like U-Nets and DiTs and modalities such as images and videos.
The method outperforms existing techniques on the performance-efficiency Pareto frontier.
Abstract
Diffusion models have achieved unprecedented success in text-aligned generation, largely driven by Classifier-Free Guidance (CFG). However, standard CFG operates strictly on instantaneous gradients, omitting the intrinsic curvature of the data manifold. Recent methods like Zigzag-sampling (Z-Sampling) explicitly traverse multi-step forward-backward trajectories to probe this curvature, significantly improving semantic alignment. Yet, these explicit traversals triple the Neural Function Evaluation (NFE) cost and introduce unconstrained truncation errors from off-manifold evaluations, causing cumulative drift from the true marginal distribution. In this paper, we theoretically demonstrate that the explicit zigzag sequence is topologically reducible. We propose Implicit Z-Sampling, rigorously proving that intermediate states can be algebraically annihilated via operator dualities,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
