$Z^2$-Sampling: Zero-Cost Zigzag Trajectories for Semantic Alignment in Diffusion Models

Haosen Li; Wenshuo Chen; Shaofeng Liang; Lei Wang; Kaishen Yuan; Yutao Yue

arXiv:2604.23536·cs.CV·April 28, 2026

$Z^2$-Sampling: Zero-Cost Zigzag Trajectories for Semantic Alignment in Diffusion Models

Haosen Li, Wenshuo Chen, Shaofeng Liang, Lei Wang, Kaishen Yuan, Yutao Yue

PDF

TL;DR

This paper introduces $Z^2$-Sampling, a zero-cost zigzag trajectory method for diffusion models that improves semantic alignment efficiency by algebraically eliminating off-manifold errors, outperforming existing approaches.

Contribution

It proposes Implicit Z-Sampling and $Z^2$-Sampling, reducing computational costs while enhancing semantic exploration in diffusion models through algebraic and theoretical innovations.

Findings

01

$Z^2$-Sampling restores standard 2-NFE efficiency without losing semantic quality.

02

It universally applies across architectures like U-Nets and DiTs and modalities such as images and videos.

03

The method outperforms existing techniques on the performance-efficiency Pareto frontier.

Abstract

Diffusion models have achieved unprecedented success in text-aligned generation, largely driven by Classifier-Free Guidance (CFG). However, standard CFG operates strictly on instantaneous gradients, omitting the intrinsic curvature of the data manifold. Recent methods like Zigzag-sampling (Z-Sampling) explicitly traverse multi-step forward-backward trajectories to probe this curvature, significantly improving semantic alignment. Yet, these explicit traversals triple the Neural Function Evaluation (NFE) cost and introduce unconstrained truncation errors from off-manifold evaluations, causing cumulative drift from the true marginal distribution. In this paper, we theoretically demonstrate that the explicit zigzag sequence is topologically reducible. We propose Implicit Z-Sampling, rigorously proving that intermediate states can be algebraically annihilated via operator dualities,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.