Training-Free Representation Guidance for Diffusion Models with a Representation Alignment Projector
Wenqiang Zu, Shenghao Xie, Bo Lei, Lei Ma

TL;DR
This paper introduces a guidance method for diffusion models that uses a representation alignment projector to improve semantic consistency during image generation, without altering the model architecture.
Contribution
It proposes a novel guidance scheme with a representation alignment projector that enhances semantic alignment in diffusion models during inference.
Findings
Significant reduction in FID scores on ImageNet synthesis tasks.
Outperforms existing guidance methods in semantic coherence and visual fidelity.
Provides complementary improvements when combined with classifier-free guidance.
Abstract
Recent progress in generative modeling has enabled high-quality visual synthesis with diffusion-based frameworks, supporting controllable sampling and large-scale training. Inference-time guidance methods such as classifier-free and representative guidance enhance semantic alignment by modifying sampling dynamics; however, they do not fully exploit unsupervised feature representations. Although such visual representations contain rich semantic structure, their integration during generation is constrained by the absence of ground-truth reference images at inference. This work reveals semantic drift in the early denoising stages of diffusion transformers, where stochasticity results in inconsistent alignment even under identical conditioning. To mitigate this issue, we introduce a guidance scheme using a representation alignment projector that injects representations predicted by a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Memory and Neural Computing · Ferroelectric and Negative Capacitance Devices
