Test-Time Alignment of Text-to-Image Diffusion Models via Null-Text Embedding Optimisation
Taehoon Kim, Henry Gouk, Timothy Hospedales

TL;DR
This paper introduces Null-TTA, a novel test-time alignment method for text-to-image diffusion models that optimizes unconditional embeddings in semantic space, achieving state-of-the-art alignment without reward hacking.
Contribution
Null-TTA is the first approach to align diffusion models by optimizing in semantic space, ensuring coherent alignment and preventing reward hacking during test time.
Findings
Null-TTA achieves state-of-the-art test-time alignment performance.
Null-TTA maintains strong generalization across different rewards.
Null-TTA prevents reward hacking by optimizing in semantic space.
Abstract
Test-time alignment (TTA) aims to adapt models to specific rewards during inference. However, existing methods tend to either under-optimise or over-optimise (reward hack) the target reward function. We propose Null-Text Test-Time Alignment (Null-TTA), which aligns diffusion models by optimising the unconditional embedding in classifier-free guidance, rather than manipulating latent or noise variables. Due to the structured semantic nature of the text embedding space, this ensures alignment occurs on a semantically coherent manifold and prevents reward hacking (exploiting non-semantic noise patterns to improve the reward). Since the unconditional embedding in classifier-free guidance serves as the anchor for the model's generative distribution, Null-TTA directly steers model's generative distribution towards the target reward rather than just adjusting the samples, even without updating…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning · Topic Modeling
