TL;DR
HyperAlign introduces a hypernetwork-based framework that enables efficient, test-time alignment of diffusion models, improving semantic consistency and visual quality without high computational costs.
Contribution
The paper presents HyperAlign, a novel hypernetwork approach for test-time diffusion model alignment that balances quality and efficiency, addressing limitations of existing methods.
Findings
HyperAlign outperforms existing methods in semantic consistency.
HyperAlign improves visual quality across multiple generative models.
The framework balances alignment effectiveness with computational efficiency.
Abstract
Diffusion model alignment aims to bridge the gap between generated outputs and human preferences by enhancing both semantic consistency with textual prompts and overall visual quality. Existing alignment methods face a challenging trade-off: test-time approaches enable input-specific adaptability but introduce significant computational overhead and tend to under-optimize, while fine-tuning approaches risk reward over-optimization and loss of generation diversity. To bridge this gap, we propose HyperAlign, a framework that trains a hypernetwork for efficient and effective test-time alignment. Instead of modifying latent states directly, HyperAlign dynamically generates input-and-state-conditioned low-rank adaptation weights to modulate the denoising trajectory toward target rewards. We introduce multiple HyperAlign variants of varying granularity to balance alignment quality and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
