TL;DR
This paper introduces Noise Hypernetworks for diffusion models, enabling test-time scaling benefits without the high computational cost by modulating initial noise through a learned hypernetwork.
Contribution
It proposes a novel Noise Hypernetwork framework that replaces test-time noise optimization, reducing inference overhead while maintaining model fidelity.
Findings
Achieves significant quality improvements with less computation.
Recovers a large portion of test-time optimization benefits.
Operates efficiently during inference without additional optimization steps.
Abstract
The new paradigm of test-time scaling has yielded remarkable breakthroughs in Large Language Models (LLMs) (e.g. reasoning models) and in generative vision models, allowing models to allocate additional computation during inference to effectively tackle increasingly complex problems. Despite the improvements of this approach, an important limitation emerges: the substantial increase in computation time makes the process slow and impractical for many applications. Given the success of this paradigm and its growing usage, we seek to preserve its benefits while eschewing the inference overhead. In this work we propose one solution to the critical problem of integrating test-time scaling knowledge into a model during post-training. Specifically, we replace reward guided test-time noise optimization in diffusion models with a Noise Hypernetwork that modulates initial input noise. We propose…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
