Noise Hypernetworks: Amortizing Test-Time Compute in Diffusion Models

Luca Eyring; Shyamgopal Karthik; Alexey Dosovitskiy; Nataniel Ruiz; Zeynep Akata

arXiv:2508.09968·cs.LG·August 14, 2025

Noise Hypernetworks: Amortizing Test-Time Compute in Diffusion Models

Luca Eyring, Shyamgopal Karthik, Alexey Dosovitskiy, Nataniel Ruiz, Zeynep Akata

PDF

1 Models 1 Video

TL;DR

This paper introduces Noise Hypernetworks for diffusion models, enabling test-time scaling benefits without the high computational cost by modulating initial noise through a learned hypernetwork.

Contribution

It proposes a novel Noise Hypernetwork framework that replaces test-time noise optimization, reducing inference overhead while maintaining model fidelity.

Findings

01

Achieves significant quality improvements with less computation.

02

Recovers a large portion of test-time optimization benefits.

03

Operates efficiently during inference without additional optimization steps.

Abstract

The new paradigm of test-time scaling has yielded remarkable breakthroughs in Large Language Models (LLMs) (e.g. reasoning models) and in generative vision models, allowing models to allocate additional computation during inference to effectively tackle increasingly complex problems. Despite the improvements of this approach, an important limitation emerges: the substantial increase in computation time makes the process slow and impractical for many applications. Given the success of this paradigm and its growing usage, we seek to preserve its benefits while eschewing the inference overhead. In this work we propose one solution to the critical problem of integrating test-time scaling knowledge into a model during post-training. Specifically, we replace reward guided test-time noise optimization in diffusion models with a Noise Hypernetwork that modulates initial input noise. We propose…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
lucaeyring/HyperNoise_Sana_Sprint_0.6B
model· ♡ 10
♡ 10

Videos

Noise Hypernetworks: Amortizing Test-Time Compute in Diffusion Models· slideslive