The Universal Normal Embedding
Chen Tasker, Roy Betser, Eyal Gofer, Meir Yossef Levi, Guy Gilboa

TL;DR
This paper proposes the Universal Normal Embedding hypothesis, suggesting a shared Gaussian latent space underlying generative models and vision encoders, supported by empirical evidence from a new dataset and enabling controllable image edits.
Contribution
It introduces NoiseZoo, a dataset linking diffusion noise and encoder embeddings, and demonstrates that both can be modeled as noisy linear projections of a shared Gaussian latent space.
Findings
Strong attribute prediction from linear probes in both spaces.
Controllable image edits via linear directions without architectural changes.
Empirical support for the shared Gaussian latent space hypothesis.
Abstract
Generative models and vision encoders have largely advanced on separate tracks, optimized for different goals and grounded in different mathematical principles. Yet, they share a fundamental property: latent space Gaussianity. Generative models map Gaussian noise to images, while encoders map images to semantic embeddings whose coordinates empirically behave as Gaussian. We hypothesize that both are views of a shared latent source, the Universal Normal Embedding (UNE): an approximately Gaussian latent space from which encoder embeddings and DDIM-inverted noise arise as noisy linear projections. To test our hypothesis, we introduce NoiseZoo, a dataset of per-image latents comprising DDIM-inverted diffusion noise and matching encoder representations (CLIP, DINO). On CelebA, linear probes in both spaces yield strong, aligned attribute predictions, indicating that generative noise encodes…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Face Recognition and Perception · Language and cultural evolution
