Discovering Interpretable Directions in the Semantic Latent Space of Diffusion Models
Ren\'e Haas, Inbar Huberman-Spiegelglas, Rotem Mulayoff, Stella, Gra{\ss}hof, Sami S. Brandt, Tomer Michaeli

TL;DR
This paper investigates the semantic latent space of denoising diffusion models, proposing novel methods to discover interpretable directions for image editing without modifying the model or using external guidance.
Contribution
It introduces unsupervised and supervised techniques for identifying meaningful semantic directions in the diffusion model's latent space, enhancing interpretability and editing capabilities.
Findings
Principal components reveal global semantic directions.
Spectral analysis uncovers image-specific semantic directions.
Supervised methods enable attribute-based semantic editing.
Abstract
Denoising Diffusion Models (DDMs) have emerged as a strong competitor to Generative Adversarial Networks (GANs). However, despite their widespread use in image synthesis and editing applications, their latent space is still not as well understood. Recently, a semantic latent space for DDMs, coined `-space', was shown to facilitate semantic image editing in a way reminiscent of GANs. The -space is comprised of the bottleneck activations in the DDM's denoiser across all timesteps of the diffusion process. In this paper, we explore the properties of h-space and propose several novel methods for finding meaningful semantic directions within it. We start by studying unsupervised methods for revealing interpretable semantic directions in pretrained DDMs. Specifically, we show that global latent directions emerge as the principal components in the latent space. Additionally, we provide a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCell Image Analysis Techniques · Generative Adversarial Networks and Image Synthesis · Advanced Vision and Imaging
MethodsDiffusion
