Unsupervised Discovery of Semantic Latent Directions in Diffusion Models

Yong-Hyun Park; Mingi Kwon; Junghyo Jo; Youngjung Uh

arXiv:2302.12469·cs.CV·February 27, 2023·5 cites

Unsupervised Discovery of Semantic Latent Directions in Diffusion Models

Yong-Hyun Park, Mingi Kwon, Junghyo Jo, Youngjung Uh

PDF

Open Access

TL;DR

This paper introduces an unsupervised approach to identify interpretable semantic directions in the latent space of diffusion models, enhancing understanding and editing capabilities without supervision.

Contribution

It proposes a novel Riemannian geometry-based method to discover semantic directions in diffusion models' latent space, revealing disentangled attributes and their geometric properties.

Findings

01

Semantic directions yield disentangled attribute changes

02

Editing at different timesteps affects different attribute levels

03

Method is effective across various datasets and models

Abstract

Despite the success of diffusion models (DMs), we still lack a thorough understanding of their latent space. While image editing with GANs builds upon latent space, DMs rely on editing the conditions such as text prompts. We present an unsupervised method to discover interpretable editing directions for the latent variables $x_{t} \in X$ of DMs. Our method adopts Riemannian geometry between $X$ and the intermediate feature maps $H$ of the U-Nets to provide a deep understanding over the geometrical structure of $X$ . The discovered semantic latent directions mostly yield disentangled attribute changes, and they are globally consistent across different samples. Furthermore, editing in earlier timesteps edits coarse attributes, while ones in later timesteps focus on high-frequency details. We define the curvedness of a line segment between…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Neuroimaging Techniques and Applications

MethodsDiffusion