Unsupervised Discovery of Interpretable Directions in h-space of Pre-trained Diffusion Models

Zijian Zhang; Luping Liu; Zhijie Lin; Yichen Zhu; Zhou Zhao

arXiv:2310.09912·cs.CV·December 1, 2023·1 cites

Open Access

TL;DR

This paper introduces an unsupervised, learning-based method for discovering interpretable directions in the h-space of pre-trained diffusion models, so that samples can be manipulated along meaningful directions without any labels.

Contribution

It presents a novel VRAM-efficient training algorithm and a method that identifies disentangled, interpretable directions in the h-space of diffusion models without requiring additional procedures.

Findings

01. Successfully discovers global, scalable directions in diffusion models

02. Maintains sample fidelity during manipulations

03. Effective across various datasets

Abstract

We propose the first unsupervised and learning-based method to identify interpretable directions in the h-space of pre-trained diffusion models. Our method is derived from an existing technique that operates on the GAN latent space. Specifically, we employ a shift control module that works on the h-space of pre-trained diffusion models to manipulate a sample into a shifted version of itself, followed by a reconstructor that reproduces both the type and the strength of the manipulation. By jointly optimizing them, the model spontaneously discovers disentangled and interpretable directions. To prevent the discovery of meaningless and destructive directions, we employ a discriminator to maintain the fidelity of the shifted samples. Due to the iterative generative process of diffusion models, our training requires a substantial amount of GPU VRAM to store numerous intermediate tensors for…
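The shift-and-reconstruct objective described in the abstract can be illustrated with a small, dependency-free sketch. It is not the authors' implementation. Everything in it is an assumption made for illustration: the function names, the additive form of the shift (h' = h + eps * d_k), the fixed toy directions, and the loss (cross-entropy on the direction index plus absolute error on the strength). The diffusion model, the learned shift control module, and the discriminator are omitted entirely.

```python
import math
import random


def sample_shift(num_directions, max_strength, rng):
    """Pick a direction index k (the 'type') and a signed strength eps."""
    k = rng.randrange(num_directions)
    eps = rng.uniform(-max_strength, max_strength)
    return k, eps


def apply_shift(h, directions, k, eps):
    """Assumed additive shift of an h-space feature vector: h' = h + eps * d_k."""
    return [h_i + eps * d_i for h_i, d_i in zip(h, directions[k])]


def reconstructor_loss(logits, pred_eps, k, eps):
    """Assumed loss: cross-entropy on the index plus absolute error on the strength."""
    m = max(logits)  # subtract the max for a numerically stable log-sum-exp
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    cross_entropy = log_z - logits[k]
    return cross_entropy + abs(pred_eps - eps)


# Toy setup: two hypothetical directions in a 3-dimensional stand-in for h-space.
rng = random.Random(0)
directions = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]
h = [0.5, -0.2, 0.1]

k, eps = sample_shift(len(directions), 3.0, rng)
h_shifted = apply_shift(h, directions, k, eps)

# A reconstructor that is confident in the right index and predicts the exact
# strength gets a loss near zero; an uninformed one pays at least log(2) here.
confident_logits = [10.0 if i == k else -10.0 for i in range(len(directions))]
good_loss = reconstructor_loss(confident_logits, eps, k, eps)
uninformed_loss = reconstructor_loss([0.0, 0.0], eps, k, eps)
```

In a real training loop the reconstructor would see the original and shifted samples rather than the shift itself. The loss above only shows what "reproducing both the type and the strength" could mean as an optimization target.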

Taxonomy

Topics: Generative Adversarial Networks and Image Synthesis · Advanced Neuroimaging Techniques and Applications · Model Reduction and Neural Networks

Methods: Gradient Checkpointing · Diffusion