AdjointDPM: Adjoint Sensitivity Method for Gradient Backpropagation of   Diffusion Probabilistic Models

Jiachun Pan; Jun Hao Liew; Vincent Y. F. Tan; Jiashi Feng; Hanshu Yan

arXiv:2307.10711·cs.CV·March 21, 2024

AdjointDPM: Adjoint Sensitivity Method for Gradient Backpropagation of Diffusion Probabilistic Models

Jiachun Pan, Jun Hao Liew, Vincent Y. F. Tan, Jiashi Feng, Hanshu Yan

PDF

Open Access 1 Repo

TL;DR

AdjointDPM introduces an efficient adjoint sensitivity method for gradient backpropagation in diffusion probabilistic models, enabling customization with minimal supervision and reducing memory usage during training.

Contribution

The paper presents AdjointDPM, a novel approach that uses the adjoint sensitivity method and probability-flow ODE reparameterization for memory-efficient gradient computation in DPMs.

Findings

01

Effective in converting visual effects into text embeddings

02

Enables fine-tuning for stylization tasks

03

Facilitates initial noise optimization for adversarial sample generation

Abstract

Existing customization methods require access to multiple reference examples to align pre-trained diffusion probabilistic models (DPMs) with user-provided concepts. This paper aims to address the challenge of DPM customization when the only available supervision is a differentiable metric defined on the generated contents. Since the sampling procedure of DPMs involves recursive calls to the denoising UNet, na\"ive gradient backpropagation requires storing the intermediate states of all iterations, resulting in extremely high memory consumption. To overcome this issue, we propose a novel method AdjointDPM, which first generates new samples from diffusion models by solving the corresponding probability-flow ODEs. It then uses the adjoint sensitivity method to backpropagate the gradients of the loss to the models' parameters (including conditioning signals, network weights, and initial…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hanshuyan/adjointdpm
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Model Reduction and Neural Networks · Computational and Text Analysis Methods

MethodsDiffusion · ALIGN