Diffusion Tuning: Transferring Diffusion Models via Chain of Forgetting
Jincheng Zhong, Xingzhuo Guo, Jiaxiang Dong, Mingsheng Long

TL;DR
Diff-Tuning is a simple transfer method for diffusion models that leverages the chain of forgetting phenomenon, improving transfer performance and convergence speed in downstream tasks.
Contribution
The paper introduces Diff-Tuning, a novel transfer approach based on the chain of forgetting in diffusion models, with theoretical insights and practical benefits.
Findings
26% improvement over standard fine-tuning
24% faster convergence of ControlNet
Effective transfer across multiple downstream tasks
Abstract
Diffusion models have significantly advanced the field of generative modeling. However, training a diffusion model is computationally expensive, creating a pressing need to adapt off-the-shelf diffusion models for downstream generation tasks. Current fine-tuning methods focus on parameter-efficient transfer learning but overlook the fundamental transfer characteristics of diffusion models. In this paper, we investigate the transferability of diffusion models and observe a monotonous chain of forgetting trend of transferability along the reverse process. Based on this observation and novel theoretical insights, we present Diff-Tuning, a frustratingly simple transfer approach that leverages the chain of forgetting tendency. Diff-Tuning encourages the fine-tuned model to retain the pre-trained knowledge at the end of the denoising chain close to the generated data while discarding the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Mathematical Modeling in Engineering
MethodsAttention Is All You Need · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Softmax · Focus · Layer Normalization · Linear Layer · Byte Pair Encoding · Label Smoothing · Diffusion · Adam
