DiffFit: Unlocking Transferability of Large Diffusion Models via Simple Parameter-Efficient Fine-Tuning
Enze Xie, Lewei Yao, Han Shi, Zhili Liu, Daquan Zhou, Zhaoqiang Liu,, Jiawei Li, Zhenguo Li

TL;DR
DiffFit introduces a simple, parameter-efficient method for adapting large diffusion models to new domains, significantly reducing training time and storage while maintaining or improving performance.
Contribution
The paper proposes DiffFit, a novel fine-tuning approach that only updates bias terms and scaling factors, enabling fast, resource-efficient adaptation of large diffusion models.
Findings
DiffFit achieves 2× training speed-up compared to full fine-tuning.
It requires only 0.12% of the total model parameters to be stored.
DiffFit sets a new state-of-the-art FID of 3.02 on ImageNet 512×512.
Abstract
Diffusion models have proven to be highly effective in generating high-quality images. However, adapting large pre-trained diffusion models to new domains remains an open challenge, which is critical for real-world applications. This paper proposes DiffFit, a parameter-efficient strategy to fine-tune large pre-trained diffusion models that enable fast adaptation to new domains. DiffFit is embarrassingly simple that only fine-tunes the bias term and newly-added scaling factors in specific layers, yet resulting in significant training speed-up and reduced model storage costs. Compared with full fine-tuning, DiffFit achieves 2 training speed-up and only needs to store approximately 0.12\% of the total model parameters. Intuitive theoretical analysis has been provided to justify the efficacy of scaling factors on fast adaptation. On 8 downstream datasets, DiffFit achieves superior…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Neuroimaging Techniques and Applications · AI in cancer detection
MethodsDiffusion
