Diffusion Model Patching via Mixture-of-Prompts

Seokil Ham; Sangmin Woo; Jin-Young Kim; Hyojun Go; Byeongjun Park,; Changick Kim

arXiv:2405.17825·cs.CV·December 12, 2024

Diffusion Model Patching via Mixture-of-Prompts

Seokil Ham, Sangmin Woo, Jin-Young Kim, Hyojun Go, Byeongjun Park,, Changick Kim

PDF

Open Access 1 Video

TL;DR

Diffusion Model Patching (DMP) introduces a small, learnable prompt set with a gating mechanism to improve pre-trained diffusion models' performance without retraining from scratch.

Contribution

DMP is a novel method that dynamically combines prompts to enhance frozen diffusion models, requiring minimal additional parameters and training.

Findings

01

DMP improves FID of converged models by over 10%.

02

Only 1.43% parameter increase needed for significant performance gains.

03

Effective even with additional training on original datasets.

Abstract

We present Diffusion Model Patching (DMP), a simple method to boost the performance of pre-trained diffusion models that have already reached convergence, with a negligible increase in parameters. DMP inserts a small, learnable set of prompts into the model's input space while keeping the original model frozen. The effectiveness of DMP is not merely due to the addition of parameters but stems from its dynamic gating mechanism, which selects and combines a subset of learnable prompts at every timestep (i.e., reverse denoising steps). This strategy, which we term "mixture-of-prompts", enables the model to draw on the distinct expertise of each prompt, essentially "patching" the model's functionality at every timestep with minimal yet specialized parameters. Uniquely, DMP enhances the model by further training on the original dataset already used for pre-training, even in a scenario where…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Diffusion Model Patching via Mixture-of-Prompts· underline

Taxonomy

TopicsModel Reduction and Neural Networks

MethodsSparse Evolutionary Training · Activation Patching · Diffusion