Decouple-Then-Merge: Finetune Diffusion Models as Multi-Task Learning

Qianli Ma; Xuefei Ning; Dongrui Liu; Li Niu; Linfeng Zhang

arXiv:2410.06664·cs.CV·March 17, 2025

Decouple-Then-Merge: Finetune Diffusion Models as Multi-Task Learning

Qianli Ma, Xuefei Ning, Dongrui Liu, Li Niu, Linfeng Zhang

PDF

Open Access

TL;DR

This paper introduces a Decouple-then-Merge framework for finetuning diffusion models, where separate models are trained for different timesteps and then merged, leading to improved image generation quality.

Contribution

The work proposes a novel finetuning approach that decouples timestep-specific models and merges them, enhancing diffusion model performance without sacrificing inference efficiency.

Findings

01

Significant quality improvements on 6 benchmarks including COCO30K, ImageNet1K, and LSUN datasets.

02

Effective techniques for knowledge sharing during finetuning reduce training interference.

03

Merged models maintain efficiency while achieving higher generation fidelity.

Abstract

Diffusion models are trained by learning a sequence of models that reverse each step of noise corruption. Typically, the model parameters are fully shared across multiple timesteps to enhance training efficiency. However, since the denoising tasks differ at each timestep, the gradients computed at different timesteps may conflict, potentially degrading the overall performance of image generation. To solve this issue, this work proposes a \textbf{De}couple-then-\textbf{Me}rge (\textbf{DeMe}) framework, which begins with a pretrained model and finetunes separate models tailored to specific timesteps. We introduce several improved techniques during the finetuning stage to promote effective knowledge sharing while minimizing training interference across timesteps. Finally, after finetuning, these separate models can be merged into a single model in the parameter space, ensuring efficient…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMulti-Agent Systems and Negotiation

MethodsDiffusion