Addressing Negative Transfer in Diffusion Models
Hyojun Go, JinYoung Kim, Yunsung Lee, Seunghyun Lee, Shinhyeok Oh,, Hyeongdon Moon, Seungtaek Choi

TL;DR
This paper analyzes negative transfer in diffusion models from an MTL perspective, proposing task clustering to mitigate conflicts, leading to improved quality and faster training.
Contribution
It introduces a clustering-based approach to address negative transfer in diffusion models, enabling efficient application of MTL methods.
Findings
Enhanced generation quality in diffusion models.
Faster training convergence achieved.
Effective mitigation of negative transfer.
Abstract
Diffusion-based generative models have achieved remarkable success in various domains. It trains a shared model on denoising tasks that encompass different noise levels simultaneously, representing a form of multi-task learning (MTL). However, analyzing and improving diffusion models from an MTL perspective remains under-explored. In particular, MTL can sometimes lead to the well-known phenomenon of negative transfer, which results in the performance degradation of certain tasks due to conflicts between tasks. In this paper, we first aim to analyze diffusion training from an MTL standpoint, presenting two key observations: (O1) the task affinity between denoising tasks diminishes as the gap between noise levels widens, and (O2) negative transfer can arise even in diffusion training. Building upon these observations, we aim to enhance diffusion training by mitigating negative transfer.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsModel Reduction and Neural Networks · Domain Adaptation and Few-Shot Learning · Advanced Neuroimaging Techniques and Applications
MethodsDiffusion
