Addressing Negative Transfer in Diffusion Models

Hyojun Go; JinYoung Kim; Yunsung Lee; Seunghyun Lee; Shinhyeok Oh,; Hyeongdon Moon; Seungtaek Choi

arXiv:2306.00354·cs.CV·January 2, 2024·2 cites

Addressing Negative Transfer in Diffusion Models

Hyojun Go, JinYoung Kim, Yunsung Lee, Seunghyun Lee, Shinhyeok Oh,, Hyeongdon Moon, Seungtaek Choi

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper analyzes negative transfer in diffusion models from an MTL perspective, proposing task clustering to mitigate conflicts, leading to improved quality and faster training.

Contribution

It introduces a clustering-based approach to address negative transfer in diffusion models, enabling efficient application of MTL methods.

Findings

01

Enhanced generation quality in diffusion models.

02

Faster training convergence achieved.

03

Effective mitigation of negative transfer.

Abstract

Diffusion-based generative models have achieved remarkable success in various domains. It trains a shared model on denoising tasks that encompass different noise levels simultaneously, representing a form of multi-task learning (MTL). However, analyzing and improving diffusion models from an MTL perspective remains under-explored. In particular, MTL can sometimes lead to the well-known phenomenon of negative transfer, which results in the performance degradation of certain tasks due to conflicts between tasks. In this paper, we first aim to analyze diffusion training from an MTL standpoint, presenting two key observations: (O1) the task affinity between denoising tasks diminishes as the gap between noise levels widens, and (O2) negative transfer can arise even in diffusion training. Building upon these observations, we aim to enhance diffusion training by mitigating negative transfer.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

gohyojun15/ANT_diffusion
pytorchOfficial

Videos

Addressing Negative Transfer in Diffusion Models· slideslive

Taxonomy

TopicsModel Reduction and Neural Networks · Domain Adaptation and Few-Shot Learning · Advanced Neuroimaging Techniques and Applications

MethodsDiffusion