Diffusion Soup: Model Merging for Text-to-Image Diffusion Models
Benjamin Biggs, Arjun Seshadri, Yang Zou, Achin Jain, Aditya Golatkar,, Yusheng Xie, Alessandro Achille, Ashwin Swaminathan, and Stefano Soatto

TL;DR
Diffusion Soup introduces a weight averaging method for text-to-image diffusion models that enables efficient model merging, continual learning, unlearning, and style blending without additional training or inference costs.
Contribution
The paper proposes Diffusion Soup, a novel weight averaging approach for diffusion models that allows for training-free model merging, unlearning, and style mixing, backed by theoretical insights and empirical validation.
Findings
Outperforms models trained on combined data in image reward and style scores.
Enables robust unlearning with minimal performance loss.
Supports zero-shot style blending of models trained on different data shards.
Abstract
We present Diffusion Soup, a compartmentalization method for Text-to-Image Generation that averages the weights of diffusion models trained on sharded data. By construction, our approach enables training-free continual learning and unlearning with no additional memory or inference costs, since models corresponding to data shards can be added or removed by re-averaging. We show that Diffusion Soup samples from a point in weight space that approximates the geometric mean of the distributions of constituent datasets, which offers anti-memorization guarantees and enables zero-shot style mixing. Empirically, Diffusion Soup outperforms a paragon model trained on the union of all data shards and achieves a 30% improvement in Image Reward (.34 .44) on domain sharded data, and a 59% improvement in IR (.37 .59) on aesthetic data. In both cases, souping also prevails in TIFA score…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMathematics, Computing, and Information Processing
MethodsDiffusion
