Isolated Diffusion: Optimizing Multi-Concept Text-to-Image Generation Training-Freely with Isolated Diffusion Guidance
Jingyuan Zhu, Huimin Ma, Jiansheng Chen, and Jian Yuan

TL;DR
This paper introduces Isolated Diffusion, a training-free method that improves multi-concept text-to-image generation by isolating and resynthesizing individual concepts to reduce concept bleeding and enhance text-image consistency.
Contribution
It presents a novel, training-free approach that isolates concepts during synthesis using pre-trained detection models, addressing mutual interference in multi-concept image generation.
Findings
Significantly reduces concept bleeding in multi-concept generation.
Improves text-image alignment and scene coherence.
Compatible with recent diffusion models like SDXL and SD.
Abstract
Large-scale text-to-image diffusion models have achieved great success in synthesizing high-quality and diverse images given target text prompts. Despite the revolutionary image generation ability, current state-of-the-art models still struggle to deal with multi-concept generation accurately in many cases. This phenomenon is known as ``concept bleeding" and displays as the unexpected overlapping or merging of various concepts. This paper presents a general approach for text-to-image diffusion models to address the mutual interference between different subjects and their attachments in complex scenes, pursuing better text-image consistency. The core idea is to isolate the synthesizing processes of different concepts. We propose to bind each attachment to corresponding subjects separately with split text prompts. Besides, we introduce a revision method to fix the concept bleeding problem…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Educational Assessment and Pedagogy · Topic Modeling
MethodsDiffusion
