Composite Diffusion | whole >= Σparts
Vikram Jamwal, Ramaneswaran S

TL;DR
Composite Diffusion enables artists to generate and compose high-quality images with flexible spatial arrangements and multiple control inputs, enhancing creative control without retraining diffusion models.
Contribution
We introduce a modular, plug-and-play method for composite image generation using diffusion models, supporting diverse spatial layouts and control inputs for improved artistic expression.
Findings
Achieves greater spatial and semantic control in image synthesis.
Extensive user surveys validate improved creative flexibility.
Proposes novel quality metrics for composite image evaluation.
Abstract
For an artist or a graphic designer, the spatial layout of a scene is a critical design choice. However, existing text-to-image diffusion models provide limited support for incorporating spatial information. This paper introduces Composite Diffusion as a means for artists to generate high-quality images by composing from the sub-scenes. The artists can specify the arrangement of these sub-scenes through a flexible free-form segment layout. They can describe the content of each sub-scene primarily using natural text and additionally by utilizing reference images or control inputs such as line art, scribbles, human pose, canny edges, and more. We provide a comprehensive and modular method for Composite Diffusion that enables alternative ways of generating, composing, and harmonizing sub-scenes. Further, we wish to evaluate the composite image for effectiveness in both image quality and…
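The idea of arranging sub-scenes through a free-form segment layout can be illustrated with a small mask-based compositing sketch. This is not the authors' method: the abstract does not specify how sub-scenes are generated, composed, or harmonized, so the function name `compose_by_layout`, the integer label map, and the use of plain NumPy arrays in place of diffusion outputs are all assumptions made for illustration.

```python
import numpy as np

def compose_by_layout(sub_scenes, layout):
    """Paste each sub-scene into the region its label occupies in the layout.

    Hypothetical helper, not taken from the paper.
    sub_scenes: dict mapping segment label -> array of shape (H, W, C)
    layout:     integer array of shape (H, W); each pixel holds a segment label
    """
    labels = np.unique(layout)
    missing = [int(l) for l in labels if int(l) not in sub_scenes]
    if missing:
        raise ValueError(f"no sub-scene for segment labels {missing}")

    first = next(iter(sub_scenes.values()))
    composite = np.zeros_like(first)
    for label in labels:
        # (H, W, 1) boolean mask; broadcasts across the channel axis
        mask = (layout == label)[..., None]
        composite = np.where(mask, sub_scenes[int(label)], composite)
    return composite

# Toy example: left half belongs to segment 0, right half to segment 1.
layout = np.zeros((4, 4), dtype=int)
layout[:, 2:] = 1
sub_scenes = {
    0: np.full((4, 4, 3), 0.2),  # stand-in for the first sub-scene
    1: np.full((4, 4, 3), 0.8),  # stand-in for the second sub-scene
}
composite = compose_by_layout(sub_scenes, layout)
```

A hard paste like this leaves visible seams at segment boundaries, which is presumably why the abstract lists harmonizing sub-scenes as a separate concern alongside generating and composing them.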
Taxonomy
Topics: Aesthetic Perception and Analysis · Visual Attention and Saliency Detection · Computer Graphics and Visualization Techniques
Methods: Balanced Selection · Diffusion
