Be Decisive: Noise-Induced Layouts for Multi-Subject Generation
Omer Dahary, Yehonathan Cohen, Or Patashnik, Kfir Aberman, Daniel Cohen-Or

TL;DR
This paper introduces a noise-aligned layout prediction method for multi-subject text-to-image generation, improving alignment and stability without external layout conflicts by refining a predicted spatial layout during denoising.
Contribution
It proposes a novel noise-induced layout prediction approach that aligns with the prompt and refines during denoising, avoiding conflicts with external layouts and enhancing multi-subject generation.
Findings
Improved text-image alignment over existing methods
More stable multi-subject generation results
Preserves diversity of the model's original distribution
Abstract
Generating multiple distinct subjects remains a challenge for existing text-to-image diffusion models. Complex prompts often lead to subject leakage, causing inaccuracies in quantities, attributes, and visual features. Preventing leakage among subjects necessitates knowledge of each subject's spatial location. Recent methods provide these spatial locations via an external layout control. However, enforcing such a prescribed layout often conflicts with the innate layout dictated by the sampled initial noise, leading to misalignment with the model's prior. In this work, we introduce a new approach that predicts a spatial layout aligned with the prompt, derived from the initial noise, and refines it throughout the denoising process. By relying on this noise-induced layout, we avoid conflicts with externally imposed layouts and better preserve the model's prior. Our method employs a small…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Embedded Systems Design Techniques · Evolutionary Algorithms and Applications
MethodsDiffusion
