Training-free Composite Scene Generation for Layout-to-Image Synthesis

Jiaqi Liu; Tao Huang; Chang Xu

arXiv:2407.13609·cs.CV·July 19, 2024

Training-free Composite Scene Generation for Layout-to-Image Synthesis

Jiaqi Liu, Tao Huang, Chang Xu

PDF

Open Access 1 Repo

TL;DR

This paper presents a novel training-free method for layout-to-image synthesis that improves spatial accuracy and image fidelity by refining diffusion processes with semantic and attention constraints, avoiding extensive dataset requirements.

Contribution

It introduces a training-free approach with innovative constraints to enhance spatial and semantic accuracy in layout-to-image generation without needing annotated datasets.

Findings

01

Improved spatial arrangement accuracy in generated images.

02

Enhanced image fidelity and complexity.

03

Effective use of layout information in diffusion models.

Abstract

Recent breakthroughs in text-to-image diffusion models have significantly advanced the generation of high-fidelity, photo-realistic images from textual descriptions. Yet, these models often struggle with interpreting spatial arrangements from text, hindering their ability to produce images with precise spatial configurations. To bridge this gap, layout-to-image generation has emerged as a promising direction. However, training-based approaches are limited by the need for extensively annotated datasets, leading to high data acquisition costs and a constrained conceptual scope. Conversely, training-free methods face challenges in accurately locating and generating semantically similar objects within complex compositions. This paper introduces a novel training-free approach designed to overcome adversarial semantic intersections during the diffusion conditioning phase. By refining…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Papple-F/csg
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputer Graphics and Visualization Techniques · Advanced Vision and Imaging · 3D Shape Modeling and Analysis

MethodsSoftmax · Attention Is All You Need · Diffusion