Continuous Layout Editing of Single Images with Diffusion Models

Zhiyuan Zhang; Zhitong Huang; Jing Liao

arXiv:2306.13078·cs.CV·June 23, 2023

Continuous Layout Editing of Single Images with Diffusion Models

Zhiyuan Zhang, Zhitong Huang, Jing Liao

PDF

Open Access

TL;DR

This paper introduces a novel framework for editing the layout of individual images using diffusion models, enabling continuous and layout-specific modifications while maintaining visual fidelity.

Contribution

It presents the first method for layout editing of single images with diffusion models, using masked textual inversion and a training-free layout control optimization.

Findings

01

Outperforms baseline methods in layout editing tasks

02

Preserves visual properties of images during editing

03

Enables continuous and user-guided layout modifications

Abstract

Recent advancements in large-scale text-to-image diffusion models have enabled many applications in image editing. However, none of these methods have been able to edit the layout of single existing images. To address this gap, we propose the first framework for layout editing of a single image while preserving its visual properties, thus allowing for continuous editing on a single image. Our approach is achieved through two key modules. First, to preserve the characteristics of multiple objects within an image, we disentangle the concepts of different objects and embed them into separate textual tokens using a novel method called masked textual inversion. Next, we propose a training-free optimization method to perform layout control for a pre-trained diffusion model, which allows us to regenerate images with learned concepts and align them with user-specified layouts. As the first…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Computer Graphics and Visualization Techniques · Advanced Vision and Imaging

MethodsNone · ALIGN · Diffusion