Design Booster: A Text-Guided Diffusion Model for Image Translation with   Spatial Layout Preservation

Shiqi Sun; Shancheng Fang; Qian He; Wei Liu

arXiv:2302.02284·cs.CV·February 7, 2023

Design Booster: A Text-Guided Diffusion Model for Image Translation with Spatial Layout Preservation

Shiqi Sun, Shancheng Fang, Qian He, Wei Liu

PDF

Open Access

TL;DR

Design Booster introduces a flexible, layout-aware diffusion model for image translation that preserves spatial structure, allows multi-condition control, and outperforms existing methods in quality and speed.

Contribution

It proposes a novel training framework that co-encodes images and text for flexible, layout-preserving image translation with efficient inference.

Findings

01

Outperforms state-of-the-art in style and semantic translation

02

Achieves faster inference times

03

Maintains spatial layout effectively

Abstract

Diffusion models are able to generate photorealistic images in arbitrary scenes. However, when applying diffusion models to image translation, there exists a trade-off between maintaining spatial structure and high-quality content. Besides, existing methods are mainly based on test-time optimization or fine-tuning model for each input image, which are extremely time-consuming for practical applications. To address these issues, we propose a new approach for flexible image translation by learning a layout-aware image condition together with a text condition. Specifically, our method co-encodes images and text into a new domain during the training phase. In the inference stage, we can choose images/text or both as the conditions for each time step, which gives users more flexible control over layout and content. Experimental comparisons of our method with state-of-the-art methods…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Mycobacterium research and diagnosis

MethodsDiffusion