Text Semantics to Image Generation: A method of building facades design   base on Stable Diffusion model

Haoran Ma

arXiv:2303.12755·cs.CV·April 10, 2023·6 cites

Text Semantics to Image Generation: A method of building facades design base on Stable Diffusion model

Haoran Ma

PDF

Open Access

TL;DR

This paper enhances architectural facade image generation by fine-tuning Stable Diffusion with LoRA and applying ControlNet to improve controllability based on textual architectural descriptions.

Contribution

It introduces a combined multi-network approach using LoRA and ControlNet to improve controllability and efficiency in text-to-facade image generation.

Findings

01

LoRA significantly reduces fine-tuning complexity.

02

ControlNet increases controllability of generated images.

03

The method effectively generates architectural facades based on text descriptions.

Abstract

Stable Diffusion model has been extensively employed in the study of archi-tectural image generation, but there is still an opportunity to enhance in terms of the controllability of the generated image content. A multi-network combined text-to-building facade image generating method is proposed in this work. We first fine-tuned the Stable Diffusion model on the CMP Fa-cades dataset using the LoRA (Low-Rank Adaptation) approach, then we ap-ply the ControlNet model to further control the output. Finally, we contrast-ed the facade generating outcomes under various architectural style text con-tents and control strategies. The results demonstrate that the LoRA training approach significantly decreases the possibility of fine-tuning the Stable Dif-fusion large model, and the addition of the ControlNet model increases the controllability of the creation of text to building facade images. This…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage Retrieval and Classification Techniques · Aesthetic Perception and Analysis · 3D Surveying and Cultural Heritage

MethodsDiffusion