LSReGen: Large-Scale Regional Generator via Backward Guidance Framework

Bowen Zhang; Cheng Yang; Xuanhui Liu

arXiv:2407.15066·cs.CV·July 23, 2024

LSReGen: Large-Scale Regional Generator via Backward Guidance Framework

Bowen Zhang, Cheng Yang, Xuanhui Liu

PDF

Open Access

TL;DR

This paper introduces LSReGen, a novel large-scale layout-to-image generation framework that improves controllability and quality by generalizing backward guidance, outperforming existing methods in large-scale tasks.

Contribution

The paper presents a new backward guidance framework and LSReGen, enabling high-quality, layout-controlled image generation at large scale with better performance than prior approaches.

Findings

01

LSReGen achieves superior quality in large-scale layout-to-image tasks.

02

The framework generalizes backward guidance without model-specific assumptions.

03

Experimental results outperform existing methods.

Abstract

In recent years, advancements in AIGC (Artificial Intelligence Generated Content) technology have significantly enhanced the capabilities of large text-to-image models. Despite these improvements, controllable image generation remains a challenge. Current methods, such as training, forward guidance, and backward guidance, have notable limitations. The first two approaches either demand substantial computational resources or produce subpar results. The third approach depends on phenomena specific to certain model architectures, complicating its application to large-scale image generation.To address these issues, we propose a novel controllable generation framework that offers a generalized interpretation of backward guidance without relying on specific assumptions. Leveraging this framework, we introduce LSReGen, a large-scale layout-to-image method designed to generate high-quality,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEducational Technology and Assessment · Distributed and Parallel Computing Systems · Speech and dialogue systems