Diagnostic Benchmark and Iterative Inpainting for Layout-Guided Image Generation
Jaemin Cho, Linjie Li, Zhengyuan Yang, Zhe Gan, Lijuan Wang, Mohit, Bansal

TL;DR
This paper introduces LayoutBench, a diagnostic benchmark for evaluating layout-guided image generation models, and proposes IterInpaint, a new method that improves out-of-distribution layout generalization through step-by-step inpainting.
Contribution
The paper presents LayoutBench for assessing spatial control skills and introduces IterInpaint, a novel inpainting-based baseline that enhances OOD layout generalization.
Findings
Existing models struggle with OOD layouts, especially at boundaries.
IterInpaint outperforms SOTA models on LayoutBench and LayoutBench-COCO.
Inpainting-based approach improves generalization to unseen layouts.
Abstract
Spatial control is a core capability in controllable image generation. Advancements in layout-guided image generation have shown promising results on in-distribution (ID) datasets with similar spatial configurations. However, it is unclear how these models perform when facing out-of-distribution (OOD) samples with arbitrary, unseen layouts. In this paper, we propose LayoutBench, a diagnostic benchmark for layout-guided image generation that examines four categories of spatial control skills: number, position, size, and shape. We benchmark two recent representative layout-guided image generation methods and observe that the good ID layout control may not generalize well to arbitrary layouts in the wild (e.g., objects at the boundary). Next, we propose IterInpaint, a new baseline that generates foreground and background regions step-by-step via inpainting, demonstrating stronger…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Advanced Vision and Imaging · Computer Graphics and Visualization Techniques
MethodsIterative Inpainting
