LooseControl: Lifting ControlNet for Generalized Depth Conditioning
Shariq Farooq Bhat, Niloy J. Mitra, Peter Wonka

TL;DR
LooseControl extends ControlNet to enable generalized depth conditioning with scene boundary and 3D box controls, facilitating flexible scene creation and editing in diffusion-based image generation without requiring detailed depth maps.
Contribution
It introduces LooseControl, a novel method allowing generalized depth conditioning with scene boundary and 3D box controls, enhancing flexibility in scene creation and editing.
Findings
Demonstrates ability to create complex environments with minimal input.
Shows effective editing through 3D box modifications and attribute adjustments.
Outperforms baselines in generality and quality of generated scenes.
Abstract
We present LooseControl to allow generalized depth conditioning for diffusion-based image generation. ControlNet, the SOTA for depth-conditioned image generation, produces remarkable results but relies on having access to detailed depth maps for guidance. Creating such exact depth maps, in many scenarios, is challenging. This paper introduces a generalized version of depth conditioning that enables many new content-creation workflows. Specifically, we allow (C1) scene boundary control for loosely specifying scenes with only boundary conditions, and (C2) 3D box control for specifying layout locations of the target objects rather than the exact shape and appearance of the objects. Using LooseControl, along with text guidance, users can create complex environments (e.g., rooms, street views, etc.) by specifying only scene boundaries and locations of primary objects. Further, we provide two…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Generative Adversarial Networks and Image Synthesis · Computer Graphics and Visualization Techniques
