End-to-End Optimization of Scene Layout

Andrew Luo; Zhoutong Zhang; Jiajun Wu; Joshua B. Tenenbaum

arXiv:2007.11744 · cs.CV · July 24, 2020 · 1 citation

TL;DR

This paper introduces an end-to-end variational generative model that synthesizes scene layouts conditioned on scene graphs. The scene graph can come from inputs such as a sentence or a single color image, and generated layouts can be refined using only 2D projections of the scene.

Contribution

It presents a novel conditional scene layout generator with a differentiable rendering module for layout refinement, improving control, diversity, and accuracy over prior methods.

Findings

1. Higher accuracy in conditional scene synthesis
2. Enhanced diversity of generated layouts
3. Effective refinement using 2D projections

Abstract

We propose an end-to-end variational generative model for scene layout synthesis conditioned on scene graphs. Unlike unconditional scene layout generation, we use scene graphs as an abstract but general representation to guide the synthesis of diverse scene layouts that satisfy relationships included in the scene graph. This gives rise to more flexible control over the synthesis process, allowing various forms of inputs such as scene layouts extracted from sentences or inferred from a single color image. Using our conditional layout synthesizer, we can generate various layouts that share the same structure as the input example. In addition to this conditional generation design, we also integrate a differentiable rendering module that enables layout refinement using only 2D projections of the scene. Given a depth and a semantics map, the differentiable rendering module enables optimizing…
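
To make concrete what it means for a layout to "satisfy relationships included in the scene graph", here is a minimal Python sketch. It is not taken from the paper or from the 3D_SLN repository: the `Box` class, the relation names (`"left of"`, `"on"`), and the predicates are hypothetical and chosen only for illustration. A scene graph is stored as (subject, relation, object) triples, and a candidate layout of axis-aligned boxes is checked against each triple.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Box:
    """Axis-aligned 3D box (hypothetical): min corner (x, y, z), size (w, h, d); y is up."""
    x: float
    y: float
    z: float
    w: float
    h: float
    d: float


def left_of(a: Box, b: Box) -> bool:
    # a lies entirely to the left of b along the x axis.
    return a.x + a.w <= b.x


def on_top_of(a: Box, b: Box, tol: float = 1e-3) -> bool:
    # a rests on b's top face and their footprints overlap in x and z.
    rests = abs(a.y - (b.y + b.h)) <= tol
    overlaps_x = a.x < b.x + b.w and b.x < a.x + a.w
    overlaps_z = a.z < b.z + b.d and b.z < a.z + a.d
    return rests and overlaps_x and overlaps_z


# Illustrative relation vocabulary; the paper's actual relation set may differ.
RELATIONS = {"left of": left_of, "on": on_top_of}


def violated_edges(layout, edges):
    """Return the scene-graph triples that the layout does not satisfy."""
    return [
        (s, r, o) for (s, r, o) in edges
        if not RELATIONS[r](layout[s], layout[o])
    ]


# Scene graph: object names are nodes, triples are directed labeled edges.
edges = [("lamp", "on", "nightstand"), ("nightstand", "left of", "bed")]

layout_a = {
    "bed": Box(1.0, 0.0, 0.0, 2.0, 0.5, 2.0),
    "nightstand": Box(0.0, 0.0, 0.0, 0.5, 0.6, 0.5),
    "lamp": Box(0.1, 0.6, 0.1, 0.2, 0.4, 0.2),
}

# Same objects, but the nightstand (and its lamp) moved to the right of the bed.
layout_b = {
    "bed": Box(1.0, 0.0, 0.0, 2.0, 0.5, 2.0),
    "nightstand": Box(3.5, 0.0, 0.0, 0.5, 0.6, 0.5),
    "lamp": Box(3.6, 0.6, 0.1, 0.2, 0.4, 0.2),
}

print(violated_edges(layout_a, edges))
print(violated_edges(layout_b, edges))
```

Many different layouts can satisfy the same set of triples, which is why a single scene graph can condition a diverse family of generated layouts. The hard pass/fail predicates here are only a checking device; they say nothing about how the model itself is trained or how it encodes the graph.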

Code & Models

Repositories

aluo-x/3D_SLN (PyTorch, official)

Videos

End-to-End Optimization of Scene Layout (YouTube)

Taxonomy

Topics: Multimodal Machine Learning Applications · Generative Adversarial Networks and Image Synthesis · Advanced Image and Video Retrieval Techniques