GRAINS: Generative Recursive Autoencoders for INdoor Scenes

Manyi Li; Akshay Gadi Patil; Kai Xu; Siddhartha Chaudhuri; Owais Khan; Ariel Shamir; Changhe Tu; Baoquan Chen; Daniel Cohen-Or; Hao Zhang

arXiv:1807.09193·cs.GR·May 9, 2019·50 cites

Open Access

TL;DR

GRAINS introduces a hierarchical generative model using recursive autoencoders to produce diverse, plausible 3D indoor scenes efficiently, capturing scene structure and object relationships.

Contribution

The paper presents a novel recursive variational autoencoder for hierarchical 3D scene generation, leveraging scene structure to improve diversity and plausibility.

Findings

01 Successfully generates diverse 3D indoor scenes

02 Improves scene modeling from 2D layouts

03 Enhances semantic segmentation performance

Abstract

We present a generative neural network which enables us to generate plausible 3D indoor scenes in large quantities and varieties, easily and highly efficiently. Our key observation is that indoor scene structures are inherently hierarchical. Hence, our network is not convolutional; it is a recursive neural network or RvNN. Using a dataset of annotated scene hierarchies, we train a variational recursive autoencoder, or RvNN-VAE, which performs scene object grouping during its encoding phase and scene generation during decoding. Specifically, a set of encoders are recursively applied to group 3D objects based on support, surround, and co-occurrence relations in a scene, encoding information about object spatial properties, semantics, and their relative positioning with respect to other objects in the hierarchy. By training a variational autoencoder (VAE), the resulting fixed-length codes…
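The recursive grouping described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the weights are random and untrained, every group is simplified to exactly two children, and all names here (`RecursiveEncoder`, `CODE_DIM`, `sample_latent`, the feature vectors, the relation labels) are hypothetical. It only shows how one encoder per relation type can fold a scene hierarchy bottom-up into a single fixed-length code, followed by the standard VAE reparameterization step.

```python
import math
import random

CODE_DIM = 8  # assumed code length; not taken from the paper


def make_linear(n_in, n_out, rng):
    """Random, untrained weight matrix (illustration only)."""
    return [[rng.uniform(-0.5, 0.5) for _ in range(n_in)] for _ in range(n_out)]


def apply_linear(weights, vec):
    """Matrix-vector product followed by tanh."""
    return [math.tanh(sum(w * x for w, x in zip(row, vec))) for row in weights]


class RecursiveEncoder:
    """One leaf encoder plus one merge encoder per relation type."""

    def __init__(self, leaf_dim, relations=("support", "surround", "cooccur"), seed=0):
        rng = random.Random(seed)
        self.leaf = make_linear(leaf_dim, CODE_DIM, rng)
        # Assumption: every group has exactly two children.
        self.merge = {r: make_linear(2 * CODE_DIM, CODE_DIM, rng) for r in relations}

    def encode(self, node):
        if "features" in node:  # leaf: a single object
            return apply_linear(self.leaf, node["features"])
        left, right = (self.encode(child) for child in node["children"])
        # Pick the encoder by relation type and merge the two child codes.
        return apply_linear(self.merge[node["relation"]], left + right)


def sample_latent(mu, log_var, rng):
    """Standard VAE reparameterization: z = mu + sigma * eps."""
    return [m + math.exp(0.5 * lv) * rng.gauss(0.0, 1.0) for m, lv in zip(mu, log_var)]


# Hypothetical hierarchy: a lamp supported by a nightstand, co-occurring with a bed.
scene = {
    "relation": "cooccur",
    "children": [
        {"features": [1.0, 0.5, 0.2, 0.0]},  # bed
        {
            "relation": "support",
            "children": [
                {"features": [0.3, 0.3, 0.4, 1.0]},  # nightstand
                {"features": [0.1, 0.1, 0.3, 2.0]},  # lamp
            ],
        },
    ],
}

encoder = RecursiveEncoder(leaf_dim=4)
root_code = encoder.encode(scene)  # fixed-length code for the whole scene
```

Whatever the depth of the hierarchy, `root_code` always has `CODE_DIM` entries, which is the property that lets a VAE be trained on scenes with varying numbers of objects.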

Taxonomy

Topics: 3D Shape Modeling and Analysis · Advanced Vision and Imaging · 3D Surveying and Cultural Heritage