# Auto-Encoding Progressive Generative Adversarial Networks For 3D Multi   Object Scenes

**Authors:** Vedant Singh, Manan Oza, Himanshu Vaghela, Pratik Kanani

arXiv: 1903.03477 · 2019-03-11

## TL;DR

This paper introduces a novel 3D auto-encoding GAN architecture trained progressively to generate high-resolution, multi-object 3D scenes with improved object count and realism, addressing limitations of traditional models.

## Contribution

The proposed model combines adversarial auto-encoders with progressive training and WGAN-GP loss to enhance multi-object 3D scene generation capabilities.

## Key findings

- Generated realistic high-resolution 3D scenes of various room types
- Produced scenes with more objects per scene compared to previous models
- Achieved improved realism and diversity in multi-object scene synthesis

## Abstract

3D multi object generative models allow us to synthesize a large range of novel 3D multi object scenes and also identify objects, shapes, layouts and their positions. But multi object scenes are difficult to create because of the dataset being multimodal in nature. The conventional 3D generative adversarial models are not efficient in generating multi object scenes, they usually tend to generate either one object or generate fuzzy results of multiple objects. Auto-encoder models have much scope in feature extraction and representation learning using the unsupervised paradigm in probabilistic spaces. We try to make use of this property in our proposed model. In this paper we propose a novel architecture using 3DConvNets trained with the progressive training paradigm that has been able to generate realistic high resolution 3D scenes of rooms, bedrooms, offices etc. with various pieces of furniture and objects. We make use of the adversarial auto-encoder along with the WGAN-GP loss parameter in our discriminator loss function. Finally this new approach to multi object scene generation has also been able to generate more number of objects per scene.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1903.03477/full.md

## Figures

14 figures with captions in the complete paper: https://tomesphere.com/paper/1903.03477/full.md

## References

22 references — full list in the complete paper: https://tomesphere.com/paper/1903.03477/full.md

---
Source: https://tomesphere.com/paper/1903.03477