RELATE: Physically Plausible Multi-Object Scene Synthesis Using   Structured Latent Spaces

Sebastien Ehrhardt; Oliver Groth; Aron Monszpart; Martin; Engelcke; Ingmar Posner; Niloy Mitra; Andrea Vedaldi

arXiv:2007.01272·cs.CV·November 10, 2020·22 cites

RELATE: Physically Plausible Multi-Object Scene Synthesis Using Structured Latent Spaces

Sebastien Ehrhardt, Oliver Groth, Aron Monszpart, Martin, Engelcke, Ingmar Posner, Niloy Mitra, Andrea Vedaldi

PDF

Open Access 1 Repo 1 Video

TL;DR

RELATE is a novel generative model that produces physically plausible multi-object scenes and videos by explicitly modeling object correlations, enabling realistic scene editing and outperforming prior methods on synthetic and real-world data.

Contribution

It introduces a physically interpretable, object-centric generative model that captures object correlations for improved scene and video synthesis.

Findings

01

Outperforms prior art in object-centric scene generation

02

Enables realistic scene editing

03

Extends naturally to dynamic scenes and videos

Abstract

We present RELATE, a model that learns to generate physically plausible scenes and videos of multiple interacting objects. Similar to other generative approaches, RELATE is trained end-to-end on raw, unlabeled data. RELATE combines an object-centric GAN formulation with a model that explicitly accounts for correlations between individual objects. This allows the model to generate realistic scenes and videos from a physically-interpretable parameterization. Furthermore, we show that modeling the object correlation is necessary to learn to disentangle object positions and identity. We find that RELATE is also amenable to physically realistic scene editing and that it significantly outperforms prior art in object-centric scene generation in both synthetic (CLEVR, ShapeStacks) and real-world data (cars). In addition, in contrast to state-of-the-art methods in object-centric generative…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hyenal/relate
pytorchOfficial

Videos

RELATE: Physically Plausible Multi-Object Scene Synthesis Using Structured Latent Spaces· slideslive

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Multimodal Machine Learning Applications · Human Pose and Action Recognition