Assembling Semantically-Disentangled Representations for   Predictive-Generative Models via Adaptation from Synthetic Domain

Burkay Donderici; Caleb New; Chenliang Xu

arXiv:2002.09818·cs.CV·February 25, 2020·1 cites

Assembling Semantically-Disentangled Representations for Predictive-Generative Models via Adaptation from Synthetic Domain

Burkay Donderici, Caleb New, Chenliang Xu

PDF

Open Access

TL;DR

This paper introduces SYNTH-VAE-GAN, a method that leverages synthetic data and physics-based engines to learn semantically-aligned representations for predictive-generative models, reducing reliance on costly real data labels.

Contribution

The paper presents a novel approach to generate semantically-disentangled representations using synthetic data and domain adaptation, avoiding the need for large annotated real datasets.

Findings

01

Successfully constructs a conditional predictive-generative model of human face attributes.

02

Demonstrates effective semantic alignment between synthetic and real domains.

03

Reduces dependency on real data labels for training complex models.

Abstract

Deep neural networks can form high-level hierarchical representations of input data. Various researchers have demonstrated that these representations can be used to enable a variety of useful applications. However, such representations are typically based on the statistics within the data, and may not conform with the semantic representation that may be necessitated by the application. Conditional models are typically used to overcome this challenge, but they require large annotated datasets which are difficult to come by and costly to create. In this paper, we show that semantically-aligned representations can be generated instead with the help of a physics based engine. This is accomplished by creating a synthetic dataset with decoupled attributes, learning an encoder for the synthetic dataset, and augmenting prescribed attributes from the synthetic domain with attributes from the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Music and Audio Processing · Human Pose and Action Recognition