Assembling Semantically-Disentangled Representations for Predictive-Generative Models via Adaptation from Synthetic Domain
Burkay Donderici, Caleb New, Chenliang Xu

TL;DR
This paper introduces SYNTH-VAE-GAN, a method that leverages synthetic data and physics-based engines to learn semantically-aligned representations for predictive-generative models, reducing reliance on costly real data labels.
Contribution
The paper presents a novel approach to generate semantically-disentangled representations using synthetic data and domain adaptation, avoiding the need for large annotated real datasets.
Findings
Successfully constructs a conditional predictive-generative model of human face attributes.
Demonstrates effective semantic alignment between synthetic and real domains.
Reduces dependency on real data labels for training complex models.
Abstract
Deep neural networks can form high-level hierarchical representations of input data. Various researchers have demonstrated that these representations can be used to enable a variety of useful applications. However, such representations are typically based on the statistics within the data, and may not conform with the semantic representation that may be necessitated by the application. Conditional models are typically used to overcome this challenge, but they require large annotated datasets which are difficult to come by and costly to create. In this paper, we show that semantically-aligned representations can be generated instead with the help of a physics based engine. This is accomplished by creating a synthetic dataset with decoupled attributes, learning an encoder for the synthetic dataset, and augmenting prescribed attributes from the synthetic domain with attributes from the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Music and Audio Processing · Human Pose and Action Recognition
