Generating Furry Cars: Disentangling Object Shape & Appearance across   Multiple Domains

Utkarsh Ojha; Krishna Kumar Singh; Yong Jae Lee

arXiv:2104.02052·cs.CV·April 6, 2021

Generating Furry Cars: Disentangling Object Shape & Appearance across Multiple Domains

Utkarsh Ojha, Krishna Kumar Singh, Yong Jae Lee

PDF

Open Access

TL;DR

This paper introduces a method for learning disentangled representations of object shape and appearance across multiple domains, enabling the generation of novel hybrid images by interchanging these factors.

Contribution

It extends existing disentanglement techniques to handle cross-domain scenarios using a differentiable histogram of visual features for appearance representation.

Findings

01

Effective shape and appearance transfer across domains

02

Accurate disentanglement of shape and appearance factors

03

Generates novel images combining features from different domains

Abstract

We consider the novel task of learning disentangled representations of object shape and appearance across multiple domains (e.g., dogs and cars). The goal is to learn a generative model that learns an intermediate distribution, which borrows a subset of properties from each domain, enabling the generation of images that did not exist in any domain exclusively. This challenging problem requires an accurate disentanglement of object shape, appearance, and background from each domain, so that the appearance and shape factors from the two domains can be interchanged. We augment an existing approach that can disentangle factors within a single domain but struggles to do so across domains. Our key technical contribution is to represent object appearance with a differentiable histogram of visual features, and to optimize the generator so that two images with the same latent appearance factor…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace recognition and analysis