cFineGAN: Unsupervised multi-conditional fine-grained image generation

Gunjan Aggarwal; Abhishek Sinha

arXiv:1912.05028 · cs.CV · December 12, 2019

TL;DR

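cFineGAN is an unsupervised image generation method that takes two input images and generates an image preserving the texture of one and the shape of the other. It extends FineGAN using standard and shape-biased pre-trained ImageNet models, with results on CUB-200-2011, Stanford Dogs, and UT Zappos50k.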

Contribution

cFineGAN extends FineGAN to multi-conditional generation, using shape-biased pre-trained models to synthesize images with controlled texture and shape, without supervision.

Findings

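1. Using a shape-biased pre-trained network benefits generation, as shown both qualitatively and quantitatively.

2. Results are reported on three benchmark datasets: CUB-200-2011, Stanford Dogs, and UT Zappos50k.

3. The pipeline is unsupervised: it conditions generation on two input images, taking texture from one and shape from the other.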

Abstract

We propose cFineGAN, an unsupervised multi-conditional image generation pipeline that generates an image conditioned on two input images, such that the generated image preserves the texture of one input and the shape of the other. To achieve this, we extend the recently proposed FineGAN (Singh et al., 2018) and make use of standard as well as shape-biased pre-trained ImageNet models. We demonstrate, both qualitatively and quantitatively, the benefit of using the shape-biased network. We present image generation results on three benchmark datasets: CUB-200-2011, Stanford Dogs, and UT Zappos50k.

Taxonomy

Topics: Generative Adversarial Networks and Image Synthesis · Multimodal Machine Learning Applications · Advanced Image and Video Retrieval Techniques