Multi-attribute Pizza Generator: Cross-domain Attribute Control with   Conditional StyleGAN

Fangda Han; Guoyao Hao; Ricardo Guerrero; Vladimir Pavlovic

arXiv:2110.11830·cs.CV·October 25, 2021

Multi-attribute Pizza Generator: Cross-domain Attribute Control with Conditional StyleGAN

Fangda Han, Guoyao Hao, Ricardo Guerrero, Vladimir Pavlovic

PDF

Open Access 1 Repo

TL;DR

This paper introduces MPG, a GAN-based framework that synthesizes realistic pizza images with controllable ingredients, view angles, and styles, by extending StyleGAN2 and bridging real and CGI datasets for improved attribute control.

Contribution

The paper presents a novel multi-attribute conditional GAN architecture that effectively controls content, view, and style attributes in image synthesis, including a new CGI dataset and attribute regularization techniques.

Findings

01

MPG generates photo-realistic pizza images with specified attributes.

02

The model successfully controls multiple attributes beyond training data.

03

Bridging real and CGI datasets improves attribute generalization.

Abstract

Multi-attribute conditional image generation is a challenging problem in computervision. We propose Multi-attribute Pizza Generator (MPG), a conditional Generative Neural Network (GAN) framework for synthesizing images from a trichotomy of attributes: content, view-geometry, and implicit visual style. We design MPG by extending the state-of-the-art StyleGAN2, using a new conditioning technique that guides the intermediate feature maps to learn multi-scale multi-attribute entangled representationsof controlling attributes. Because of the complex nature of the multi-attribute image generation problem, we regularize the image generation by predicting the explicit conditioning attributes (ingredients and view). To synthesize a pizza image with view attributesoutside the range of natural training images, we design a CGI pizza dataset PizzaView using 3D pizza models and employ it to train a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

klory/mpg2
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Computer Graphics and Visualization Techniques · Advanced Vision and Imaging

MethodsTest · Convolution · R1 Regularization · Weight Demodulation · HuMan(Expedia)||How do I get a human at Expedia? · Path Length Regularization