FurryGAN: High Quality Foreground-aware Image Synthesis

Jeongmin Bae; Mingi Kwon; Youngjung Uh

arXiv:2208.10422·cs.CV·August 23, 2022

FurryGAN: High Quality Foreground-aware Image Synthesis

Jeongmin Bae, Mingi Kwon, Youngjung Uh

PDF

Open Access

TL;DR

FurryGAN is a novel generative adversarial network that produces high-quality, detailed foreground masks and images in an unsupervised manner by combining realism constraints, multi-scale masks, and auxiliary mask prediction.

Contribution

It introduces a new framework with three key components that effectively generate realistic foreground-aware images and masks without supervision.

Findings

01

Produces highly detailed alpha masks covering complex features like hair and fur.

02

Achieves realistic image synthesis with meaningful foreground-background separation.

03

Operates fully unsupervised, eliminating the need for labeled data.

Abstract

Foreground-aware image synthesis aims to generate images as well as their foreground masks. A common approach is to formulate an image as an masked blending of a foreground image and a background image. It is a challenging problem because it is prone to reach the trivial solution where either image overwhelms the other, i.e., the masks become completely full or empty, and the foreground and background are not meaningfully separated. We present FurryGAN with three key components: 1) imposing both the foreground image and the composite image to be realistic, 2) designing a mask as a combination of coarse and fine masks, and 3) guiding the generator by an auxiliary mask predictor in the discriminator. Our method produces realistic images with remarkably detailed alpha masks which cover hair, fur, and whiskers in a fully unsupervised manner.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Image Enhancement Techniques · Computer Graphics and Visualization Techniques