SketchyGAN: Towards Diverse and Realistic Sketch to Image Synthesis

Wengling Chen; James Hays

arXiv:1801.02753·cs.CV·April 16, 2018·39 cites

SketchyGAN: Towards Diverse and Realistic Sketch to Image Synthesis

Wengling Chen, James Hays

PDF

Open Access 1 Repo

TL;DR

SketchyGAN introduces a novel GAN-based method for generating diverse, realistic images from sketches across multiple categories, utilizing automatic data augmentation and a new network block to enhance image quality.

Contribution

The paper presents a new GAN architecture with a multi-scale input injection block and an automatic sketch data augmentation technique for improved sketch-to-image synthesis.

Findings

01

Produces more realistic images than previous methods

02

Achieves higher Inception Scores on multiple categories

03

Effective data augmentation improves synthesis quality

Abstract

Synthesizing realistic images from human drawn sketches is a challenging problem in computer graphics and vision. Existing approaches either need exact edge maps, or rely on retrieval of existing photographs. In this work, we propose a novel Generative Adversarial Network (GAN) approach that synthesizes plausible images from 50 categories including motorcycles, horses and couches. We demonstrate a data augmentation technique for sketches which is fully automatic, and we show that the augmented data is helpful to our task. We introduce a new network building block suitable for both the generator and discriminator which improves the information flow by injecting the input image at multiple scales. Compared to state-of-the-art image translation methods, our approach generates more realistic images and achieves significantly higher Inception Scores.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

wchen342/SketchyGAN
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Vision and Imaging · Computer Graphics and Visualization Techniques