Picture that Sketch: Photorealistic Image Generation from Abstract   Sketches

Subhadeep Koley; Ayan Kumar Bhunia; Aneeshan Sain; Pinaki Nath; Chowdhury; Tao Xiang; Yi-Zhe Song

arXiv:2303.11162·cs.CV·March 31, 2023·1 cites

Picture that Sketch: Photorealistic Image Generation from Abstract Sketches

Subhadeep Koley, Ayan Kumar Bhunia, Aneeshan Sain, Pinaki Nath, Chowdhury, Tao Xiang, Yi-Zhe Song

PDF

Open Access

TL;DR

This paper introduces a novel method for converting abstract, free-hand sketches into photorealistic images using a decoupled encoder-decoder framework with StyleGAN, making sketch-to-photo generation accessible to amateurs.

Contribution

It presents a new decoupled training paradigm and an autoregressive sketch mapper that effectively bridge the abstraction gap in sketch-to-photo synthesis, enabling photorealistic outputs from simple sketches.

Findings

01

Achieved photorealistic image generation from amateur sketches.

02

Surpassed state-of-the-art in sketch-based image retrieval tasks.

03

Democratized sketch-to-photo pipeline for non-expert users.

Abstract

Given an abstract, deformed, ordinary sketch from untrained amateurs like you and me, this paper turns it into a photorealistic image - just like those shown in Fig. 1(a), all non-cherry-picked. We differ significantly from prior art in that we do not dictate an edgemap-like sketch to start with, but aim to work with abstract free-hand human sketches. In doing so, we essentially democratise the sketch-to-photo pipeline, "picturing" a sketch regardless of how good you sketch. Our contribution at the outset is a decoupled encoder-decoder training paradigm, where the decoder is a StyleGAN trained on photos only. This importantly ensures that generated results are always photorealistic. The rest is then all centred around how best to deal with the abstraction gap between sketch and photo. For that, we propose an autoregressive sketch mapper trained on sketch-photo pairs that maps a sketch…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications · Visual Attention and Saliency Detection

MethodsHuMan(Expedia)||How do I get a human at Expedia? · R1 Regularization · Dense Connections · Convolution · Feedforward Network · Adaptive Instance Normalization · StyleGAN