SketchTransfer: A Challenging New Task for Exploring Detail-Invariance   and the Abstractions Learned by Deep Networks

Alex Lamb; Sherjil Ozair; Vikas Verma; David Ha

arXiv:1912.11570·cs.CV·December 30, 2019

SketchTransfer: A Challenging New Task for Exploring Detail-Invariance and the Abstractions Learned by Deep Networks

Alex Lamb, Sherjil Ozair, Vikas Verma, David Ha

PDF

Open Access 1 Repo

TL;DR

This paper introduces the SketchTransfer dataset and task to evaluate deep networks' ability to generalize across detail variations, revealing current models' limitations in achieving human-like detail-invariance.

Contribution

The authors propose a new dataset and task, SketchTransfer, to study detail-invariance in deep networks, highlighting the gap between human and machine generalization abilities.

Findings

01

State-of-the-art domain transfer algorithms perform poorly on SketchTransfer

02

Current models achieve only 59% accuracy compared to 87% when trained directly on sketches

03

The task exposes significant room for improvement in deep network generalization

Abstract

Deep networks have achieved excellent results in perceptual tasks, yet their ability to generalize to variations not seen during training has come under increasing scrutiny. In this work we focus on their ability to have invariance towards the presence or absence of details. For example, humans are able to watch cartoons, which are missing many visual details, without being explicitly trained to do so. As another example, 3D rendering software is a relatively recent development, yet people are able to understand such rendered scenes even though they are missing details (consider a film like Toy Story). The failure of machine learning algorithms to do this indicates a significant gap in generalization between human abilities and the abilities of deep networks. We propose a dataset that will make it easier to study the detail-invariance problem concretely. We produce a concrete task for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

googlecreativelab/quickdraw-dataset
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Generative Adversarial Networks and Image Synthesis · Advanced Image Processing Techniques