SketchTransfer: A Challenging New Task for Exploring Detail-Invariance and the Abstractions Learned by Deep Networks
Alex Lamb, Sherjil Ozair, Vikas Verma, David Ha

TL;DR
This paper introduces the SketchTransfer dataset and task to evaluate deep networks' ability to generalize across detail variations, revealing current models' limitations in achieving human-like detail-invariance.
Contribution
The authors propose a new dataset and task, SketchTransfer, to study detail-invariance in deep networks, highlighting the gap between human and machine generalization abilities.
Findings
State-of-the-art domain transfer algorithms perform poorly on SketchTransfer
Current models achieve only 59% accuracy compared to 87% when trained directly on sketches
The task exposes significant room for improvement in deep network generalization
Abstract
Deep networks have achieved excellent results in perceptual tasks, yet their ability to generalize to variations not seen during training has come under increasing scrutiny. In this work we focus on their ability to have invariance towards the presence or absence of details. For example, humans are able to watch cartoons, which are missing many visual details, without being explicitly trained to do so. As another example, 3D rendering software is a relatively recent development, yet people are able to understand such rendered scenes even though they are missing details (consider a film like Toy Story). The failure of machine learning algorithms to do this indicates a significant gap in generalization between human abilities and the abilities of deep networks. We propose a dataset that will make it easier to study the detail-invariance problem concretely. We produce a concrete task for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Generative Adversarial Networks and Image Synthesis · Advanced Image Processing Techniques
