Do Generalised Classifiers really work on Human Drawn Sketches?
Hmrishav Bandyopadhyay, Pinaki Nath Chowdhury, Aneeshan Sain,, Subhadeep Koley, Tao Xiang, Ayan Kumar Bhunia, Yi-Zhe Song

TL;DR
This paper introduces a novel approach that adapts CLIP for human sketch understanding, enabling better generalisation across categories and abstraction levels, surpassing existing methods in zero-shot and few-shot scenarios.
Contribution
It proposes a sketch-aware CLIP model with sketch-specific prompts and a codebook of abstraction prompts, advancing generalised sketch classification across unknown categories and abstraction levels.
Findings
Outperforms existing sketch representation methods in zero-shot learning
Effective across different abstraction levels from edge-maps to doodles
Surpasses prior algorithms in few-shot learning scenarios
Abstract
This paper, for the first time, marries large foundation models with human sketch understanding. We demonstrate what this brings -- a paradigm shift in terms of generalised sketch representation learning (e.g., classification). This generalisation happens on two fronts: (i) generalisation across unknown categories (i.e., open-set), and (ii) generalisation traversing abstraction levels (i.e., good and bad sketches), both being timely challenges that remain unsolved in the sketch literature. Our design is intuitive and centred around transferring the already stellar generalisation ability of CLIP to benefit generalised learning for sketches. We first "condition" the vanilla CLIP model by learning sketch-specific prompts using a novel auxiliary head of raster to vector sketch conversion. This importantly makes CLIP "sketch-aware". We then make CLIP acute to the inherently different sketch…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAesthetic Perception and Analysis
MethodsContrastive Language-Image Pre-training
