Sketch Down the FLOPs: Towards Efficient Networks for Human Sketch
Aneeshan Sain, Subhajit Maity, Pinaki Nath Chowdhury, Subhadeep Koley, Ayan Kumar Bhunia, Yi-Zhe Song

TL;DR
This paper introduces a novel approach to adapt efficient photo-based neural networks for sketch data, significantly reducing FLOPs while maintaining accuracy, thus enabling practical and efficient sketch recognition systems.
Contribution
It proposes a sketch-specific plug-in module and a cross-modal knowledge distillation method to adapt existing efficient networks for sketch data, achieving over 99% FLOPs reduction.
Findings
FLOPs reduced by 99.37% compared to full networks.
Maintained comparable accuracy with significantly fewer computations.
Effective adaptation of photo-efficient models to sketch data.
Abstract
As sketch research has collectively matured over time, its adaptation for at-mass commercialisation emerges on the immediate horizon. Despite an already mature research endeavour for photos, there is no research on the efficient inference specifically designed for sketch data. In this paper, we first demonstrate existing state-of-the-art efficient light-weight models designed for photos do not work on sketches. We then propose two sketch-specific components which work in a plug-n-play manner on any photo efficient network to adapt them to work on sketch data. We specifically chose fine-grained sketch-based image retrieval (FG-SBIR) as a demonstrator as the most recognised sketch problem with immediate commercial value. Technically speaking, we first propose a cross-modal knowledge distillation network to transfer existing photo efficient networks to be compatible with sketch, which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Face recognition and analysis · Visual Attention and Saliency Detection
MethodsKnowledge Distillation
