Learning Geometry-aware Representations by Sketching
Hyundo Lee, Inwoo Hwang, Hyunsung Go, Won-Seok Choi, Kibeom Kim,, Byoung-Tak Zhang

TL;DR
This paper introduces Learning by Sketching (LBS), a method that converts images into sketches with geometric information, improving scene understanding and downstream vision tasks without requiring sketch datasets.
Contribution
LBS is a novel approach that learns to generate geometric sketches from images in a single step, preserving geometric information without needing explicit sketch supervision.
Findings
Improves object attribute classification on CLEVR dataset
Enhances domain transfer between CLEVR and STL-10 datasets
Provides rich geometric information for various downstream tasks
Abstract
Understanding geometric concepts, such as distance and shape, is essential for understanding the real world and also for many vision tasks. To incorporate such information into a visual representation of a scene, we propose learning to represent the scene by sketching, inspired by human behavior. Our method, coined Learning by Sketching (LBS), learns to convert an image into a set of colored strokes that explicitly incorporate the geometric information of the scene in a single inference step without requiring a sketch dataset. A sketch is then generated from the strokes where CLIP-based perceptual loss maintains a semantic similarity between the sketch and the image. We show theoretically that sketching is equivariant with respect to arbitrary affine transformations and thus provably preserves geometric information. Experimental results show that LBS substantially improves the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Advanced Image and Video Retrieval Techniques · Advanced Vision and Imaging
