SEA: Evaluating Sketch Abstraction Efficiency via Element-level Commonsense Visual Question Answering

Jiho Park; Sieun Choi; Jaeyoon Seo; Minho Sohn; Yeana Kim; Jihie Kim

arXiv:2603.28363·cs.CV·March 31, 2026

SEA: Evaluating Sketch Abstraction Efficiency via Element-level Commonsense Visual Question Answering

Jiho Park, Sieun Choi, Jaeyoon Seo, Minho Sohn, Yeana Kim, Jihie Kim

PDF

1 Datasets

TL;DR

SEA is a new reference-free metric for evaluating sketch abstraction efficiency by measuring semantic element retention using commonsense knowledge and visual question answering models.

Contribution

The paper introduces SEA, a novel metric for semantic sketch evaluation, and presents CommonSketch, a large annotated dataset for benchmarking sketch understanding.

Findings

01

SEA correlates well with human judgments of sketch abstraction.

02

CommonSketch provides systematic evaluation across vision-language models.

03

SEA effectively discriminates levels of abstraction efficiency.

Abstract

A sketch is a distilled form of visual abstraction that conveys core concepts through simplified yet purposeful strokes while omitting extraneous detail. Despite its expressive power, quantifying the efficiency of semantic abstraction in sketches remains challenging. Existing evaluation methods that rely on reference images, low-level visual features, or recognition accuracy do not capture abstraction, the defining property of sketches. To address these limitations, we introduce SEA (Sketch Evaluation metric for Abstraction efficiency), a reference-free metric that assesses how economically a sketch represents class-defining visual elements while preserving semantic recognizability. These elements are derived per class from commonsense knowledge about features typically depicted in sketches. SEA leverages a visual question answering model to determine the presence of each element and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

ziiio/CommonSketch
dataset· 296 dl
296 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.