SwiftSketch: A Diffusion Model for Image-to-Vector Sketch Generation

Ellie Arar; Yarden Frenkel; Daniel Cohen-Or; Ariel Shamir; Yael Vinker

arXiv:2502.08642·cs.CV·February 13, 2025

SwiftSketch: A Diffusion Model for Image-to-Vector Sketch Generation

Ellie Arar, Yarden Frenkel, Daniel Cohen-Or, Ariel Shamir, Yael Vinker

PDF

Open Access

TL;DR

SwiftSketch is a fast diffusion model that generates high-quality, image-conditioned vector sketches in under a second, overcoming the slow optimization of previous methods.

Contribution

It introduces SwiftSketch, a novel diffusion-based approach with a transformer-decoder architecture for rapid, high-quality vector sketch generation conditioned on images.

Findings

01

Generates sketches in less than a second.

02

Produces high-fidelity, natural sketches.

03

Generalizes across diverse concepts.

Abstract

Recent advancements in large vision-language models have enabled highly expressive and diverse vector sketch generation. However, state-of-the-art methods rely on a time-consuming optimization process involving repeated feedback from a pretrained model to determine stroke placement. Consequently, despite producing impressive sketches, these methods are limited in practical applications. In this work, we introduce SwiftSketch, a diffusion model for image-conditioned vector sketch generation that can produce high-quality sketches in less than a second. SwiftSketch operates by progressively denoising stroke control points sampled from a Gaussian distribution. Its transformer-decoder architecture is designed to effectively handle the discrete nature of vector representation and capture the inherent global dependencies between strokes. To train SwiftSketch, we construct a synthetic dataset…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputer Graphics and Visualization Techniques · 3D Shape Modeling and Analysis

MethodsDiffusion