Style Customization of Text-to-Vector Generation with Image Diffusion Priors

Peiying Zhang; Nanxuan Zhao; Jing Liao

arXiv:2505.10558·cs.GR·May 16, 2025

Style Customization of Text-to-Vector Generation with Image Diffusion Priors

Peiying Zhang, Nanxuan Zhao, Jing Liao

PDF

Open Access

TL;DR

This paper introduces a two-stage pipeline for style customization in text-to-vector SVG generation, combining diffusion models and image priors to produce diverse, high-quality, styled SVGs efficiently.

Contribution

It presents a novel two-stage method that leverages diffusion models and image priors for customizable, structurally regular SVG generation from text prompts.

Findings

01

Effective style customization demonstrated through experiments

02

High-quality SVGs with structural regularity achieved

03

Diverse styled SVGs generated efficiently

Abstract

Scalable Vector Graphics (SVGs) are highly favored by designers due to their resolution independence and well-organized layer structure. Although existing text-to-vector (T2V) generation methods can create SVGs from text prompts, they often overlook an important need in practical applications: style customization, which is vital for producing a collection of vector graphics with consistent visual appearance and coherent aesthetics. Extending existing T2V methods for style customization poses certain challenges. Optimization-based T2V models can utilize the priors of text-to-image (T2I) models for customization, but struggle with maintaining structural regularity. On the other hand, feed-forward T2V models can ensure structural regularity, yet they encounter difficulties in disentangling content and style due to limited SVG training data. To address these challenges, we propose a novel…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVideo Analysis and Summarization

MethodsDiffusion