Style Customization of Text-to-Vector Generation with Image Diffusion Priors
Peiying Zhang, Nanxuan Zhao, Jing Liao

TL;DR
This paper introduces a two-stage pipeline for style customization in text-to-vector SVG generation, combining diffusion models and image priors to produce diverse, high-quality, styled SVGs efficiently.
Contribution
It presents a novel two-stage method that leverages diffusion models and image priors for customizable, structurally regular SVG generation from text prompts.
Findings
Effective style customization demonstrated through experiments
High-quality SVGs with structural regularity achieved
Diverse styled SVGs generated efficiently
Abstract
Scalable Vector Graphics (SVGs) are highly favored by designers due to their resolution independence and well-organized layer structure. Although existing text-to-vector (T2V) generation methods can create SVGs from text prompts, they often overlook an important need in practical applications: style customization, which is vital for producing a collection of vector graphics with consistent visual appearance and coherent aesthetics. Extending existing T2V methods for style customization poses certain challenges. Optimization-based T2V models can utilize the priors of text-to-image (T2I) models for customization, but struggle with maintaining structural regularity. On the other hand, feed-forward T2V models can ensure structural regularity, yet they encounter difficulties in disentangling content and style due to limited SVG training data. To address these challenges, we propose a novel…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Analysis and Summarization
MethodsDiffusion
