Reason-SVG: Enhancing Structured Reasoning for Vector Graphics Generation with Reinforcement Learning

Ximing Xing; Ziteng Xue; Yandong Guan; Jing Zhang; Dong Xu; Qian Yu

arXiv:2505.24499·cs.CV·April 10, 2026

Reason-SVG: Enhancing Structured Reasoning for Vector Graphics Generation with Reinforcement Learning

Ximing Xing, Ziteng Xue, Yandong Guan, Jing Zhang, Dong Xu, Qian Yu

PDF

TL;DR

Reason-SVG introduces a structured reasoning framework with a two-stage training process, combining supervised fine-tuning and reinforcement learning, to enhance the quality and coherence of SVG generation by large language models.

Contribution

It pioneers the 'Drawing-with-Thought' paradigm and develops a hybrid reward function, significantly improving SVG generation accuracy and reasoning capabilities.

Findings

01

Achieved higher structural validity and semantic accuracy in SVGs.

02

Demonstrated improved visual coherence in generated SVGs.

03

Created a new dataset of 10,000 SVG-DwT pairs for training and evaluation.

Abstract

Generating high-quality Scalable Vector Graphics (SVGs) is challenging for Large Language Models (LLMs), as it requires advanced reasoning for structural validity, semantic accuracy, and visual coherence -- areas where current LLMs often struggle. In this work, we introduce Reason-SVG, a novel framework equipped with enhanced structured reasoning for SVG generation. Reason-SVG pioneers the ``Drawing-with-Thought'' (DwT) paradigm, in which models generate both SVG code and explicit design rationales. Reason-SVG follows a two-stage training strategy: First, Supervised Fine-Tuning (SFT) trains the LLM on the DwT paradigm to develop foundational reasoning abilities. Second, Reinforcement Learning (RL), utilizing Group Relative Policy Optimization (GRPO), empowers the model to generate both DwT and SVG rationales through refined, reward-driven reasoning. To enable reasoning-driven SVG…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.