Hierarchical SVG Tokenization: Learning Compact Visual Programs for Scalable Vector Graphics Modeling

Ximing Xing; Ziteng Xue; Zhenxi Li; Weicong Liang; Linqing Wang; Zhantao Yang; Tiankai Hang; Zijin Yin; Qinglin Lu; Chunyu Wang; Qian Yu

arXiv:2604.05072 · cs.LG · April 13, 2026

TL;DR

HiVG introduces a hierarchical SVG tokenization framework that enhances vector graphics generation by improving sequence efficiency and spatial consistency through structured tokens and a novel initialization strategy.

Contribution

The paper presents HiVG, a hierarchical tokenization method and training paradigm that significantly improves SVG program synthesis over traditional byte-level approaches.

Findings

01 Enhanced generation fidelity and spatial consistency in SVG outputs.

02 Improved sequence efficiency and reduced token redundancy.

03 Better stability in learning executable SVG programs.

Abstract

Recent large language models have shifted SVG generation from differentiable rendering optimization to autoregressive program synthesis. However, existing approaches still rely on generic byte-level tokenization inherited from natural language processing, which poorly reflects the geometric structure of vector graphics. Numerical coordinates are fragmented into discrete symbols, destroying spatial relationships and introducing severe token redundancy, often leading to coordinate hallucination and inefficient long-sequence generation. To address these challenges, we propose HiVG, a hierarchical SVG tokenization framework tailored for autoregressive vector graphics generation. HiVG decomposes raw SVG strings into structured atomic tokens and further compresses executable command-parameter groups into geometry-constrained segment tokens, substantially improving sequence…
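
The two tokenization levels described in the abstract can be illustrated with a small sketch. This is not the HiVG tokenizer: the path string, the regular expression, the arity table, and the function names `char_tokens`, `atomic_tokens`, and `segment_tokens` are all assumptions made for illustration, and the character-level split is only a crude stand-in for byte-level tokenization. The sketch shows how keeping each coordinate whole, and then grouping each command with its parameters, shortens the sequence for one SVG path.

```python
import re

# Hypothetical path data, used only for illustration (not from the paper).
PATH_D = "M 10.5 20 L 30 40.25 C 50 60 70 80 90 100 Z"

# Number of numeric parameters each absolute SVG path command takes.
ARITY = {"M": 2, "L": 2, "H": 1, "V": 1, "C": 6,
         "S": 4, "Q": 4, "T": 2, "A": 7, "Z": 0}

# Matches one command letter or one whole (possibly signed, decimal) number.
TOKEN_RE = re.compile(r"[MLHVCSQTAZ]|-?\d*\.?\d+")


def char_tokens(d):
    """Crude stand-in for byte-level tokenization: one token per character."""
    return [ch for ch in d if not ch.isspace()]


def atomic_tokens(d):
    """Split path data into whole commands and whole numbers."""
    return TOKEN_RE.findall(d)


def segment_tokens(atoms):
    """Group each command with exactly the parameters it requires.

    Raises ValueError when a command is missing parameters, which is a
    simple stand-in for the geometric constraint on segments.
    """
    segments = []
    i = 0
    while i < len(atoms):
        cmd = atoms[i]
        if cmd not in ARITY:
            raise ValueError(f"expected a command, got {cmd!r}")
        n = ARITY[cmd]
        params = atoms[i + 1:i + 1 + n]
        if len(params) != n or any(p in ARITY for p in params):
            raise ValueError(f"command {cmd!r} needs {n} parameters")
        segments.append((cmd, tuple(float(p) for p in params)))
        i += 1 + n
    return segments


if __name__ == "__main__":
    atoms = atomic_tokens(PATH_D)
    print(len(char_tokens(PATH_D)), len(atoms), len(segment_tokens(atoms)))
```

For this example path the three levels yield 30 character tokens, 14 atomic tokens, and 4 segments. The arity check also rejects incomplete groups such as an `L` followed by a single number, so every emitted segment corresponds to an executable path command.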

Code & Models

Repositories

ximinng/HiVG
github

Models

xingxm/HiVG-3B-Base
Hugging Face model · 302 downloads · 6 likes