VecGlypher: Unified Vector Glyph Generation with Language Models

Xiaoke Huang; Bhavul Gauri; Kam Woh Ng; Tony Ng; Mengmeng Xu; Zhiheng Liu; Weiming Ren; Zhaochong An; Zijian Zhou; Haonan Qiu; Yuyin Zhou; Sen He; Ziheng Wang; Tao Xiang; Xiao Han

arXiv:2602.21461·cs.CL·February 26, 2026

VecGlypher: Unified Vector Glyph Generation with Language Models

Xiaoke Huang, Bhavul Gauri, Kam Woh Ng, Tony Ng, Mengmeng Xu, Zhiheng Liu, Weiming Ren, Zhaochong An, Zijian Zhou, Haonan Qiu, Yuyin Zhou, Sen He, Ziheng Wang, Tao Xiang, Xiao Han

PDF

Open Access

TL;DR

VecGlypher is a multimodal language model that directly generates editable vector glyphs from text or images, improving accessibility and quality in digital typography.

Contribution

It introduces a unified model that generates high-fidelity SVG glyphs from descriptions or exemplars, combining large-scale training and expert annotation for improved performance.

Findings

01

Outperforms general-purpose LLMs and specialized baselines in cross-family OOD evaluation.

02

Achieves state-of-the-art results in image-referenced glyph generation.

03

Model scale and a two-stage training recipe are critical for performance.

Abstract

Vector glyphs are the atomic units of digital typography, yet most learning-based pipelines still depend on carefully curated exemplar sheets and raster-to-vector postprocessing, which limits accessibility and editability. We introduce VecGlypher, a single multimodal language model that generates high-fidelity vector glyphs directly from text descriptions or image exemplars. Given a style prompt, optional reference glyph images, and a target character, VecGlypher autoregressively emits SVG path tokens, avoiding raster intermediates and producing editable, watertight outlines in one pass. A typography-aware data and training recipe makes this possible: (i) a large-scale continuation stage on 39K noisy Envato fonts to master SVG syntax and long-horizon geometry, followed by (ii) post-training on 2.5K expert-annotated Google Fonts with descriptive tags and exemplars to align language and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Handwritten Text Recognition Techniques · Digital Humanities and Scholarship