Enabling Stroke-Level Structural Analysis of Hieroglyphic Scripts without Language-Specific Priors

Fuwen Luo; Zihao Wan; Ziyue Wang; Yaluo Liu; Pau Tong Lin Xu; Xuanjia Qiao; Xiaolong Wang; Peng Li; Yang Liu

arXiv:2601.05508·cs.CV·April 21, 2026

TL;DR

HieroSA is a novel framework that enables multimodal models to analyze hieroglyphic characters at the stroke level without language-specific prior knowledge.

Contribution

The paper introduces a generalizable method that extracts stroke-level structures of hieroglyphs directly from character images, improving structural understanding across scripts.

Findings

01. HieroSA effectively captures internal character structures and semantics.

02. The method generalizes across modern and ancient hieroglyphs.

03. Experimental results show improved structural analysis without handcrafted data.

Abstract

Hieroglyphs, as logographic writing systems, encode rich semantic and cultural information within their internal structural composition. Yet current advanced Large Language Models (LLMs) and Multimodal LLMs (MLLMs) usually remain structurally blind to this information. LLMs process characters as textual tokens, while MLLMs additionally view them as raw pixel grids. Both fall short of modeling the underlying logic of character strokes. Furthermore, existing structural analysis methods are often script-specific and labor-intensive. In this paper, we propose Hieroglyphic Stroke Analyzer (HieroSA), a novel and generalizable framework that enables MLLMs to automatically derive stroke-level structures from character bitmaps without handcrafted data. It transforms character images of modern logographic and ancient hieroglyphic scripts into explicit, interpretable line-segment representations in a normalized…
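
The abstract is cut off above, so the exact representation HieroSA uses is not shown here. As a rough illustration only, the sketch below assumes each stroke piece is a straight segment whose endpoints are scaled to the unit square, which lets the same description be redrawn at any bitmap resolution. The names `Segment`, `normalize`, and `rasterize` are hypothetical and are not taken from the paper or its repository.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Segment:
    """One stroke piece: a straight line between two normalized points.

    Hypothetical structure for illustration; not the authors' format.
    """
    x0: float
    y0: float
    x1: float
    y1: float


def normalize(pixel_segments, width, height):
    """Map pixel-space endpoints (x0, y0, x1, y1) into the unit square."""
    sx, sy = max(width - 1, 1), max(height - 1, 1)
    return [Segment(x0 / sx, y0 / sy, x1 / sx, y1 / sy)
            for x0, y0, x1, y1 in pixel_segments]


def rasterize(segments, size):
    """Draw normalized segments onto a size x size binary grid."""
    grid = [[0] * size for _ in range(size)]
    scale = size - 1
    for s in segments:
        # Oversample along the segment so no pixel on the line is skipped.
        longest = max(abs(s.x1 - s.x0), abs(s.y1 - s.y0))
        steps = max(1, int(longest * scale) * 2)
        for i in range(steps + 1):
            t = i / steps
            col = round((s.x0 + t * (s.x1 - s.x0)) * scale)
            row = round((s.y0 + t * (s.y1 - s.y0)) * scale)
            grid[row][col] = 1
    return grid


# A plus-shaped glyph (similar to the character 十) traced on a 9x9 bitmap:
# one horizontal and one vertical stroke, given as pixel endpoints.
strokes = normalize([(0, 4, 8, 4), (4, 0, 4, 8)], width=9, height=9)

# The same normalized strokes redrawn on a smaller 5x5 grid.
grid = rasterize(strokes, size=5)
for row in grid:
    print("".join("#" if cell else "." for cell in row))
```

Because the segments carry no pixel units, the two strokes recorded on the 9x9 bitmap can be redrawn on the 5x5 grid without change. This is the kind of resolution independence a normalized representation gives, whatever its exact form in the paper.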

Code & Models

Repositories

THUNLP-MT/HieroSA (GitHub)

Models

roufaen/HieroSA (Hugging Face model; 4 downloads, 1 like)
