Chronicles-OCR: A Cross-Temporal Perception Benchmark for the Evolutionary Trajectory of Chinese Characters

Gengluo Li; Shangpin Peng; Xingyu Wan; Chengquan Zhang; Hao Feng; Xin Xu; Pian Wu; Bang Li; Zengmao Ding; Yongge Liu; Yipei Ye; Yang Yang; Zhan Shu; Guojun Yan; Zhe Li; Can Ma; Weiping Wang; Yu Zhou; Han Hu

arXiv:2605.11960·cs.CV·May 13, 2026

Chronicles-OCR: A Cross-Temporal Perception Benchmark for the Evolutionary Trajectory of Chinese Characters

Gengluo Li, Shangpin Peng, Xingyu Wan, Chengquan Zhang, Hao Feng, Xin Xu, Pian Wu, Bang Li, Zengmao Ding, Yongge Liu, Yipei Ye, Yang Yang, Zhan Shu, Guojun Yan, Zhe Li, Can Ma, Weiping Wang, Yu Zhou, Han Hu

PDF

1 Repo 1 Datasets

TL;DR

Chronicles-OCR is a comprehensive benchmark dataset designed to evaluate vision large language models' ability to perceive and interpret the morphological evolution of Chinese characters across thousands of years.

Contribution

It introduces a novel dataset with diverse historical Chinese script images, a stage-adaptive annotation paradigm, and four tasks to assess cross-temporal visual perception capabilities.

Findings

01

Current VLLMs show limitations in recognizing ancient Chinese characters.

02

The dataset enables systematic evaluation of perception across different historical stages.

03

The benchmark facilitates development of more robust, evolution-aware text perception models.

Abstract

Vision Large Language Models (VLLMs) have achieved remarkable success in modern text-rich visual understanding. However, their perceptual robustness in the face of the continuous morphological evolution of historical writing systems remains largely unexplored. Existing ancient text datasets typically focus on isolated historical periods, failing to capture the systematic visual distribution shifts spanning thousands of years. To bridge this gap and empower Digital Humanities, we introduce Chronicles-OCR, the first comprehensive benchmark specifically designed to evaluate the cross-temporal visual perception capabilities of VLLMs across the complete evolutionary trajectory of Chinese characters, known as the Seven Chinese Scripts. Curated in collaboration with top-tier institutional domain experts, the dataset comprises 2,800 strictly balanced images encompassing highly diverse physical…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

VirtualLUOUCAS/Chronicles-OCR
github

Datasets

VirtualLUO/Chronicles-OCR
dataset· 1.6k dl
1.6k dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.