Scribe Verification in Chinese manuscripts using Siamese, Triplet, and Vision Transformer Neural Networks

Dimitrios-Chrysovalantis Liakopoulos; Yanbo Zhang; Chongsheng Zhang; Constantine Kotropoulos

arXiv:2603.13877·cs.LG·March 17, 2026

Scribe Verification in Chinese manuscripts using Siamese, Triplet, and Vision Transformer Neural Networks

Dimitrios-Chrysovalantis Liakopoulos, Yanbo Zhang, Chongsheng Zhang, Constantine Kotropoulos

PDF

Open Access

TL;DR

This paper explores deep learning techniques, including Siamese, Triplet, and Vision Transformer models, to verify if Chinese manuscript fragments are by the same scribe, achieving high accuracy on two datasets.

Contribution

It introduces and compares multiple neural network architectures, including Transformer-based models, for scribe verification in Chinese manuscripts, demonstrating their effectiveness.

Findings

01

MobileNetV3+ Custom Siamese model achieves top accuracy

02

Contrastive loss improves model performance

03

Transformer models perform competitively in scribe verification

Abstract

The paper examines deep learning models for scribe verification in Chinese manuscripts. That is, to automatically determine whether two manuscript fragments were written by the same scribe using deep metric learning methods. Two datasets were used: the Tsinghua Bamboo Slips Dataset and a selected subset of the Multi-Attribute Chinese Calligraphy Dataset, focusing on the calligraphers with a large number of samples. Siamese and Triplet neural network architectures are implemented, including convolutional and Transformer-based models. The experimental results show that the MobileNetV3+ Custom Siamese model trained with contrastive loss achieves either the best or the second-best overall accuracy and area under the Receiver Operating Characteristic Curve on both datasets.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHandwritten Text Recognition Techniques · Image Processing and 3D Reconstruction · Artificial Intelligence in Games