Beyond the Pipeline: Analyzing Key Factors in End-to-End Deep Learning for Historical Writer Identification

Hanif Rasyidi; Moshiur Farazi

arXiv:2510.18671·cs.CV·October 22, 2025

Beyond the Pipeline: Analyzing Key Factors in End-to-End Deep Learning for Historical Writer Identification

Hanif Rasyidi, Moshiur Farazi

PDF

Open Access

TL;DR

This paper analyzes factors affecting end-to-end deep learning models for historical writer identification, highlighting challenges and proposing a simpler setup that performs comparably to complex systems.

Contribution

It provides a comprehensive analysis of various configurations and identifies key challenges, offering insights to improve end-to-end HWI systems.

Findings

01

Most configurations perform poorly due to weak visual feature capture.

02

Inconsistent patch representations and content noise affect performance.

03

A simpler end-to-end setup can achieve results comparable to top systems.

Abstract

This paper investigates various factors that influence the performance of end-to-end deep learning approaches for historical writer identification (HWI), a task that remains challenging due to the diversity of handwriting styles, document degradation, and the limited number of labelled samples per writer. These conditions often make accurate recognition difficult, even for human experts. Traditional HWI methods typically rely on handcrafted image processing and clustering techniques, which tend to perform well on small and carefully curated datasets. In contrast, end-to-end pipelines aim to automate the process by learning features directly from document images. However, our experiments show that many of these models struggle to generalise in more realistic, document-level settings, especially under zero-shot scenarios where writers in the test set are not present in the training data.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHandwritten Text Recognition Techniques · Topic Modeling · Authorship Attribution and Profiling