Beyond the Pipeline: Analyzing Key Factors in End-to-End Deep Learning for Historical Writer Identification
Hanif Rasyidi, Moshiur Farazi

TL;DR
This paper analyzes factors affecting end-to-end deep learning models for historical writer identification, highlighting challenges and proposing a simpler setup that performs comparably to complex systems.
Contribution
It provides a comprehensive analysis of various configurations and identifies key challenges, offering insights to improve end-to-end HWI systems.
Findings
Most configurations perform poorly due to weak visual feature capture.
Inconsistent patch representations and content noise affect performance.
A simpler end-to-end setup can achieve results comparable to top systems.
Abstract
This paper investigates various factors that influence the performance of end-to-end deep learning approaches for historical writer identification (HWI), a task that remains challenging due to the diversity of handwriting styles, document degradation, and the limited number of labelled samples per writer. These conditions often make accurate recognition difficult, even for human experts. Traditional HWI methods typically rely on handcrafted image processing and clustering techniques, which tend to perform well on small and carefully curated datasets. In contrast, end-to-end pipelines aim to automate the process by learning features directly from document images. However, our experiments show that many of these models struggle to generalise in more realistic, document-level settings, especially under zero-shot scenarios where writers in the test set are not present in the training data.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Topic Modeling · Authorship Attribution and Profiling
