End-to-End Page-Level Assessment of Handwritten Text Recognition
Enrique Vidal, Alejandro H. Toselli, Antonio R\'ios-Vila, Jorge, Calvo-Zaragoza

TL;DR
This paper proposes a two-fold evaluation method for handwritten text recognition at the page level, combining transcription accuracy and reading order quality, validated through experiments showing its effectiveness.
Contribution
It introduces a novel two-fold evaluation framework for page-level HTR that separately assesses transcription accuracy and reading order, supported by empirical comparison of metrics.
Findings
The combined use of WER and bWER effectively evaluates transcription and layout errors.
The difference between WER and bWER correlates with layout analysis errors.
The proposed evaluation approach is validated through real and simulated experiments.
Abstract
The evaluation of Handwritten Text Recognition (HTR) systems has traditionally used metrics based on the edit distance between HTR and ground truth (GT) transcripts, at both the character and word levels. This is very adequate when the experimental protocol assumes that both GT and HTR text lines are the same, which allows edit distances to be independently computed to each given line. Driven by recent advances in pattern recognition, HTR systems increasingly face the end-to-end page-level transcription of a document, where the precision of locating the different text lines and their corresponding reading order (RO) play a key role. In such a case, the standard metrics do not take into account the inconsistencies that might appear. In this paper, the problem of evaluating HTR systems at the page level is introduced in detail. We analyse the convenience of using a two-fold evaluation,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Image Processing and 3D Reconstruction
