End-To-End Measure for Text Recognition
Gundram Leifert, Roger Labahn, Tobias Gr\"uning, Svenja, Leifert

TL;DR
This paper introduces a flexible, end-to-end evaluation measure for text recognition systems that accounts for alignment, reading order, and segmentation issues, providing a more accurate performance assessment.
Contribution
It proposes a novel, configurable evaluation measure based on character error rate for end-to-end text recognition systems, addressing limitations of existing metrics.
Findings
The measure accounts for reading order differences.
It can incorporate geometric positioning of text lines.
It effectively ignores over- and under-segmentation issues.
Abstract
Measuring the performance of text recognition and text line detection engines is an important step to objectively compare systems and their configuration. There exist well-established measures for both tasks separately. However, there is no sophisticated evaluation scheme to measure the quality of a combined text line detection and text recognition system. The F-measure on word level is a well-known methodology, which is sometimes used in this context. Nevertheless, it does not take into account the alignment of hypothesis and ground truth text and can lead to deceptive results. Since users of automatic information retrieval pipelines in the context of text recognition are mainly interested in the end-to-end performance of a given system, there is a strong need for such a measure. Hence, we present a measure to evaluate the quality of an end-to-end text recognition system. The basis for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
