End-To-End Measure for Text Recognition

Gundram Leifert; Roger Labahn; Tobias Gr\"uning; Svenja; Leifert

arXiv:1908.09584·cs.CV·August 27, 2019

End-To-End Measure for Text Recognition

Gundram Leifert, Roger Labahn, Tobias Gr\"uning, Svenja, Leifert

PDF

TL;DR

This paper introduces a flexible, end-to-end evaluation measure for text recognition systems that accounts for alignment, reading order, and segmentation issues, providing a more accurate performance assessment.

Contribution

It proposes a novel, configurable evaluation measure based on character error rate for end-to-end text recognition systems, addressing limitations of existing metrics.

Findings

01

The measure accounts for reading order differences.

02

It can incorporate geometric positioning of text lines.

03

It effectively ignores over- and under-segmentation issues.

Abstract

Measuring the performance of text recognition and text line detection engines is an important step to objectively compare systems and their configuration. There exist well-established measures for both tasks separately. However, there is no sophisticated evaluation scheme to measure the quality of a combined text line detection and text recognition system. The F-measure on word level is a well-known methodology, which is sometimes used in this context. Nevertheless, it does not take into account the alignment of hypothesis and ground truth text and can lead to deceptive results. Since users of automatic information retrieval pipelines in the context of text recognition are mainly interested in the end-to-end performance of a given system, there is a strong need for such a measure. Hence, we present a measure to evaluate the quality of an end-to-end text recognition system. The basis for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.