The Character Error Vector: Decomposable errors for page-level OCR evaluation
Jonathan Bourne, Mwiza Simbeye, Joseph Nockels

TL;DR
The paper introduces the Character Error Vector (CEV), a decomposable OCR evaluation metric that addresses limitations of CER at page-level, enabling better assessment of OCR and parsing errors in complex documents.
Contribution
The paper proposes the CEV, a novel decomposable metric for OCR evaluation that bridges parsing and character-level errors, validated on complex archival newspaper data.
Findings
CEV correlates well with CER and parse quality.
Traditional pipeline approaches outperform end-to-end models on complex layouts.
Thresholding on CEV easily predicts main error sources with high F1 score.
Abstract
The Character Error Rate (CER) is a key metric for evaluating the quality of Optical Character Recognition (OCR). However, this metric assumes that text has been perfectly parsed, which is often not the case. Under page-parsing errors, CER becomes undefined, limiting its use as a metric and making evaluating page-level OCR challenging, particularly when using data that do not share a labelling schema. We introduce the Character Error Vector (CEV), a bag-of-characters evaluator for OCR. The CEV can be decomposed into parsing and OCR, and interaction error components. This decomposability allows practitioners to focus on the part of the Document Understanding pipeline that will have the greatest impact on overall text extraction quality. The CEV can be implemented using a variety of methods, of which we demonstrate SpACER (Spatially Aware Character Error Rate) and a Character distribution…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
