The Character Error Vector: Decomposable errors for page-level OCR evaluation

Jonathan Bourne; Mwiza Simbeye; Joseph Nockels

arXiv:2604.06160·cs.CV·April 8, 2026

The Character Error Vector: Decomposable errors for page-level OCR evaluation

Jonathan Bourne, Mwiza Simbeye, Joseph Nockels

PDF

1 Datasets

TL;DR

The paper introduces the Character Error Vector (CEV), a decomposable OCR evaluation metric that addresses limitations of CER at page-level, enabling better assessment of OCR and parsing errors in complex documents.

Contribution

The paper proposes the CEV, a novel decomposable metric for OCR evaluation that bridges parsing and character-level errors, validated on complex archival newspaper data.

Findings

01

CEV correlates well with CER and parse quality.

02

Traditional pipeline approaches outperform end-to-end models on complex layouts.

03

Thresholding on CEV easily predicts main error sources with high F1 score.

Abstract

The Character Error Rate (CER) is a key metric for evaluating the quality of Optical Character Recognition (OCR). However, this metric assumes that text has been perfectly parsed, which is often not the case. Under page-parsing errors, CER becomes undefined, limiting its use as a metric and making evaluating page-level OCR challenging, particularly when using data that do not share a labelling schema. We introduce the Character Error Vector (CEV), a bag-of-characters evaluator for OCR. The CEV can be decomposed into parsing and OCR, and interaction error components. This decomposability allows practitioners to focus on the part of the Document Understanding pipeline that will have the greatest impact on overall text extraction quality. The CEV can be implemented using a variety of methods, of which we demonstrate SpACER (Spatially Aware Character Error Rate) and a Character distribution…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

Jonnob/the-spiritualist-enriched
dataset· 48 dl
48 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.