SELECT: Detecting Label Errors in Real-world Scene Text Data

Wenjun Liu; Qian Wu; Yifeng Hu; Yuke Li

arXiv:2512.14050·cs.CV·December 17, 2025

SELECT: Detecting Label Errors in Real-world Scene Text Data

Wenjun Liu, Qian Wu, Yifeng Hu, Yuke Li

PDF

Open Access

TL;DR

SELECT is a novel multi-modal approach that effectively detects label errors in real-world scene text datasets, addressing challenges like variable-length labels and character errors, and improving scene text recognition accuracy.

Contribution

Introduces SELECT, a multi-modal method with SSLC for realistic label error detection in scene text data, handling variable-length labels and character similarities.

Findings

01

SELECT outperforms existing methods in accuracy.

02

Detects label errors in real-world datasets effectively.

03

Improves scene text recognition performance.

Abstract

We introduce SELECT (Scene tExt Label Errors deteCTion), a novel approach that leverages multi-modal training to detect label errors in real-world scene text datasets. Utilizing an image-text encoder and a character-level tokenizer, SELECT addresses the issues of variable-length sequence labels, label sequence misalignment, and character-level errors, outperforming existing methods in accuracy and practical utility. In addition, we introduce Similarity-based Sequence Label Corruption (SSLC), a process that intentionally introduces errors into the training labels to mimic real-world error scenarios during training. SSLC not only can cause a change in the sequence length but also takes into account the visual similarity between characters during corruption. Our method is the first to detect label errors in real-world scene text datasets successfully accounting for variable-length labels.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Handwritten Text Recognition Techniques · Text and Document Classification Technologies