Self-paced learning to improve text row detection in historical documents with missing labels
Mihaela Gaman, Lida Ghadamiyan, Radu Tudor Ionescu, Marius Popescu

TL;DR
This paper introduces a self-paced learning algorithm that enhances text row detection in historical documents with missing labels by progressively training on batches with varying annotation completeness, leading to significant accuracy improvements.
Contribution
The paper presents a novel self-paced learning approach that leverages pseudo-labeling and batch organization based on annotation completeness to improve detection performance in historical documents.
Findings
Improved average precision by over 12% on one dataset.
Achieved up to 39% increase in detection accuracy on another dataset.
Demonstrated effectiveness of self-paced learning in handling missing labels.
Abstract
An important preliminary step of optical character recognition systems is the detection of text rows. To address this task in the context of historical data with missing labels, we propose a self-paced learning algorithm capable of improving the row detection performance. We conjecture that pages with more ground-truth bounding boxes are less likely to have missing annotations. Based on this hypothesis, we sort the training examples in descending order with respect to the number of ground-truth bounding boxes, and organize them into k batches. Using our self-paced learning method, we train a row detector over k iterations, progressively adding batches with less ground-truth annotations. At each iteration, we combine the ground-truth bounding boxes with pseudo-bounding boxes (bounding boxes predicted by the model itself) using non-maximum suppression, and we include the resulting…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Vehicle License Plate Recognition · Image Retrieval and Classification Techniques
MethodsBNB Customer Service Number +1-833-534-1729 · *Communicated@Fast*How Do I Communicate to Expedia? · Feature Pyramid Network · Bottom-up Path Augmentation · Max Pooling · Convolution · Residual Connection · Tanh Activation · (TravEL!!Guide)How Do I File a Claim with Expedia? · + ( 1 ) ⟷ 888 ⟷ ( 829 ) ⟷ 0881 How do I file a claim with Expedia?
