SPAN: a Simple Predict & Align Network for Handwritten Paragraph   Recognition

Denis Coquenet; Cl\'ement Chatelain; Thierry Paquet

arXiv:2102.08742·cs.CV·September 13, 2021

SPAN: a Simple Predict & Align Network for Handwritten Paragraph Recognition

Denis Coquenet, Cl\'ement Chatelain, Thierry Paquet

PDF

1 Repo

TL;DR

The paper introduces SPAN, an end-to-end, segmentation-free neural network for recognizing handwritten paragraphs, achieving competitive accuracy without prior line segmentation or dataset adaptation.

Contribution

It presents a simple, recurrence-free, fully convolutional model that recognizes handwritten paragraphs directly at the document level, eliminating the need for segmentation and line break annotations.

Findings

01

Achieves competitive results on RIMES, IAM, and READ 2016 datasets.

02

Does not require segmentation labels or line break annotations.

03

Can be trained from scratch without dataset adaptation.

Abstract

Unconstrained handwriting recognition is an essential task in document analysis. It is usually carried out in two steps. First, the document is segmented into text lines. Second, an Optical Character Recognition model is applied on these line images. We propose the Simple Predict & Align Network: an end-to-end recurrence-free Fully Convolutional Network performing OCR at paragraph level without any prior segmentation stage. The framework is as simple as the one used for the recognition of isolated lines and we achieve competitive results on three popular datasets: RIMES, IAM and READ 2016. The proposed model does not require any dataset adaptation, it can be trained from scratch, without segmentation labels, and it does not require line breaks in the transcription labels. Our code and trained model weights are available at https://github.com/FactoDeepLearning/SPAN.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

FactoDeepLearning/SPAN
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsALIGN