Joint Line Segmentation and Transcription for End-to-End Handwritten   Paragraph Recognition

Th\'eodore Bluche

arXiv:1604.08352·cs.CV·April 29, 2016·50 cites

Joint Line Segmentation and Transcription for End-to-End Handwritten Paragraph Recognition

Th\'eodore Bluche

PDF

Open Access

TL;DR

This paper introduces an end-to-end neural network model that jointly segments and transcribes handwritten paragraphs, reducing reliance on costly annotations and improving recognition accuracy.

Contribution

It modifies MDLSTM-RNNs to perform implicit line segmentation and transcription simultaneously, advancing towards full document recognition.

Findings

01

Competitive results with line-level trained networks

02

Effective implicit line segmentation via attention mechanisms

03

Significant progress towards full document transcription

Abstract

Offline handwriting recognition systems require cropped text line images for both training and recognition. On the one hand, the annotation of position and transcript at line level is costly to obtain. On the other hand, automatic line segmentation algorithms are prone to errors, compromising the subsequent recognition. In this paper, we propose a modification of the popular and efficient multi-dimensional long short-term memory recurrent neural networks (MDLSTM-RNNs) to enable end-to-end processing of handwritten paragraphs. More particularly, we replace the collapse layer transforming the two-dimensional representation into a sequence of predictions by a recurrent version which can recognize one line at a time. In the proposed model, a neural network performs a kind of implicit line segmentation by computing attention weights on the image representation. The experiments on paragraphs…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHandwritten Text Recognition Techniques · Natural Language Processing Techniques · Image Processing and 3D Reconstruction