Towards End-to-end Handwritten Document Recognition
Denis Coquenet

TL;DR
This paper introduces an end-to-end approach for handwritten document recognition that processes entire documents without segmentation, achieving state-of-the-art results and handling complex layouts more effectively.
Contribution
It presents the first end-to-end method for recognizing handwritten text and layout in whole documents, moving beyond traditional multi-step segmentation approaches.
Findings
Achieved state-of-the-art results at paragraph level on RIMES 2011, IAM, and READ 2016 datasets.
Outperformed line-level state of the art on the same datasets.
Proposed new metrics for evaluating document-level handwritten recognition.
Abstract
Handwritten text recognition has been widely studied in the last decades for its numerous applications. Nowadays, the state-of-the-art approach consists in a three-step process. The document is segmented into text lines, which are then ordered and recognized. However, this three-step approach has many drawbacks. The three steps are treated independently whereas they are closely related. Errors accumulate from one step to the other. The ordering step is based on heuristic rules which prevent its use for documents with a complex layouts or for heterogeneous documents. The need for additional physical segmentation annotations for training the segmentation stage is inherent to this approach. In this thesis, we propose to tackle these issues by performing the handwritten text recognition of whole document in an end-to-end way. To this aim, we gradually increase the difficulty of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Vehicle License Plate Recognition · Natural Language Processing Techniques
