Evaluation of a Region Proposal Architecture for Multi-task Document Layout Analysis
Lorenzo Quir\'os, Enrique Vidal

TL;DR
This paper evaluates the Mask-RCNN architecture for integrated baseline detection and region segmentation in handwritten documents, demonstrating its effectiveness across multiple datasets and outperforming existing methods.
Contribution
It introduces an integrated approach using Mask-RCNN for simultaneous baseline detection and region segmentation in handwritten documents, showing promising results.
Findings
Outperforms state-of-the-art techniques on three datasets
Effective for both handwritten text and music documents
Demonstrates the versatility of Mask-RCNN in document layout analysis
Abstract
Automatically recognizing the layout of handwritten documents is an important step towards useful extraction of information from those documents. The most common application is to feed downstream applications such as automatic text recognition and keyword spotting; however, the recognition of the layout also helps to establish relationships between elements in the document which allows to enrich the information that can be extracted. Most of the modern document layout analysis systems are designed to address only one part of the document layout problem, namely: baseline detection or region segmentation. In contrast, we evaluate the effectiveness of the Mask-RCNN architecture to address the problem of baseline detection and region segmentation in an integrated manner. We present experimental results on two handwritten text datasets and one handwritten music dataset. The analyzed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Vehicle License Plate Recognition · Image Processing and 3D Reconstruction
