Handwritten and Printed Text Segmentation: A Signature Case Study

Sina Gholamian; Ali Vahdat

arXiv:2307.07887·cs.CV·August 29, 2023·2 cites

Handwritten and Printed Text Segmentation: A Signature Case Study

Sina Gholamian, Ali Vahdat

PDF

Open Access

TL;DR

This paper introduces a novel approach and dataset for segmenting overlapping handwritten and printed text in scanned documents, significantly improving segmentation accuracy especially in overlapping regions.

Contribution

It presents a new dataset, SignaTR6K, and a novel model architecture that enhances segmentation of overlapping handwritten and printed text, outperforming prior methods.

Findings

01

Outperforms prior work by 17.9% and 7.3% on IoU scores

02

Introduces a new dataset for real legal documents

03

Develops a model that better handles overlapping text segments

Abstract

While analyzing scanned documents, handwritten text can overlap with printed text. This overlap causes difficulties during the optical character recognition (OCR) and digitization process of documents, and subsequently, hurts downstream NLP tasks. Prior research either focuses solely on the binary classification of handwritten text or performs a three-class segmentation of the document, i.e., recognition of handwritten, printed, and background pixels. This approach results in the assignment of overlapping handwritten and printed pixels to only one of the classes, and thus, they are not accounted for in the other class. Thus, in this research, we develop novel approaches to address the challenges of handwritten and printed text segmentation. Our objective is to recover text from different classes in their entirety, especially enhancing the segmentation performance on overlapping…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHandwritten Text Recognition Techniques · Vehicle License Plate Recognition · Hand Gesture Recognition Systems