Weakly supervised information extraction from inscrutable handwritten   document images

Sujoy Paul; Gagan Madan; Akankshya Mishra; Narayan Hegde and; Pradeep Kumar; Gaurav Aggarwal

arXiv:2306.06823·cs.CV·June 13, 2023·2 cites

Weakly supervised information extraction from inscrutable handwritten document images

Sujoy Paul, Gagan Madan, Akankshya Mishra, Narayan Hegde and, Pradeep Kumar, Gaurav Aggarwal

PDF

Open Access

TL;DR

This paper presents a weakly supervised method for extracting medicine names from handwritten prescriptions, overcoming OCR errors and limited annotations by identifying relevant regions and using a domain-specific language model trained on synthetic data.

Contribution

It introduces a novel weakly supervised approach that combines region identification with synthetic data-trained language models for handwritten medicine extraction.

Findings

01

>2.5x improvement over state-of-the-art methods

02

Effective extraction with only weak labels

03

Domain-specific language model enhances accuracy

Abstract

State-of-the-art information extraction methods are limited by OCR errors. They work well for printed text in form-like documents, but unstructured, handwritten documents still remain a challenge. Adapting existing models to domain-specific training data is quite expensive, because of two factors, 1) limited availability of the domain-specific documents (such as handwritten prescriptions, lab notes, etc.), and 2) annotations become even more challenging as one needs domain-specific knowledge to decode inscrutable handwritten document images. In this work, we focus on the complex problem of extracting medicine names from handwritten prescriptions using only weakly labeled data. The data consists of images along with the list of medicine names in it, but not their location in the image. We solve the problem by first identifying the regions of interest, i.e., medicine lines from just weak…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHandwritten Text Recognition Techniques · Natural Language Processing Techniques · Multimodal Machine Learning Applications

MethodsFocus