Indiscapes: Instance Segmentation Networks for Layout Parsing of   Historical Indic Manuscripts

Abhishek Prusty; Sowmya Aitha; Abhishek Trivedi; Ravi Kiran; Sarvadevabhatla

arXiv:1912.07025·cs.CV·December 17, 2019

Indiscapes: Instance Segmentation Networks for Layout Parsing of Historical Indic Manuscripts

Abhishek Prusty, Sowmya Aitha, Abhishek Trivedi, Ravi Kiran, Sarvadevabhatla

PDF

1 Repo

TL;DR

This paper introduces Indiscapes, a novel dataset with layout annotations for historical Indic manuscripts, and proposes a deep learning architecture for automatic instance-level layout parsing, facilitating OCR and related tasks.

Contribution

The paper presents the first dataset with multi-regional layout annotations for Indic manuscripts and adapts a neural network for automatic layout parsing in complex, dense manuscript images.

Findings

01

Effective layout parsing on the Indiscapes dataset

02

Development of a flexible annotation tool for domain experts

03

Enabling downstream OCR and word-spotting applications

Abstract

Historical palm-leaf manuscript and early paper documents from Indian subcontinent form an important part of the world's literary and cultural heritage. Despite their importance, large-scale annotated Indic manuscript image datasets do not exist. To address this deficiency, we introduce Indiscapes, the first ever dataset with multi-regional layout annotations for historical Indic manuscripts. To address the challenge of large diversity in scripts and presence of dense, irregular layout elements (e.g. text lines, pictures, multiple documents per image), we adapt a Fully Convolutional Deep Neural Network architecture for fully automatic, instance-level spatial layout parsing of manuscript images. We demonstrate the effectiveness of proposed architecture on images from the Indiscapes dataset. For annotation flexibility and keeping the non-technical nature of domain experts in mind, we also…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ihdia/instance-segmentation-v1
tf

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.