Tabular Structure Detection from Document Images for Resource Constrained Devices Using A Row Based Similarity Measure
Soumyadeep Dey, Jayanta Mukhopadhyay, Shamik Sural

TL;DR
This paper introduces a layout-independent, resource-efficient method for detecting tables in document images by using a row similarity measure, suitable for deployment on resource-constrained devices.
Contribution
The authors propose a novel row similarity measure that detects tabular regions without relying on prior layout knowledge or extensive parameters, enabling use on resource-limited devices.
Findings
Effective detection of various table layouts.
Operates efficiently on resource-constrained devices.
Does not require large training datasets or deep learning models.
Abstract
Tabular structures are used to present crucial information in a structured and crisp manner. Detection of such regions is of great importance for proper understanding of a document. Tabular structures can be of various layouts and types. Therefore, detection of these regions is a hard problem. Most of the existing techniques detect tables from a document image by using prior knowledge of the structures of the tables. However, these methods are not applicable for generalized tabular structures. In this work, we propose a similarity measure to find similarities between pairs of rows in a tabular structure. This similarity measure is utilized to identify a tabular region. Since the tabular regions are detected exploiting the similarities among all rows, the method is inherently independent of layouts of the tabular regions present in the training data. Moreover, the proposed similarity…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Image Retrieval and Classification Techniques · Digital Media Forensic Detection
