Multi-Type-TD-TSR -- Extracting Tables from Document Images using a   Multi-stage Pipeline for Table Detection and Table Structure Recognition:   from OCR to Structured Table Representations

Pascal Fischer; Alen Smajic; Alexander Mehler; Giuseppe Abrami

arXiv:2105.11021·cs.CV·May 25, 2021

Multi-Type-TD-TSR -- Extracting Tables from Document Images using a Multi-stage Pipeline for Table Detection and Table Structure Recognition: from OCR to Structured Table Representations

Pascal Fischer, Alen Smajic, Alexander Mehler, Giuseppe Abrami

PDF

1 Repo

TL;DR

This paper introduces Multi-Type-TD-TSR, an end-to-end multi-stage pipeline leveraging deep learning for detecting and recognizing table structures in document images, addressing real-world challenges like noise and rotation.

Contribution

The paper presents a novel multi-stage pipeline that combines deep learning for table detection with deterministic algorithms for structure recognition, differentiating table types and handling unbordered and bordered tables.

Findings

01

Achieved state-of-the-art results on ICDAR 2019 dataset.

02

Effectively distinguishes three table types based on borders.

03

Handles rotated and noisy document images robustly.

Abstract

As global trends are shifting towards data-driven industries, the demand for automated algorithms that can convert digital images of scanned documents into machine readable information is rapidly growing. Besides the opportunity of data digitization for the application of data analytic tools, there is also a massive improvement towards automation of processes, which previously would require manual inspection of the documents. Although the introduction of optical character recognition technologies mostly solved the task of converting human-readable characters from images into machine-readable characters, the task of extracting table semantics has been less focused on over the years. The recognition of tables consists of two main tasks, namely table detection and table structure recognition. Most prior work on this problem focuses on either task without offering an end-to-end solution or…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Psarpei/Multi-Type-TD-TSR
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.