TL;DR
This paper introduces Multi-Type-TD-TSR, an end-to-end multi-stage pipeline leveraging deep learning for detecting and recognizing table structures in document images, addressing real-world challenges like noise and rotation.
Contribution
The paper presents a novel multi-stage pipeline that combines deep learning for table detection with deterministic algorithms for structure recognition, differentiating table types and handling unbordered and bordered tables.
Findings
Achieved state-of-the-art results on ICDAR 2019 dataset.
Effectively distinguishes three table types based on borders.
Handles rotated and noisy document images robustly.
Abstract
As global trends are shifting towards data-driven industries, the demand for automated algorithms that can convert digital images of scanned documents into machine readable information is rapidly growing. Besides the opportunity of data digitization for the application of data analytic tools, there is also a massive improvement towards automation of processes, which previously would require manual inspection of the documents. Although the introduction of optical character recognition technologies mostly solved the task of converting human-readable characters from images into machine-readable characters, the task of extracting table semantics has been less focused on over the years. The recognition of tables consists of two main tasks, namely table detection and table structure recognition. Most prior work on this problem focuses on either task without offering an end-to-end solution or…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
