Robust Table Detection and Structure Recognition from Heterogeneous Document Images
Chixiang Ma, Weihong Lin, Lei Sun, Qiang Huo

TL;DR
This paper presents RobusTabNet, a robust method for table detection and structure recognition in diverse document images, achieving state-of-the-art results through innovative use of CornerNet and spatial CNN techniques.
Contribution
The paper introduces a novel combination of CornerNet with Faster R-CNN for improved table detection and a spatial CNN-based approach for accurate table structure recognition, handling complex and distorted tables.
Findings
Achieves state-of-the-art performance on multiple benchmarks.
Effectively recognizes complex, distorted, and curved tables.
Demonstrates robustness on challenging in-house datasets.
Abstract
We introduce a new table detection and structure recognition approach named RobusTabNet to detect the boundaries of tables and reconstruct the cellular structure of each table from heterogeneous document images. For table detection, we propose to use CornerNet as a new region proposal network to generate higher quality table proposals for Faster R-CNN, which has significantly improved the localization accuracy of Faster R-CNN for table detection. Consequently, our table detection approach achieves state-of-the-art performance on three public table detection benchmarks, namely cTDaR TrackA, PubLayNet and IIIT-AR-13K, by only using a lightweight ResNet-18 backbone network. Furthermore, we propose a new split-and-merge based table structure recognition approach, in which a novel spatial CNN based separation line prediction module is proposed to split each detected table into a grid of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsRoIPool · Region Proposal Network · Convolution · 1x1 Convolution · Softmax · Faster R-CNN · *Communicated@Fast*How Do I Communicate to Expedia? · Corner Pooling · Residual Connection · Max Pooling
