TL;DR
FastTab is a fast, grid-centric table structure recognition model that combines lightweight global reasoning and long-range dependency capture, achieving competitive results with low latency across multiple benchmarks.
Contribution
It introduces a novel combination of a Tiny Recursive Module and axial 1D Transformers for efficient table recognition without autoregressive decoding.
Findings
FastTab achieves competitive structure recovery performance on four benchmarks.
The model operates at low-latency inference suitable for real-time applications.
It demonstrates robustness under pixel-level anonymisation and extends to curved separators.
Abstract
Table structure recognition (TSR) requires both table-level coherence (row/column counts, headers, spanning cells) and precise separator localization. We introduce FastTab, a grid-centric TSR model that avoids autoregressive HTML decoding by combining (i) a lightweight Tiny Recursive Module (TRM) for global reasoning and (ii) axial 1D Transformer encoders that capture long-range dependencies along rows and columns. The model predicts row/column counts, header rows, and separators to construct a grid, then infers rowspan/colspan using ROI-aligned cell features. Across four benchmarks (PubTabNet, FinTabNet, PubTables-1M, and SciTSR), FastTab achieves competitive structure recovery performance while operating at low-latency inference. We further study robustness under pixel-level anonymisation and show an extension to curved separators for camera-captured documents. The source code will be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
