Grab What You Need: Rethinking Complex Table Structure Recognition with Flexible Components Deliberation
Hao Liu, Xin Li, Mingming Gong, Bing Liu, Yunfei Wu, Deqiang Jiang,, Yinsong Liu, Xing Sun

TL;DR
This paper introduces GrabTab, a novel method for complex table structure recognition that leverages multiple components efficiently, outperforming existing methods especially in challenging scenarios without complex post-processing.
Contribution
The paper proposes GrabTab with a Component Deliberator, enabling flexible handling of complex tables and reducing reliance on post-processing, advancing TSR capabilities.
Findings
Significantly outperforms state-of-the-art methods on public benchmarks.
Effective in handling unregularized and complex table structures.
Reduces post-processing complexity in TSR tasks.
Abstract
Recently, Table Structure Recognition (TSR) task, aiming at identifying table structure into machine readable formats, has received increasing interest in the community. While impressive success, most single table component-based methods can not perform well on unregularized table cases distracted by not only complicated inner structure but also exterior capture distortion. In this paper, we raise it as Complex TSR problem, where the performance degeneration of existing methods is attributable to their inefficient component usage and redundant post-processing. To mitigate it, we shift our perspective from table component extraction towards the efficient multiple components leverage, which awaits further exploration in the field. Specifically, we propose a seminal method, termed GrabTab, equipped with newly proposed Component Deliberator. Thanks to its progressive deliberation mechanism,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Currency Recognition and Detection · Digital Media Forensic Detection
