TableNet A Large-Scale Table Dataset with LLM-Powered Autonomous
Ruilin Zhang, Kai Yang

TL;DR
This paper introduces TableNet, a large-scale table dataset created using an LLM-powered autonomous system that synthesizes and recognizes diverse table images, enhancing table structure recognition research.
Contribution
It presents the first LLM-powered autonomous system for generating and recognizing diverse tables, enabling large-scale dataset creation and improved recognition performance.
Findings
Achieved competitive recognition performance with fewer training samples.
Generated a wide variety of semantically coherent tables with controllable parameters.
Utilized active learning to improve model training efficiency and accuracy.
Abstract
Table Structure Recognition (TSR) requires the logical reasoning ability of large language models (LLMs) to handle complex table layouts, but current datasets are limited in scale and quality, hindering effective use of this reasoning capacity. We thus present TableNet dataset, a new table structure recognition dataset collected and generated through multiple sources. Central to our approach is the first LLM-powered autonomous table generation and recognition multi-agent system that we developed. The generation part of our system integrates controllable visual, structural, and semantic parameters into the synthesis of table images. It facilitates the creation of a wide array of semantically coherent tables, adaptable to user-defined configurations along with annotations, thereby supporting large-scale and detailed dataset construction. This capability enables a comprehensive and nuanced…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
