TableSeq: Unified Generation of Structure, Content, and Layout

Laziz Hamdi; Amine Tamasna; Pascal Boisson; Thierry Paquet

arXiv:2604.16070·cs.CV·April 20, 2026

TableSeq: Unified Generation of Structure, Content, and Layout

Laziz Hamdi, Amine Tamasna, Pascal Boisson, Thierry Paquet

PDF

1 Repo

TL;DR

TableSeq is a unified, end-to-end framework for table structure, content, and layout recognition from images, achieving state-of-the-art results with a simple architecture and versatile sequence generation.

Contribution

It introduces a single sequence-generation model that unifies multiple table recognition tasks without external OCR or complex post-processing.

Findings

01

Achieves state-of-the-art results on PubTabNet and FinTabNet benchmarks.

02

Effectively generalizes to index-based table querying tasks.

03

Multi-token prediction reduces inference latency with minimal accuracy loss.

Abstract

We present TableSeq, an image-only, end-to-end framework for joint table structure recognition, content recognition, and cell localization. The model formulates these tasks as a single sequence-generation problem: one decoder produces an interleaved stream of \texttt{HTML} tags, cell text, and discretized coordinate tokens, thereby aligning logical structure, textual content, and cell geometry within a unified autoregressive sequence. This design avoids external OCR, auxiliary decoders, and complex multi-stage post-processing. TableSeq combines a lightweight high-resolution FCN-H16 encoder with a minimal structure-prior head and a single-layer transformer encoder, yielding a compact architecture that remains effective on challenging layouts. Across standard benchmarks, TableSeq achieves competitive or state-of-the-art results while preserving architectural simplicity. It reaches 95.23…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hamdilaziz/TableSeq
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.