RAPTOR: Refined Approach for Product Table Object Recognition
Eliott Thomas, Mickael Coustaty, Aurelie Joseph, Gaspar, Deloin, Elodie Carel, Vincent Poulain D'Andecy, Jean-Marc Ogier

TL;DR
RAPTOR is a modular post-processing system that enhances existing table detection and recognition models, significantly improving accuracy for product tables in documents by addressing common errors and optimizing parameters with genetic algorithms.
Contribution
This work introduces RAPTOR, a novel modular post-processing framework that refines state-of-the-art table detection and structure recognition models, tailored for industrial product tables.
Findings
Improved precision and structural accuracy on product table datasets.
Effective optimization of module parameters using genetic algorithms.
Maintains reasonable performance across diverse table formats.
Abstract
Extracting tables from documents is a critical task across various industries, especially on business documents like invoices and reports. Existing systems based on DEtection TRansformer (DETR) such as TAble TRansformer (TATR), offer solutions for Table Detection (TD) and Table Structure Recognition (TSR) but face challenges with diverse table formats and common errors like incorrect area detection and overlapping columns. This research introduces RAPTOR, a modular post-processing system designed to enhance state-of-the-art models for improved table extraction, particularly for product tables. RAPTOR addresses recurrent TD and TSR issues, improving both precision and structural predictions. For TD, we use DETR (trained on ICDAR 2019) and TATR (trained on PubTables-1M and FinTabNet), while TSR only relies on TATR. A Genetic Algorithm is incorporated to optimize RAPTOR's module…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Processing and 3D Reconstruction · Handwritten Text Recognition Techniques · Vehicle License Plate Recognition
MethodsAttention Is All You Need · Absolute Position Encodings · Feedforward Network · Linear Layer · Layer Normalization · Byte Pair Encoding · Dense Connections · Residual Connection · Convolution · Label Smoothing
