End-to-End Semi-Supervised approach with Modulated Object Queries for Table Detection in Documents
Iqraa Ehsan, Tahira Shehzadi, Didier Stricker, Muhammad Zeshan Afzal

TL;DR
This paper introduces a novel semi-supervised transformer-based method for table detection in documents, significantly improving training efficiency and accuracy over previous approaches by enhancing pseudo-label quality.
Contribution
It proposes a new matching strategy combining one-to-one and one-to-many assignments, leading to state-of-the-art results in semi-supervised table detection.
Findings
Achieves 95.7% mAP on TableBank with 30% labels
Surpasses previous methods by 7.4-7.6 points in mAP
Demonstrates superior pseudo-label quality and training efficiency
Abstract
Table detection, a pivotal task in document analysis, aims to precisely recognize and locate tables within document images. Although deep learning has shown remarkable progress in this realm, it typically requires an extensive dataset of labeled data for proficient training. Current CNN-based semi-supervised table detection approaches use the anchor generation process and Non-Maximum Suppression (NMS) in their detection process, limiting training efficiency. Meanwhile, transformer-based semi-supervised techniques adopted a one-to-one match strategy that provides noisy pseudo-labels, limiting overall efficiency. This study presents an innovative transformer-based semi-supervised table detector. It improves the quality of pseudo-labels through a novel matching strategy combining one-to-one and one-to-many assignment techniques. This approach significantly enhances training efficiency…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Data Quality and Management · Topic Modeling
