DATa: Domain Adaptation-Aided Deep Table Detection Using Visual-Lexical Representations
Hyebin Kwon, Joungbin An, Dongwoo Lee, Won-Yong Shin

TL;DR
DATa is a novel domain adaptation method for deep table detection that improves performance in target domains with limited labels by combining visual and lexical features and re-training with an augmented model.
Contribution
It introduces a new domain adaptation approach using lexical features and an augmented model to enhance deep table detection in new domains with few labels.
Findings
Outperforms existing visual-only methods in target domain detection accuracy.
Effectively reduces false positives and false negatives through confidence score optimization.
Demonstrates significant improvements on real-world benchmark datasets.
Abstract
Considerable research attention has been paid to table detection by developing not only rule-based approaches reliant on hand-crafted heuristics but also deep learning approaches. Although recent studies successfully perform table detection with enhanced results, they often experience performance degradation when they are used for transferred domains whose table layout features might differ from the source domain in which the underlying model has been trained. To overcome this problem, we present DATa, a novel Domain Adaptation-aided deep Table detection method that guarantees satisfactory performance in a specific target domain where few trusted labels are available. To this end, we newly design lexical features and an augmented model used for re-training. More specifically, after pre-training one of state-of-the-art vision-based models as our backbone network, we re-train our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Infrastructure Maintenance and Monitoring · Multimodal Machine Learning Applications
