Post-Processing Mask-Based Table Segmentation for Structural Coordinate Extraction
Suren Bandara

TL;DR
This paper introduces a multi-scale signal-processing approach for robust table segment boundary detection in noisy, low-resolution images, enhancing structured data extraction accuracy from document images.
Contribution
It proposes a novel multi-scale Gaussian convolution method for detecting table edges from masks, improving robustness and accuracy over existing image-based approaches.
Findings
Improved CASA score from 67% to 76% on PubLayNet-1M.
Enhanced robustness to resolution variations.
Effective suppression of noise while preserving structural edges.
Abstract
Structured data extraction from tables plays a crucial role in document image analysis for scanned documents and digital archives. Although many methods have been proposed to detect table structures and extract cell contents, accurately identifying table segment boundaries (rows and columns) remains challenging, particularly in low-resolution or noisy images. In many real-world scenarios, table data are incomplete or degraded, limiting the adaptability of transformer-based methods to noisy inputs. Mask-based edge detection techniques have shown greater robustness under such conditions, as their sensitivity can be adjusted through threshold tuning; however, existing approaches typically apply masks directly to images, leading to noise sensitivity, resolution loss, or high computational cost. This paper proposes a novel multi-scale signal-processing method for detecting table edges from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Image Retrieval and Classification Techniques · Currency Recognition and Detection
