TL;DR
BoundaryNet is a novel semi-automatic deep learning approach that accurately annotates complex document image regions without resizing, leveraging attention mechanisms and fast marching distance maps for high-quality boundary detection.
Contribution
The paper introduces BoundaryNet, a resizing-free, semi-automatic boundary annotation method combining attention-guided networks and fast marching maps, outperforming existing approaches on complex document images.
Findings
Outperforms strong baselines on manuscript datasets
Generalizes across multiple document layouts and scripts
Provides high annotation throughput in practical systems
Abstract
Precise boundary annotations of image regions can be crucial for downstream applications which rely on region-class semantics. Some document collections contain densely laid out, highly irregular and overlapping multi-class region instances with large range in aspect ratio. Fully automatic boundary estimation approaches tend to be data intensive, cannot handle variable-sized images and produce sub-optimal results for aforementioned images. To address these issues, we propose BoundaryNet, a novel resizing-free approach for high-precision semi-automatic layout annotation. The variable-sized user selected region of interest is first processed by an attention-guided skip network. The network optimization is guided via Fast Marching distance maps to obtain a good quality initial boundary estimate and an associated feature representation. These outputs are processed by a Residual Graph…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsBoundaryNet · Convolution
