BoundaryNet: An Attentive Deep Network with Fast Marching Distance Maps   for Semi-automatic Layout Annotation

Abhishek Trivedi; Ravi Kiran Sarvadevabhatla

arXiv:2108.09433·cs.CV·August 24, 2021

BoundaryNet: An Attentive Deep Network with Fast Marching Distance Maps for Semi-automatic Layout Annotation

Abhishek Trivedi, Ravi Kiran Sarvadevabhatla

PDF

1 Repo

TL;DR

BoundaryNet is a novel semi-automatic deep learning approach that accurately annotates complex document image regions without resizing, leveraging attention mechanisms and fast marching distance maps for high-quality boundary detection.

Contribution

The paper introduces BoundaryNet, a resizing-free, semi-automatic boundary annotation method combining attention-guided networks and fast marching maps, outperforming existing approaches on complex document images.

Findings

01

Outperforms strong baselines on manuscript datasets

02

Generalizes across multiple document layouts and scripts

03

Provides high annotation throughput in practical systems

Abstract

Precise boundary annotations of image regions can be crucial for downstream applications which rely on region-class semantics. Some document collections contain densely laid out, highly irregular and overlapping multi-class region instances with large range in aspect ratio. Fully automatic boundary estimation approaches tend to be data intensive, cannot handle variable-sized images and produce sub-optimal results for aforementioned images. To address these issues, we propose BoundaryNet, a novel resizing-free approach for high-precision semi-automatic layout annotation. The variable-sized user selected region of interest is first processed by an attention-guided skip network. The network optimization is guided via Fast Marching distance maps to obtain a good quality initial boundary estimate and an associated feature representation. These outputs are processed by a Residual Graph…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ihdia/BoundaryNet
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsBoundaryNet · Convolution