Polygon-free: Unconstrained Scene Text Detection with Box Annotations
Weijia Wu, Enze Xie, Ruimao Zhang, Wenhai Wang, Hong Zhou, Ping Luo

TL;DR
This paper introduces Polygon-free, a text detection system that uses only bounding box annotations and knowledge transfer from synthetic data, achieving high accuracy comparable to polygon-based methods while significantly reducing labeling costs.
Contribution
It proposes a novel approach combining a segmentation network and knowledge transfer to enable polygon-free scene text detection using only bounding box annotations.
Findings
Achieves 80.5% F-score on TotalText without polygon annotations.
Outperforms fully supervised methods trained with only bounding boxes.
Reduces labeling costs by over 80%.
Abstract
Although a polygon is a more accurate representation than an upright bounding box for text detection, the annotations of polygons are extremely expensive and challenging. Unlike existing works that employ fully-supervised training with polygon annotations, this study proposes an unconstrained text detection system termed Polygon-free (PF), in which most existing polygon-based text detectors (e.g., PSENet [33],DB [16]) are trained with only upright bounding box annotations. Our core idea is to transfer knowledge from synthetic data to real data to enhance the supervision information of upright bounding boxes. This is made possible with a simple segmentation network, namely Skeleton Attention Segmentation Network (SASN), that includes three vital components (i.e., channel attention, spatial attention and skeleton attention map) and one soft cross-entropy loss. Experiments demonstrate that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Multimodal Machine Learning Applications · Advanced Neural Network Applications
