PixelLink: Detecting Scene Text via Instance Segmentation

Dan Deng; Haifeng Liu; Xuelong Li; Deng Cai

arXiv:1801.01315·cs.CV·January 8, 2018·53 cites

PixelLink: Detecting Scene Text via Instance Segmentation

Dan Deng, Haifeng Liu, Xuelong Li, Deng Cai

PDF

Open Access 5 Repos

TL;DR

PixelLink introduces an instance segmentation approach for scene text detection that links pixels within the same text instance, enabling direct extraction of bounding boxes without regression, leading to improved efficiency and comparable accuracy.

Contribution

The paper presents PixelLink, a novel instance segmentation method for scene text detection that eliminates the need for bounding box regression, simplifying the detection process.

Findings

01

Achieves better or comparable performance on benchmarks

02

Requires fewer training iterations

03

Uses less training data

Abstract

Most state-of-the-art scene text detection algorithms are deep learning based methods that depend on bounding box regression and perform at least two kinds of predictions: text/non-text classification and location regression. Regression plays a key role in the acquisition of bounding boxes in these methods, but it is not indispensable because text/non-text prediction can also be considered as a kind of semantic segmentation that contains full location information in itself. However, text instances in scene images often lie very close to each other, making them very difficult to separate via semantic segmentation. Therefore, instance segmentation is needed to address this problem. In this paper, PixelLink, a novel scene text detection algorithm based on instance segmentation, is proposed. Text instances are first segmented out by linking pixels within the same instance together. Text…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHandwritten Text Recognition Techniques · Image Processing and 3D Reconstruction · Vehicle License Plate Recognition