PuzzleNet: Scene Text Detection by Segment Context Graph Learning

Hao Liu; Antai Guo; Deqiang Jiang; Yiqing Hu; Bo Ren

arXiv:2002.11371·cs.CV·February 27, 2020·6 cites

TL;DR

PuzzleNet introduces a novel scene text detection approach that leverages segment context graphs and a two-branch graph convolutional network to improve detection accuracy for arbitrary-shaped text regions.

Contribution

The paper proposes PuzzleNet, which combines segment proposals with a context-aware graph convolutional network to improve scene text detection by modeling appearance and geometry correlations.

Findings

01. Achieves better or comparable performance on benchmark datasets.
02. Effectively models segment context for improved detection.
03. Demonstrates the benefit of context graph exploitation in text detection.

Abstract

Recently, a series of decomposition-based scene text detection methods has achieved impressive progress by decomposing challenging text regions into pieces and linking them in a bottom-up manner. However, most of them focus merely on linking independent text pieces, while the context information is underestimated. In a puzzle game, the solver often puts pieces together logically according to the contextual information of each piece in order to arrive at the correct solution. Inspired by this, we propose a novel decomposition-based method, termed Puzzle Networks (PuzzleNet), to address the challenging scene text detection task. PuzzleNet consists of the Segment Proposal Network (SPN), which predicts candidate text segments fitting text regions of arbitrary shape, and the two-branch Multiple-Similarity Graph Convolutional Network (MSGCN), which models both appearance and…
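The idea of propagating context over a graph of text segments through two similarity branches can be illustrated with a minimal sketch. This is not the authors' MSGCN: the cosine appearance similarity, the Gaussian kernel on box centers, the `(cx, cy, w, h)` box format, the absence of learned weights, and all function names are assumptions made for illustration only.

```python
import math


def cosine_similarity(u, v):
    """Cosine similarity of two feature vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u > 0 and norm_v > 0 else 0.0


def geometry_similarity(box_a, box_b):
    """Gaussian kernel on center distance, scaled by the mean box height.

    Boxes are (cx, cy, w, h) with positive heights. The kernel is an assumed
    stand-in, not the similarity defined in the paper.
    """
    dist = math.hypot(box_a[0] - box_b[0], box_a[1] - box_b[1])
    mean_height = (box_a[3] + box_b[3]) / 2.0
    return math.exp(-((dist / mean_height) ** 2))


def row_normalize(matrix):
    """Scale each row to sum to 1; all-zero rows are returned unchanged."""
    out = []
    for row in matrix:
        total = sum(row)
        out.append([x / total for x in row] if total > 0 else list(row))
    return out


def propagate(adjacency, features):
    """One aggregation step: each node takes a weighted sum of all node features."""
    dim = len(features[0])
    return [
        [sum(w * feat[k] for w, feat in zip(weights, features)) for k in range(dim)]
        for weights in adjacency
    ]


def two_branch_layer(features, boxes):
    """Aggregate segment features over an appearance graph and a geometry graph.

    Learned projection weights and nonlinearities are omitted; the two branch
    outputs are simply concatenated per segment.
    """
    n = len(features)
    app_adj = row_normalize(
        [[max(cosine_similarity(features[i], features[j]), 0.0) for j in range(n)]
         for i in range(n)]
    )
    geo_adj = row_normalize(
        [[geometry_similarity(boxes[i], boxes[j]) for j in range(n)] for i in range(n)]
    )
    app_out = propagate(app_adj, features)
    geo_out = propagate(geo_adj, features)
    return [a + g for a, g in zip(app_out, geo_out)]


# Hypothetical toy input: three segments, two of them adjacent and similar.
features = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0]]
boxes = [(10, 10, 8, 8), (18, 10, 8, 8), (100, 60, 8, 8)]
context = two_branch_layer(features, boxes)
print(len(context), len(context[0]))
```

In this sketch, each segment ends up with a feature vector twice its original length: one half averaged over segments that look alike, the other over segments that lie nearby. A real model would apply learned weights to each branch before combining them.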

Taxonomy

Topics: Handwritten Text Recognition Techniques · Advanced Image and Video Retrieval Techniques · Video Analysis and Summarization