PuzzleNet: Scene Text Detection by Segment Context Graph Learning
Hao Liu, Antai Guo, Deqiang Jiang, Yiqing Hu, Bo Ren

TL;DR
PuzzleNet introduces a novel scene text detection approach that leverages segment context graphs and a two-branch graph convolutional network to improve detection accuracy for arbitrary-shaped text regions.
Contribution
It proposes PuzzleNet, combining segment proposals with a context-aware graph convolutional network to enhance scene text detection by modeling appearance and geometry correlations.
Findings
Achieves better or comparable performance on benchmark datasets.
Effectively models segment context for improved detection.
Demonstrates the benefit of context graph exploitation in text detection.
Abstract
Recently, a series of decomposition-based scene text detection methods has achieved impressive progress by decomposing challenging text regions into pieces and linking them in a bottom-up manner. However, most of them merely focus on linking independent text pieces while the context information is underestimated. In the puzzle game, the solver often put pieces together in a logical way according to the contextual information of each piece, in order to arrive at the correct solution. Inspired by it, we propose a novel decomposition-based method, termed Puzzle Networks (PuzzleNet), to address the challenging scene text detection task in this work. PuzzleNet consists of the Segment Proposal Network (SPN) that predicts the candidate text segments fitting arbitrary shape of text region, and the two-branch Multiple-Similarity Graph Convolutional Network (MSGCN) that models both appearance and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Advanced Image and Video Retrieval Techniques · Video Analysis and Summarization
