TextSnake: A Flexible Representation for Detecting Text of Arbitrary Shapes
Shangbang Long, Jiaqiang Ruan, Wenjie Zhang, Xin He, Wenhao Wu, Cong, Yao

TL;DR
TextSnake introduces a flexible, geometry-based representation for scene text detection that effectively captures arbitrary-shaped text, including curved text, achieving state-of-the-art results on multiple benchmarks.
Contribution
The paper proposes TextSnake, a novel representation for scene text detection that models text as a sequence of overlapping disks, enabling detection of arbitrary-shaped text.
Findings
Achieves state-of-the-art performance on Total-Text and SCUT-CTW1500 datasets.
Outperforms baseline by over 40% in F-measure on Total-Text.
Effectively detects curved and arbitrarily shaped text in natural images.
Abstract
Driven by deep neural networks and large scale datasets, scene text detection methods have progressed substantially over the past years, continuously refreshing the performance records on various standard benchmarks. However, limited by the representations (axis-aligned rectangles, rotated rectangles or quadrangles) adopted to describe text, existing methods may fall short when dealing with much more free-form text instances, such as curved text, which are actually very common in real-world scenarios. To tackle this problem, we propose a more flexible representation for scene text, termed as TextSnake, which is able to effectively represent text instances in horizontal, oriented and curved forms. In TextSnake, a text instance is described as a sequence of ordered, overlapping disks centered at symmetric axes, each of which is associated with potentially variable radius and orientation.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Vehicle License Plate Recognition · Image Processing and 3D Reconstruction
