TPSNet: Reverse Thinking of Thin Plate Splines for Arbitrary Shape Scene Text Representation
Wei Wang, Yu Zhou, Jiahao Lv, Dayan Wu, Guoqing Zhao, Ning Jiang,, Weiping Wang

TL;DR
TPSNet introduces a novel scene text detection method using Thin-Plate-Spline (TPS) parameters as a compact, complete, and efficient shape representation, improving arbitrary shape text detection and recognition accuracy.
Contribution
The paper proposes using TPS as a new shape representation for scene text, along with Border Alignment Loss, enabling direct rectification and improved detection and spotting performance.
Findings
Achieves 4.4% higher F-Measure on Art dataset
Achieves 5.0% higher end-to-end spotting F-Measure on Total-Text
Demonstrates effectiveness and superiority over existing methods
Abstract
The research focus of scene text detection and recognition has shifted to arbitrary shape text in recent years, where the text shape representation is a fundamental problem. An ideal representation should be compact, complete, efficient, and reusable for subsequent recognition in our opinion. However, previous representations have flaws in one or more aspects. Thin-Plate-Spline (TPS) transformation has achieved great success in scene text recognition. Inspired by this, we reversely think of its usage and sophisticatedly take TPS as an exquisite representation for arbitrary shape text representation. The TPS representation is compact, complete, and efficient. With the predicted TPS parameters, the detected text region can be directly rectified to a near-horizontal one to assist the subsequent recognition. To further exploit the potential of the TPS representation, the Border Alignment…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Image Retrieval and Classification Techniques · Generative Adversarial Networks and Image Synthesis
