Single Shot Self-Reliant Scene Text Spotter by Decoupled yet Collaborative Detection and Recognition
Jingjing Wu, Pengyuan Lyu, Guangming Lu, Chengquan Zhang, and Wenjie, Pei

TL;DR
This paper introduces SRSTS v2, a single shot scene text spotting method that decouples detection and recognition, enabling more accurate recognition even with imprecise detection, through collaborative feature sampling and parallel processing.
Contribution
The novel SRSTS v2 framework allows detection and recognition to operate independently yet collaboratively, reducing error propagation and improving accuracy in scene text spotting.
Findings
Outperforms state-of-the-art methods on four benchmarks.
Recognizes text accurately despite challenging boundary detection.
Decouples detection and recognition for more robust performance.
Abstract
Typical text spotters follow the two-stage spotting paradigm which detects the boundary for a text instance first and then performs text recognition within the detected regions. Despite the remarkable progress of such spotting paradigm, an important limitation is that the performance of text recognition depends heavily on the precision of text detection, resulting in the potential error propagation from detection to recognition. In this work, we propose the single shot Self-Reliant Scene Text Spotter v2 (SRSTS v2), which circumvents this limitation by decoupling recognition from detection while optimizing two tasks collaboratively. Specifically, our SRSTS v2 samples representative feature points around each potential text instance, and conducts both text detection and recognition in parallel guided by these sampled points. Thus, the text recognition is no longer dependent on detection,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Domain Adaptation and Few-Shot Learning · Advanced Neural Network Applications
