LRANet++: Low-Rank Approximation Network for Accurate and Efficient Text Spotting
Yuchen Su, Zhineng Chen, Yongkun Du, Zuxuan Wu, Hongtao Xie, Yu-Gang Jiang

TL;DR
LRANet++ introduces a low-rank shape representation and a triple assignment detection scheme to improve the accuracy and efficiency of end-to-end arbitrary-shaped text spotting.
Contribution
It proposes a novel low-rank approximation method for precise shape detection and a triple assignment detection head for faster inference in text spotting.
Findings
Outperforms state-of-the-art methods on challenging benchmarks.
Achieves high accuracy with reduced inference time.
Effectively handles arbitrary-shaped text detection and recognition.
Abstract
End-to-end text spotting aims to jointly optimize text detection and recognition within a unified framework. Despite significant progress, designing an accurate and efficient end-to-end text spotter for arbitrary-shaped text remains challenging. We identify the primary bottleneck as the lack of a reliable and efficient text detection method. To address this, we propose a novel parameterized text shape representation based on low-rank approximation for precise detection and a triple assignment detection head for fast inference. Specifically, unlike current data-irrelevant shape representation methods, we exploit shape correlations among labeled text boundaries to construct a robust low-rank subspace. By minimizing an -norm objective, we extract orthogonal vectors that capture the intrinsic text shape from noisy annotations, enabling precise reconstruction via the linear…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Topic Modeling · Advanced Neural Network Applications
