LISTER: Neighbor Decoding for Length-Insensitive Scene Text Recognition
Changxu Cheng, Peng Wang, Cheng Da, Qi Zheng, Cong Yao

TL;DR
LISTER introduces a neighbor decoding approach with feature enhancement to achieve robust scene text recognition across varying lengths, especially excelling in recognizing longer texts and extrapolating length without prior length information.
Contribution
The paper presents the first effective method for length-insensitive scene text recognition, combining neighbor decoding and feature enhancement for improved long text recognition.
Findings
Superior performance on long text recognition
Effective length extrapolation capability
Outperforms previous state-of-the-art methods on benchmarks
Abstract
The diversity in length constitutes a significant characteristic of text. Due to the long-tail distribution of text lengths, most existing methods for scene text recognition (STR) only work well on short or seen-length text, lacking the capability of recognizing longer text or performing length extrapolation. This is a crucial issue, since the lengths of the text to be recognized are usually not given in advance in real-world applications, but it has not been adequately investigated in previous works. Therefore, we propose in this paper a method called Length-Insensitive Scene TExt Recognizer (LISTER), which remedies the limitation regarding the robustness to various text lengths. Specifically, a Neighbor Decoder is proposed to obtain accurate character attention maps with the assistance of a novel neighbor matrix regardless of the text lengths. Besides, a Feature Enhancement Module is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Image Retrieval and Classification Techniques · Advanced Image and Video Retrieval Techniques
