Scene Text Recognition with Single-Point Decoding Network

Lei Chen; Haibo Qin; Shi-Xue Zhang; Chun Yang; Xucheng Yin

arXiv:2209.01914·cs.CV·September 7, 2022

Scene Text Recognition with Single-Point Decoding Network

Lei Chen, Haibo Qin, Shi-Xue Zhang, Chun Yang, Xucheng Yin

PDF

Open Access

TL;DR

This paper introduces SPDN, an attention-free scene text recognition network that efficiently decodes characters by sampling key points, significantly reducing computation while maintaining high accuracy.

Contribution

The paper presents a novel single-point decoding network with a sampling module that replaces traditional attention mechanisms for improved efficiency in scene text recognition.

Findings

01

Significantly faster decoding without performance loss

02

Effective key point localization for each character

03

Outperforms attention-based methods on benchmarks

Abstract

In recent years, attention-based scene text recognition methods have been very popular and attracted the interest of many researchers. Attention-based methods can adaptively focus attention on a small area or even single point during decoding, in which the attention matrix is nearly one-hot distribution. Furthermore, the whole feature maps will be weighted and summed by all attention matrices during inference, causing huge redundant computations. In this paper, we propose an efficient attention-free Single-Point Decoding Network (dubbed SPDN) for scene text recognition, which can replace the traditional attention-based decoding network. Specifically, we propose Single-Point Sampling Module (SPSM) to efficiently sample one key point on the feature map for decoding one character. In this way, our method can not only precisely locate the key point of each character but also remove…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHandwritten Text Recognition Techniques · Advanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques