Adaptive Embedding Gate for Attention-Based Scene Text Recognition
Xiaoxue Chen, Tianwei Wang, Yuanzhi Zhu, Lianwen Jin, Canjie Luo

TL;DR
This paper introduces the adaptive embedding gate (AEG), a novel module for attention-based scene text recognition that improves accuracy and robustness by better modeling character dependencies.
Contribution
The paper proposes the adaptive embedding gate (AEG), a new module that enhances attention mechanisms with high-order language modeling for scene text recognition.
Findings
AEG significantly improves recognition accuracy on multiple benchmarks.
AEG enhances robustness against challenging scene text conditions.
AEG can be integrated into existing attention-based models with ease.
Abstract
Scene text recognition has attracted particular research interest because it is a very challenging problem and has various applications. The most cutting-edge methods are attentional encoder-decoder frameworks that learn the alignment between the input image and output sequences. In particular, the decoder recurrently outputs predictions, using the prediction of the previous step as a guidance for every time step. In this study, we point out that the inappropriate use of previous predictions in existing attention mechanisms restricts the recognition performance and brings instability. To handle this problem, we propose a novel module, namely adaptive embedding gate(AEG). The proposed AEG focuses on introducing high-order character language models to attention mechanism by controlling the information transmission between adjacent characters. AEG is a flexible module and can be easily…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Natural Language Processing Techniques · Topic Modeling
