One-stage Low-resolution Text Recognition with High-resolution Knowledge Transfer
Hang Guo, Tao Dai, Mingyan Zhu, Guanghao Meng, Bin Chen, Zhi Wang,, Shu-Tao Xia

TL;DR
This paper introduces a one-stage low-resolution text recognition method that transfers high-resolution knowledge to improve accuracy and efficiency, avoiding the need for separate super-resolution steps.
Contribution
It proposes a novel knowledge distillation framework with multi-level transfer, including visual focus, semantic contrastive, and soft logits losses, to enhance low-resolution text recognition.
Findings
Outperforms two-stage super-resolution methods in accuracy and efficiency
Demonstrates robustness across various low-resolution scenarios
Provides a practical, end-to-end recognition pipeline
Abstract
Recognizing characters from low-resolution (LR) text images poses a significant challenge due to the information deficiency as well as the noise and blur in low-quality images. Current solutions for low-resolution text recognition (LTR) typically rely on a two-stage pipeline that involves super-resolution as the first stage followed by the second-stage recognition. Although this pipeline is straightforward and intuitive, it has to use an additional super-resolution network, which causes inefficiencies during training and testing. Moreover, the recognition accuracy of the second stage heavily depends on the reconstruction quality of the first stage, causing ineffectiveness. In this work, we attempt to address these challenges from a novel perspective: adapting the recognizer to low-resolution inputs by transferring the knowledge from the high-resolution. Guided by this idea, we propose…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Processing Techniques and Applications · Advanced Image Processing Techniques · Seismic Imaging and Inversion Techniques
MethodsKnowledge Distillation · Focus
