Ultra Light OCR Competition Technical Report

Shuhan Zhang; Yuxin Zou; Tianhe Wang; Yichao Xiong

arXiv:2110.12623·cs.CV·October 26, 2021

Ultra Light OCR Competition Technical Report

Shuhan Zhang, Yuxin Zou, Tianhe Wang, Yichao Xiong

PDF

Open Access

TL;DR

This paper reports on the Ultra Light OCR Competition focusing on Chinese scene text recognition within a 10M model size limit, proposing effective methods that achieved second place with 81.7% accuracy.

Contribution

It introduces a general and effective approach for Chinese scene text recognition balancing model size and accuracy, validated through competitive results.

Findings

01

Achieved 81.7% accuracy in TestB dataset

02

Developed a method balancing model scale and recognition performance

03

Secured second place among over 100 teams

Abstract

Ultra Light OCR Competition is a Chinese scene text recognition competition jointly organized by CSIG (China Society of Image and Graphics) and Baidu, Inc. In addition to focusing on common problems in Chinese scene text recognition, such as long text length and massive characters, we need to balance the trade-off of model scale and accuracy since the model size limitation in the competition is 10M. From experiments in aspects of data, model, training, etc, we proposed a general and effective method for Chinese scene text recognition, which got us second place among over 100 teams with accuracy 0.817 in TestB dataset. The code is available at https://aistudio.baidu.com/aistudio/projectdetail/2159102.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHandwritten Text Recognition Techniques · Image Processing and 3D Reconstruction · Image Retrieval and Classification Techniques