PP-OCRv3: More Attempts for the Improvement of Ultra Lightweight OCR System
Chenxia Li, Weiwei Liu, Ruoyu Guo, Xiaoting Yin, Kaitao Jiang, Yongkun, Du, Yuning Du, Lingfeng Zhu, Baohua Lai, Xiaoguang Hu, Dianhai Yu, Yanjun Ma

TL;DR
PP-OCRv3 significantly enhances lightweight OCR performance by upgrading detection and recognition models with novel modules, strategies, and training techniques, achieving 5% higher accuracy than PP-OCRv2 while maintaining efficiency.
Contribution
The paper introduces PP-OCRv3, a more robust and accurate lightweight OCR system with multiple model improvements and training strategies over previous versions.
Findings
PP-OCRv3 achieves 5% higher hmean than PP-OCRv2.
Introduces LK-PAN and RSE-FPN modules for better text detection.
Employs SVTR LCNet and TextConAug for improved text recognition.
Abstract
Optical character recognition (OCR) technology has been widely used in various scenes, as shown in Figure 1. Designing a practical OCR system is still a meaningful but challenging task. In previous work, considering the efficiency and accuracy, we proposed a practical ultra lightweight OCR system (PP-OCR), and an optimized version PP-OCRv2. In order to further improve the performance of PP-OCRv2, a more robust OCR system PP-OCRv3 is proposed in this paper. PP-OCRv3 upgrades the text detection model and text recognition model in 9 aspects based on PP-OCRv2. For text detector, we introduce a PAN module with large receptive field named LK-PAN, a FPN module with residual attention mechanism named RSE-FPN, and DML distillation strategy. For text recognizer, the base model is replaced from CRNN to SVTR, and we introduce lightweight text recognition network SVTR LCNet, guided training of CTC…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Image Retrieval and Classification Techniques · Advanced Image and Video Retrieval Techniques
MethodsConvolution · Balanced Selection · 1x1 Convolution · Feature Pyramid Network
