High Performance Offline Handwritten Chinese Character Recognition Using GoogLeNet and Directional Feature Maps
Zhuoyao Zhong, Lianwen Jin, Zecheng Xie

TL;DR
This paper introduces a deep, efficient GoogLeNet-based CNN architecture combined with traditional directional features for offline handwritten Chinese character recognition, achieving state-of-the-art accuracy with fewer parameters.
Contribution
The paper designs a 19-layer streamlined GoogLeNet model tailored for HCCR, demonstrating that deeper networks with traditional features significantly improve recognition performance.
Findings
Achieved 96.74% accuracy on ICDAR 2013 dataset.
Proposed model outperforms previous methods with fewer parameters.
Traditional directional features enhance CNN performance.
Abstract
Just like its great success in solving many computer vision problems, the convolutional neural networks (CNN) provided new end-to-end approach to handwritten Chinese character recognition (HCCR) with very promising results in recent years. However, previous CNNs so far proposed for HCCR were neither deep enough nor slim enough. We show in this paper that, a deeper architecture can benefit HCCR a lot to achieve higher performance, meanwhile can be designed with less parameters. We also show that the traditional feature extraction methods, such as Gabor or gradient feature maps, are still useful for enhancing the performance of CNN. We design a streamlined version of GoogLeNet [13], which was original proposed for image classification in recent years with very deep architecture, for HCCR (denoted as HCCR-GoogLeNet). The HCCR-GoogLeNet we used is 19 layers deep but involves with only 7.26…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Image Processing and 3D Reconstruction · Image Retrieval and Classification Techniques
Methods1x1 Convolution · Convolution · Average Pooling · Local Response Normalization · Auxiliary Classifier · Inception Module · *Communicated@Fast*How Do I Communicate to Expedia? · Dropout · Dense Connections · Max Pooling
