Handwritten digit string recognition by combination of residual network and RNN-CTC
Hongjian Zhan, Qingqing Wang, Yue Lu

TL;DR
This paper introduces a novel end-to-end neural network combining residual CNN, RNN, and CTC for handwritten digit string recognition, achieving state-of-the-art accuracy and demonstrating effectiveness on long string sequences.
Contribution
The paper presents a new architecture integrating residual networks with RNN and CTC for improved handwritten digit string recognition.
Findings
Achieved 89.75% and 91.14% recognition rates on ORAND-CAR datasets.
Demonstrated effectiveness on longer string sequences in captcha data.
Proposed model outperforms previous methods in accuracy.
Abstract
Recurrent neural network (RNN) and connectionist temporal classification (CTC) have showed successes in many sequence labeling tasks with the strong ability of dealing with the problems where the alignment between the inputs and the target labels is unknown. Residual network is a new structure of convolutional neural network and works well in various computer vision tasks. In this paper, we take advantage of the architectures mentioned above to create a new network for handwritten digit string recognition. First we design a residual network to extract features from input images, then we employ a RNN to model the contextual information within feature sequences and predict recognition results. At the top of this network, a standard CTC is applied to calculate the loss and yield the final results. These three parts compose an end-to-end trainable network. The proposed new architecture…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Music and Audio Processing · Digital Media Forensic Detection
