Learning Spatial-Semantic Context with Fully Convolutional Recurrent   Network for Online Handwritten Chinese Text Recognition

Zecheng Xie; Zenghui Sun; Lianwen Jin; Hao Ni; Terry Lyons

arXiv:1610.02616·cs.CV·May 26, 2017·2 cites

Learning Spatial-Semantic Context with Fully Convolutional Recurrent Network for Online Handwritten Chinese Text Recognition

Zecheng Xie, Zenghui Sun, Lianwen Jin, Hao Ni, Terry Lyons

PDF

Open Access

TL;DR

This paper introduces a novel fully convolutional recurrent network that leverages path signature features and an implicit language model to improve online handwritten Chinese text recognition, achieving state-of-the-art accuracy.

Contribution

It proposes a new multi-spatial-context network and an implicit language model that effectively handle segmentation and semantic context in Chinese handwriting recognition.

Findings

01

Achieved 97.10% and 97.15% accuracy on two benchmarks.

02

Outperformed all previous methods in recognition accuracy.

03

Successfully integrated semantic context for improved recognition.

Abstract

Online handwritten Chinese text recognition (OHCTR) is a challenging problem as it involves a large-scale character set, ambiguous segmentation, and variable-length input sequences. In this paper, we exploit the outstanding capability of path signature to translate online pen-tip trajectories into informative signature feature maps using a sliding window-based method, successfully capturing the analytic and geometric properties of pen strokes with strong local invariance and robustness. A multi-spatial-context fully convolutional recurrent network (MCFCRN) is proposed to exploit the multiple spatial contexts from the signature feature maps and generate a prediction sequence while completely avoiding the difficult segmentation problem. Furthermore, an implicit language model is developed to make predictions based on semantic context within a predicting feature sequence, providing a new…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHandwritten Text Recognition Techniques · Natural Language Processing Techniques · Image Processing and 3D Reconstruction