Parsimonious HMMs for Offline Handwritten Chinese Text Recognition

Wenchao Wang; Jun Du; Zi-Rui Wang

arXiv:1808.04138·cs.CV·August 14, 2018

Parsimonious HMMs for Offline Handwritten Chinese Text Recognition

Wenchao Wang, Jun Du, Zi-Rui Wang

PDF

Open Access

TL;DR

This paper introduces parsimonious HMMs with state tying for offline handwritten Chinese text recognition, reducing model size and decoding time while improving accuracy by leveraging character similarities.

Contribution

The paper proposes a novel state tying approach for HMMs that enhances model efficiency and accuracy in Chinese text recognition, using data-driven algorithms and deep neural networks.

Findings

01

Achieved 6.2% relative CER reduction with DNN-HMMs.

02

Reduced model size by 25% and decoding time by 60%.

03

Outperformed conventional DNN-HMMs in compact settings.

Abstract

Recently, hidden Markov models (HMMs) have achieved promising results for offline handwritten Chinese text recognition. However, due to the large vocabulary of Chinese characters with each modeled by a uniform and fixed number of hidden states, a high demand of memory and computation is required. In this study, to address this issue, we present parsimonious HMMs via the state tying which can fully utilize the similarities among different Chinese characters. Two-step algorithm with the data-driven question-set is adopted to generate the tied-state pool using the likelihood measure. The proposed parsimonious HMMs with both Gaussian mixture models (GMMs) and deep neural networks (DNNs) as the emission distributions not only lead to a compact model but also improve the recognition accuracy via the data sharing for the tied states and the confusion decreasing among state classes. Tested on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHandwritten Text Recognition Techniques · Speech Recognition and Synthesis · Natural Language Processing Techniques