Parsimonious HMMs for Offline Handwritten Chinese Text Recognition
Wenchao Wang, Jun Du, Zi-Rui Wang

TL;DR
This paper introduces parsimonious HMMs with state tying for offline handwritten Chinese text recognition, reducing model size and decoding time while improving accuracy by leveraging character similarities.
Contribution
The paper proposes a novel state tying approach for HMMs that enhances model efficiency and accuracy in Chinese text recognition, using data-driven algorithms and deep neural networks.
Findings
Achieved 6.2% relative CER reduction with DNN-HMMs.
Reduced model size by 25% and decoding time by 60%.
Outperformed conventional DNN-HMMs in compact settings.
Abstract
Recently, hidden Markov models (HMMs) have achieved promising results for offline handwritten Chinese text recognition. However, due to the large vocabulary of Chinese characters with each modeled by a uniform and fixed number of hidden states, a high demand of memory and computation is required. In this study, to address this issue, we present parsimonious HMMs via the state tying which can fully utilize the similarities among different Chinese characters. Two-step algorithm with the data-driven question-set is adopted to generate the tied-state pool using the likelihood measure. The proposed parsimonious HMMs with both Gaussian mixture models (GMMs) and deep neural networks (DNNs) as the emission distributions not only lead to a compact model but also improve the recognition accuracy via the data sharing for the tied states and the confusion decreasing among state classes. Tested on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Speech Recognition and Synthesis · Natural Language Processing Techniques
