Caveats on Bayesian and hidden-Markov models (v2.8)

Lambert Schomaker

arXiv:1608.05277·cs.LG·January 4, 2017

Caveats on Bayesian and hidden-Markov models (v2.8)

Lambert Schomaker

PDF

Open Access

TL;DR

This paper discusses fundamental issues in applying Bayesian and hidden-Markov models to cursive-script recognition, highlighting error propagation, the Markov assumption's validity, and alternative methods with promising results.

Contribution

It identifies key problems in Bayesian and HMM applications, provides Monte Carlo error analysis, and explores non-Bayesian, non-Markov handwriting recognition methods.

Findings

01

Error propagation follows a Poisson distribution over log probabilities.

02

Basic tests can determine the necessity of modeling serial dependencies.

03

Non-Bayesian, nearest-mean classification yields acceptable handwriting recognition results.

Abstract

This paper describes a number of fundamental and practical problems in the application of hidden-Markov models and Bayes when applied to cursive-script recognition. Several problems, however, will have an effect in other application areas. The most fundamental problem is the propagation of error in the product of probabilities. This is a common and pervasive problem which deserves more attention. On the basis of Monte Carlo modeling, tables for the expected relative error are given. It seems that it is distributed according to a continuous Poisson distribution over log probabilities. A second essential problem is related to the appropriateness of the Markov assumption. Basic tests will reveal whether a problem requires modeling of the stochastics of seriality, at all. Examples are given of lexical encodings which cover 95-99% classification accuracy of a lexicon, with removed sequence…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopological and Geometric Data Analysis · Anomaly Detection Techniques and Applications · Generative Adversarial Networks and Image Synthesis