Classification error in multiclass discrimination from Markov data
S\"oren Christensen, Albrecht Irle, and Lars Willert

TL;DR
This paper investigates how incorporating past observations in a Markov-dependent classification setting can significantly reduce misclassification risk, especially when using just one previous observation.
Contribution
It demonstrates that including a single preceding observation in Markov data improves classification accuracy and provides theoretical and empirical evidence for this benefit.
Findings
Using one previous observation reduces misclassification risk substantially.
The risk difference decreases exponentially with more past observations.
Practical results show significant improvement in handwritten character classification.
Abstract
As a model for an on-line classification setting we consider a stochastic process , the present time-point being denoted by 0, with observables from which the pattern is to be inferred. So in this classification setting, in addition to the present observation a number of preceding observations may be used for classification, thus taking a possible dependence structure into account as it occurs e.g. in an ongoing classification of handwritten characters. We treat the question how the performance of classifiers is improved by using such additional information. For our analysis, a hidden Markov model is used. Letting denote the minimal risk of misclassification using preceding observations we show that the difference decreases exponentially fast as increases. This…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
