Classification error in multiclass discrimination from Markov data

Sören Christensen, Albrecht Irle, and Lars Willert

arXiv:1509.06673·stat.ML·July 7, 2017


TL;DR

This paper studies how much the risk of misclassification in a hidden Markov classification setting is reduced by using preceding observations in addition to the present one, and finds that even a single preceding observation can reduce it substantially.

Contribution

It demonstrates that including a single preceding observation in Markov data improves classification accuracy and provides theoretical and empirical evidence for this benefit.

Findings

01

Using one previous observation reduces misclassification risk substantially.

02

The difference $\sup_{k} |R_{l} - R_{l+k}|$ between minimal misclassification risks decreases exponentially fast as the number $l$ of preceding observations increases.

03

Practical results show significant improvement in handwritten character classification.

Abstract

As a model for an on-line classification setting we consider a stochastic process $(X_{-n}, Y_{-n})_{n}$, the present time-point being denoted by 0, with observables $\dots, X_{-n}, X_{-n+1}, \dots, X_{-1}, X_{0}$ from which the pattern $Y_{0}$ is to be inferred. So in this classification setting, in addition to the present observation $X_{0}$ a number $l$ of preceding observations may be used for classification, thus taking a possible dependence structure into account as it occurs e.g. in an ongoing classification of handwritten characters. We treat the question how the performance of classifiers is improved by using such additional information. For our analysis, a hidden Markov model is used. Letting $R_{l}$ denote the minimal risk of misclassification using $l$ preceding observations we show that the difference $\sup_{k} |R_{l} - R_{l+k}|$ decreases exponentially fast as $l$ increases. This…
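To make the quantity $R_{l}$ concrete, the following is a minimal sketch that computes it exactly for a toy hidden Markov model. Everything specific here is an assumption for illustration and is not taken from the paper: the two-class transition matrix `P`, the three-symbol emission matrix `B`, the restriction to finitely many observation values, and the helper names `stationary_distribution` and `bayes_risk`. The hidden chain is started in its stationary distribution, and $R_{l}$ is obtained by enumerating every observation sequence $(x_{-l}, \dots, x_{0})$ and summing the largest joint probability $\max_{y} P(Y_{0} = y, x_{-l}, \dots, x_{0})$, which is the probability that the optimal classifier is correct.

```python
import itertools
import numpy as np


def stationary_distribution(P):
    """Stationary distribution of an ergodic transition matrix P (rows sum to 1)."""
    eigvals, eigvecs = np.linalg.eig(P.T)
    # Pick the left eigenvector belonging to the eigenvalue closest to 1.
    v = np.real(eigvecs[:, np.argmin(np.abs(eigvals - 1.0))])
    return v / v.sum()


def bayes_risk(P, B, l):
    """Exact minimal risk of misclassifying Y_0 from X_{-l}, ..., X_0.

    P[i, j] is the probability of moving from class i to class j,
    B[i, x] is the probability of observing symbol x in class i.
    The cost is exponential in l, so this is only meant for small l.
    """
    pi = stationary_distribution(P)
    n_symbols = B.shape[1]
    prob_correct = 0.0
    for xs in itertools.product(range(n_symbols), repeat=l + 1):
        # alpha[y] = P(Y_t = y, observations up to time t), forward recursion.
        alpha = pi * B[:, xs[0]]
        for x in xs[1:]:
            alpha = (alpha @ P) * B[:, x]
        # The optimal rule picks the class with the largest joint probability.
        prob_correct += alpha.max()
    return 1.0 - prob_correct


# Hypothetical toy model: two "sticky" classes, the middle symbol is ambiguous.
P = np.array([[0.9, 0.1],
              [0.1, 0.9]])
B = np.array([[0.6, 0.3, 0.1],
              [0.1, 0.3, 0.6]])

risks = [bayes_risk(P, B, l) for l in range(6)]
for l, r in enumerate(risks):
    print(f"R_{l} = {r:.6f}")
```

In this toy model $R_{0} = 0.25$ and $R_{1} = 0.19$: when the present observation is the ambiguous symbol, the preceding observation decides the class. The later values can only stay equal or decrease, since a classifier with more observations can always ignore the extra ones. The numbers illustrate the definition only and say nothing about the paper's own experiments.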
