On the Convergence Speed of MDL Predictions for Bernoulli Sequences

Jan Poland; Marcus Hutter

arXiv:cs/0407039·cs.LG·July 16, 2007

On the Convergence Speed of MDL Predictions for Bernoulli Sequences

Jan Poland, Marcus Hutter

PDF

Open Access

TL;DR

This paper analyzes the convergence speed of MDL-based predictions for Bernoulli sequences, providing new bounds on prediction error and demonstrating near-Bayes performance for certain model classes.

Contribution

It derives a novel upper bound on MDL prediction error for countable Bernoulli classes, showing improved convergence rates over previous exponential bounds.

Findings

01

Bounded total expected square loss implies convergence with probability one.

02

New upper bounds on prediction error for countable Bernoulli classes.

03

Results applicable to classification and hypothesis testing tasks.

Abstract

We consider the Minimum Description Length principle for online sequence prediction. If the underlying model class is discrete, then the total expected square loss is a particularly interesting performance measure: (a) this quantity is bounded, implying convergence with probability one, and (b) it additionally specifies a `rate of convergence'. Generally, for MDL only exponential loss bounds hold, as opposed to the linear bounds for a Bayes mixture. We show that this is even the case if the model class contains only Bernoulli distributions. We derive a new upper bound on the prediction error for countable Bernoulli classes. This implies a small bound (comparable to the one for Bayes mixtures) for certain important model classes. The results apply to many Machine Learning tasks including classification and hypothesis testing. We provide arguments that our theorems generalize to countable…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputability, Logic, AI Algorithms · Machine Learning and Algorithms · Algorithms and Data Compression