Convergence and Loss Bounds for Bayesian Sequence Prediction
Marcus Hutter

TL;DR
This paper analyzes Bayesian sequence prediction, establishing convergence of the mixture posterior to the true distribution, providing bounds on prediction loss, and demonstrating the effectiveness of Bayesian methods in unknown sequence models.
Contribution
It introduces a new elementary derivation of convergence rates for Bayesian mixture predictors and bounds their prediction loss relative to the true distribution.
Findings
Convergence of the mixture posterior to the true posterior is established.
Prediction loss of the Bayesian mixture predictor is bounded and asymptotically optimal.
The paper provides convergence rates and bounds without assumptions on loss structure.
Abstract
The probability of observing at time , given past observations can be computed with Bayes' rule if the true generating distribution of the sequences is known. If is unknown, but known to belong to a class one can base ones prediction on the Bayes mix defined as a weighted sum of distributions . Various convergence results of the mixture posterior to the true posterior are presented. In particular a new (elementary) derivation of the convergence is provided, which additionally gives the rate of convergence. A general sequence predictor is allowed to choose an action based on and receives loss if is the next symbol of the sequence. No assumptions are made on the structure of (apart from being bounded) and . The Bayes-optimal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
