Freezing and Sleeping: Tracking Experts that Learn by Evolving Past   Posteriors

Wouter M. Koolen; Tim van Erven

arXiv:1008.4654·cs.LG·August 30, 2010·2 cites

Freezing and Sleeping: Tracking Experts that Learn by Evolving Past Posteriors

Wouter M. Koolen, Tim van Erven

PDF

Open Access

TL;DR

This paper generalizes the mixing past posteriors algorithm to handle experts with internal learning capabilities, specifically within hidden Markov models, introducing two schemes—freezing and sleeping—and providing efficient strategies with loss bounds.

Contribution

It extends MPP to experts that learn from data, addressing whether they learn from all data or just tracked subsequences, using hidden Markov models.

Findings

01

Efficient prediction strategies for freezing and sleeping schemes.

02

Proven loss bounds for both schemes.

03

Applicable to structured experts modeled by hidden Markov models.

Abstract

A problem posed by Freund is how to efficiently track a small pool of experts out of a much larger set. This problem was solved when Bousquet and Warmuth introduced their mixing past posteriors (MPP) algorithm in 2001. In Freund's problem the experts would normally be considered black boxes. However, in this paper we re-examine Freund's problem in case the experts have internal structure that enables them to learn. In this case the problem has two possible interpretations: should the experts learn from all data or only from the subsequence on which they are being tracked? The MPP algorithm solves the first case. Our contribution is to generalise MPP to address the second option. The results we obtain apply to any expert structure that can be formalised using (expert) hidden Markov models. Curiously enough, for our interpretation there are \emph{two} natural reference schemes: freezing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Optimization and Search Problems