Switching between Hidden Markov Models using Fixed Share

Wouter M. Koolen; Tim van Erven

arXiv:1008.4532·cs.LG·August 27, 2010·1 cites

Switching between Hidden Markov Models using Fixed Share

Wouter M. Koolen, Tim van Erven

PDF

Open Access

TL;DR

This paper explores how to adapt the fixed-share algorithm for switching between experts modeled as Hidden Markov Models, enabling efficient segmentation-based learning in changing data environments.

Contribution

It introduces a method to efficiently switch between HMM experts in the fixed-share framework without the typical slowdown, addressing learning within segments.

Findings

01

Fixed-share can be extended to HMM experts without slowdown.

02

The approach improves prediction in non-stationary data environments.

03

The method maintains small regret compared to segment-specific best experts.

Abstract

In prediction with expert advice the goal is to design online prediction algorithms that achieve small regret (additional loss on the whole data) compared to a reference scheme. In the simplest such scheme one compares to the loss of the best expert in hindsight. A more ambitious goal is to split the data into segments and compare to the best expert on each segment. This is appropriate if the nature of the data changes between segments. The standard fixed-share algorithm is fast and achieves small regret compared to this scheme. Fixed share treats the experts as black boxes: there are no assumptions about how they generate their predictions. But if the experts are learning, the following question arises: should the experts learn from all data or only from data in their own segment? The original algorithm naturally addresses the first case. Here we consider the second option, which is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Reinforcement Learning in Robotics