Adaptive and Efficient Algorithms for Tracking the Best Expert

Shiyin Lu; Lijun Zhang

arXiv:1909.02187·cs.LG·February 11, 2020·1 cites

Adaptive and Efficient Algorithms for Tracking the Best Expert

Shiyin Lu, Lijun Zhang

PDF

Open Access

TL;DR

This paper introduces two adaptive algorithms for prediction with expert advice in dynamic environments, achieving improved data-dependent tracking regret bounds and extending to online matrix prediction.

Contribution

The paper presents novel adaptive algorithms with data-dependent bounds for tracking regret, including the first for online matrix prediction.

Findings

01

Second-order tracking regret bound achieved

02

Path-length bound for slowly moving environments

03

Extension to online matrix prediction with data-dependent bounds

Abstract

In this paper, we consider the problem of prediction with expert advice in dynamic environments. We choose tracking regret as the performance metric and develop two adaptive and efficient algorithms with data-dependent tracking regret bounds. The first algorithm achieves a second-order tracking regret bound, which improves existing first-order bounds. The second algorithm enjoys a path-length bound, which is generally not comparable to the second-order bound but offers advantages in slowly moving environments. Both algorithms are developed under the online mirror descent framework and draw inspiration from existing algorithms that attain data-dependent bounds of static regret. The key idea is to use a clipped simplex in the updating step of online mirror descent. Finally, we extend our algorithms and analysis to online matrix prediction and provide the first data-dependent tracking…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Gaussian Processes and Bayesian Inference · Reinforcement Learning in Robotics